Michio Kaku

Strings, Conformal Fields, and M-Theory

Second Edition

Springer


Graduate Texts in Contemporary Physics 


Series Editors: 

R. Stephen Berry 
Joseph L. Birman 
Jeffrey W. Lynn 
Mark P. Silverman 
H. Eugene Stanley 
Mikhail Voloshin 


Springer Science+Business Media, LLC 



Graduate Texts in Contemporary Physics 


S.T. Ali, J.P. Antoine, and J.P. Gazeau: Coherent States and Their 
Generalizations: A Mathematical Overview 

A. Auerbach: Interacting Electrons and Quantum Magnetism 

B. Felsager: Geometry, Particles, and Fields 

P. Di Francesco, P. Mathieu, and D. Senechal: Conformal Field Theories 

J.H. Hinken: Superconductor Electronics: Fundamentals and 
Microwave Applications 

J. Hladik: Spinors in Physics 

Yu.M. Ivanchenko and A.A. Lisyansky: Physics of Critical Fluctuations 
M. Kaku: Introduction to Superstrings and M-Theory, 2nd Edition 
M. Kaku: Strings, Conformal Fields, and M-Theory, 2nd Edition 
H.V. Klapdor (ed.): Neutrinos 

J.W. Lynn (ed.): High-Temperature Superconductivity 

H.J. Metcalf and P. van der Straten: Laser Cooling and Trapping 

R.N. Mohapatra: Unification and Supersymmetry: The Frontiers of 
Quark-Lepton Physics, 2nd Edition 

H. Oberhummer: Nuclei in the Cosmos 

G.D. Phillies: Elementary Lectures in Statistical Mechanics 

R.E. Prange and S.M. Girvin (eds.): The Quantum Hall Effect 

B.M. Smirnov: Clusters and Small Particles: In Gases and Plasmas 

F.T. Vasko and A.V. Kuznetsov: Electronic States and Optical 
Transitions in Semiconductor Heterostructures 

A.M. Zagoskin: Quantum Theory of Many-Body Systems: Techniques and 
Applications 






With 48 Illustrations 





Michio Kaku
Department of Physics
City College of the City University of New York
New York, NY 10031
USA

kaku@scisun.sci.ccny.cuny.edu


Series Editors 
R. Stephen Berry 
Department of Chemistry 
University of Chicago 
Chicago, IL 60637 
USA 

Mark P. Silverman 
Department of Physics 
Trinity College 
Hartford, CT 06106 
USA 


Joseph L. Birman 
Department of Physics 
City College of CUNY 
New York, NY 10031 
USA 

H. Eugene Stanley 
Center for Polymer Studies 
Physics Department 
Boston University 
Boston, MA 02215 
USA 


Jeffrey W. Lynn 
Department of Physics 
University of Maryland 
College Park, MD 20742 
USA 

Mikhail Voloshin 
Theoretical Physics Institute 
Tate Laboratory of Physics 
University of Minnesota 
Minneapolis, MN 55455 
USA 


Library of Congress Cataloging-in-Publication Data 
Kaku, Michio. 

Strings, conformal fields, and M-theory / Michio Kaku.—2nd ed. 

p. cm. — (Graduate texts in contemporary physics) 
Includes bibliographical references and index. 

ISBN 978-1-4612-6792-8 ISBN 978-1-4612-0503-6 (eBook) 
DOI 10.1007/978-1-4612-0503-6 

1. String models. 2. Conformal invariants. 3. Quantum field 
theory. I. Title. II. Series. 

QC794.6.S85K355 1999 

530.14—dc21 99-16033 

Printed on acid-free paper. 


© 2000, 1991 Springer Science+Business Media New York 

Originally published by Springer-Verlag New York Berlin Heidelberg in 2000 

Softcover reprint of the hardcover 2nd edition 2000 

All rights reserved. This work may not be translated or copied in whole or in part without the 
written permission of the publisher Springer Science+Business Media, LLC, except for brief 
excerpts in connection with reviews or scholarly analysis. Use in connection with any form of 
information storage and retrieval, electronic adaptation, computer software, or by similar or 
dissimilar methodology now known or hereafter developed is forbidden. The use of general 
descriptive names, trade names, trademarks, etc., in this publication, even if the former are not 
especially identified, is not to be taken as a sign that such names, as understood by the Trade 
Marks and Merchandise Marks Act, may accordingly be used freely by anyone. 

Production managed by A. Orrantia; manufacturing supervised by Jerome Basma. 

Photocomposed copy prepared in LaTeX by The Bartlett Press, Inc., Marietta, GA.

987654321 





This book is dedicated to my parents. 



Preface 


String theory (and its latest incarnation, M-theory), because it is the leading
candidate for a theory of all fundamental physical forces, has advanced at an
astonishing rate in the last few years. Accordingly, the purpose of this book is
to acquaint the reader with the most active topics of research in string theory. 
After reading this book, a student will hopefully understand the main areas 
of current progress in string theory, and may be able to engage directly in 
research. The primary focus, therefore, is to place the reader at the forefront 
of current string research. 

This book is complementary to my previous book, Introduction to
Superstrings and M-theory, which gives the reader a firm foundation in the
fundamentals of string theory. The present book contains new material not
covered there, such as the classification of conformal field theories, the
nonpolynomial closed string field theory, the matrix models, and topological
field theory.

Then the book discusses at length all the latest discoveries in nonperturbative 
string theory. It discusses D-branes, BPS states, S-T-U dualities, anti-de Sitter 
spaces, black holes, solitons, and M-theory. In the 11th dimension, in fact, 
we may finally have a unification of all five superstring theories into a single, 
comprehensive theory. 

These new developments, for the first time, allow us to probe the previously
hidden nonperturbative region of string theory, and bring us closer to the
ultimate goal: to find the true vacuum of string theory.

Although it would be helpful to read Introduction to Superstrings and
M-theory, it is not absolutely necessary. The overlap between this book and the
previous one is minimal, but in some chapters, such as One and Nine, I have
reviewed the necessary background material so that this book would
be self-contained. However, readers are encouraged to consult the previous
book for its appendix, which contains a brief introduction to group theory,
supergravity, supersymmetry, the theory of forms, and general relativity.

Strings, Conformal Fields, and M-theory will be a success if it conveys some 
of the vitality and vigor of current activity in string theory to the reader and 
prepares him or her for research. 


Michio Kaku 



Acknowledgments 


I would like to thank the Institute for Advanced Study at Princeton, where
this book was written, for its hospitality. I would especially like to thank Drs.
E. Witten and S. Adler for inviting me to come to the Institute.

I would like to thank Dr. B. Sakita and Dr. J. Birman and the faculty of 
the City College of New York for their constant encouragement and support. 
I would like to acknowledge support from the National Science Foundation, 
Department of Energy, and CUNY-FRAP. 

I would like to thank L. Alvarez Gaume, who once again has given detailed
and productive comments throughout the entire draft. Both the presentation and
content of this book have greatly benefited from his insightful comments. I would
also like to thank D. Karabali, O. Lechtenfeld, L. Hua, P. Huet, B. Grossman, 
and A. Jevicki for reading various chapters of this book and making many 
valuable comments. 



Contents

Preface
Acknowledgments

I Conformal Field Theory and Perturbation Theory

1 Introduction to Superstrings
1.1 Quantizing the Relativistic String
1.2 Scattering Amplitudes
1.3 Supersymmetry
1.4 2D SUSY Versus 10D SUSY
1.5 Types of Strings
1.6 Summary

2 BPZ Bootstrap and Minimal Models
2.1 Conformal Symmetry in D Dimensions
2.2 Conformal Group in Two Dimensions
2.3 Representations of the Conformal Group
2.4 Fusion Rules and Correlation Functions
2.5 Minimal Models
2.6 Fusion Rules for Minimal Models
2.7 Superconformal Minimal Series
2.8 Summary

3 WZW Model, Cosets, and Rational Conformal Field Theory
3.1 Compactification and the WZW Model
3.2 Frenkel-Kac Construction
3.3 GKO Coset Construction
3.4 Conformal and Current Blocks
3.5 Racah Coefficients for Rational Conformal Field Theory
3.6 Summary

4 Modular Invariance and the A-D-E Classification
4.1 Dehn Twists
4.2 Free Fermion and Boson Characters
4.3 GSO and Supersymmetry
4.4 Minimal Model Characters
4.5 Affine Characters
4.6 A-D-E Classification
4.7 Higher Invariants and Simple Currents
4.8 Diagonalizing the Fusion Rules
4.9 RCFT: Finite Number of Primary Fields
4.10 Summary

5 N = 2 SUSY and Parafermions
5.1 Calabi-Yau Manifolds
5.2 N = 2 Superconformal Symmetry
5.3 N = 2 Minimal Series
5.4 N = 2 Minimal Models and Calabi-Yau Manifolds
5.5 Parafermions
5.6 Supersymmetric Coset Construction
5.7 Hermitian Spaces
5.8 Summary

6 Yang-Baxter Relation
6.1 Statistical Mechanics and Critical Exponents
6.2 One-Dimensional Ising Model
6.3 Two-Dimensional Ising Model
6.4 RSOS and Other Models
6.5 Yang-Baxter Relation
6.6 Solitons and the Yang-Baxter Equation
6.7 Summary

7 Toward a Classification of Conformal Field Theories
7.1 Feigin-Fuchs Free Fields
7.2 Free Field Realizations of Coset Theories
7.3 Landau-Ginzburg Potentials
7.4 N = 2 Chiral Rings
7.5 N = 2 Landau-Ginzburg and Catastrophe Theory
7.6 Zamolodchikov’s c Theorem
7.7 A-D-E Classification of c = 1 Theories
7.8 Summary

8 Knot Theory and Quantum Groups
8.1 Chern-Simons Approach to Conformal Field Theory
8.2 Elementary Knot Theory
8.3 Jones Polynomial and the Braid Group
8.4 Quantum Field Theory and Knot Invariants
8.5 Knots and Conformal Field Theory
8.6 New Knot Invariants from Physics
8.7 Knots and Quantum Groups
8.8 Hecke and Temperley-Lieb Algebras
8.9 Summary

II Nonperturbative Methods

9 String Field Theory
9.1 First Versus Second Quantization
9.2 Light Cone String Field Theory
9.3 Free BRST Action
9.4 Interacting BRST String Field Theory
9.5 Four-Point Amplitude
9.6 Superstring Field Theory
9.7 Picture Changing
9.8 Superstring Action
9.9 Summary

10 Nonpolynomial String Field Theory
10.1 Four-String Interaction
10.2 N-Sided Polyhedra
10.3 Nonpolynomial Action
10.4 Conformal Maps
10.5 Tadpoles
10.6 Summary

11 2D Gravity and Matrix Models
11.1 Exactly Solvable Strings
11.2 2D Gravity and KPZ
11.3 Matrix Models
11.4 Recursion Relations
11.5 KdV Hierarchy
11.6 Multimatrix Models
11.7 D = 1 Matrix Models
11.8 Summary

12 Topological Field Theory
12.1 Unbroken Phase of String Theory
12.2 Topology and Morse Theory
12.3 Sigma Models and Floer Theory
12.4 Cohomological Topological Field Theories
12.5 Correlation Functions
12.6 Topological Sigma Models
12.7 Topological 2D Gravity
12.8 Correlation Functions for 2D Topological Gravity
12.9 Virasoro Constraint, W-Algebras, and KP Hierarchies
12.10 Summary

13 Seiberg-Witten Theory
13.1 Introduction
13.2 Electric-Magnetic Duality
13.3 Holomorphic Potentials
13.4 N = 1 SUSY QCD
13.4.1 $N_f < N_c$
13.4.2 $N_f = N_c$
13.4.3 $N_f = N_c + 1$
13.4.4 $N_c + 2 < N_f < \frac{3}{2} N_c$
13.4.5 $\frac{3}{2} N_c < N_f < 3N_c$
13.4.6 $N_f > 3N_c$
13.4.7 $SO(N_c)$ SUSY Gauge Theory
13.5 N = 2 SUSY Gauge Theory
13.6 SU(N) N = 2 SUSY Gauge Theory
13.7 Summary

14 M-Theory and Duality
14.1 Introduction
14.2 Unifying the Five Superstring Theories
14.3 T Duality
14.4 S Duality
14.4.1 Type IIA and M-Theory
14.4.2 Type IIB
14.4.3 Type I Strings
14.5 BPS States
14.6 Supersymmetry and p-Branes
14.7 Compactification
14.8 Example: D = 6
14.8.1 D = 6, N = (2, 2) Theory
14.8.2 D = 6, N = (1, 1) Theories
14.8.3 Deletions and Fibrations
14.9 F-Theory
14.10 Summary

15 D-Branes and CFT/ADS Duality
15.1 Solitons
15.2 Supermembrane Action
15.3 5-Branes and D-Branes
15.4 D-Brane Actions
15.5 M(atrix)-Theory and Membranes
15.6 Black Holes
15.7 CFT/ADS Duality
15.8 Anti-de Sitter Space
15.9 AdS and QCD
15.10 Summary
15.11 Conclusion

Index














Part I 


Conformal Field Theory 
and Perturbation Theory 




CHAPTER 1 


Introduction to Superstrings 


During this century, two great physical theories have emerged. Each theory 
remains unchallenged within its respective domain. 

The first, quantum theory, has given us a theory of the microcosm. At
the subatomic level, quantum mechanics has unraveled the secrets of matter
and energy. Quantum mechanics has given us the language by which we
can unite three of the four fundamental forces: the strong, weak, and
electromagnetic interactions.

The second, the general theory of relativity, has given us a theory of the 
macrocosm. At the cosmic level, relativity theory has given us an unrivaled 
description of creation and cosmology. Black holes, warped space-time, and 
the expanding universe are all consequences of this theory of gravitation. 

No experimental deviation from either theory has been seen in the laboratory. 
The entire body of physical knowledge amassed by physicists over the last two 
millennia is based on these two physical theories, without exception. 

The unification of these two theories, however, has eluded some of the 
greatest thinkers of our time. Wolfgang Pauli, Werner Heisenberg, and Albert
Einstein wrestled with the problem and ultimately failed. At the center of this 
puzzle is the realization that the theories are based on different assumptions, 
obey different physical principles, and use different mathematics. Whenever 
attempts are made at merging these two theories, disastrous infinities emerge 
that render the hybrid theory meaningless. 

In particular, these two theories have fundamentally different viewpoints 
regarding the meaning of a force. Quantum theory reduces all forces to the 
exchange of tiny discrete packets of energy, called “quanta.” General relativity, 
however, considers forces to be an apparent effect, the consequence of the 
smooth distortion of space and time. 






At present, superstring theory and its latest incarnation, M-theory [1-7], has
emerged as the leading candidate for the unification of these two theories and
hence all known forces. No other theory can make the claim of unifying both
general relativity and quantum mechanics into a simple, self-consistent
formalism. Calculations carried out to the eighth-loop order, and some
calculations to all loop orders, show that the potential divergences inherent in
other quantum field theories miraculously cancel in string theory due to the
enormously powerful symmetries built into the theory. From a technical point of
view, superstring theory seems to be totally free of quantum anomalies and
divergences, which riddle all known point-particle theories of gravity and matter.
Its successes can be summarized as follows:

(1) The theory explains the origin of particles and resonances. The myriad 
particles found in nature can be viewed as the vibrations of a string, in 
much the same way that the notes found in music can be explained as the 
modes of a vibrating string. Pursuing this analogy, the basic particles of 
our world correspond to the musical notes of the superstring, the laws of 
physics correspond to the harmonies that these notes obey, and the universe 
itself corresponds to a symphony of superstrings. 

(2) The theory explains the unification of gravity with matter. The simplest 
vibration of a closed string in curved space, in fact, corresponds to a
massless spin-2 object. The gauge symmetries of string theory show that this
spin-2 particle has all the properties of the graviton. Moreover, when the 
string executes its motions, it actually forces space-time to curl up around 
it, yielding the complete set of Einstein’s equations of motion. Thus, the 
string naturally merges the two divergent pictures of a force: the modes of 
vibration are quantized, but the string can only self-consistently vibrate in 
a curved space-time consistent with Einstein’s equations of motion. 

(3) The theory explains how to remove the divergences found in quantum
gravity. In quantum field theory, point-particles interact via Feynman graphs,
which badly diverge when the graph is “pinched,” that is, when one of 
the legs of the graph shrinks to zero. When the string moves, it repeatedly 
splits and reforms, thereby tracing the topology of two-dimensional sheets 
or Riemann surfaces, such as a doughnut. However, since it is difficult 
to pinch or stretch a doughnut, one can show that the string graphs are 
actually ultraviolet finite. Thus, the topology of the string removes the 
divergences of quantum gravity. 

(4) The theory shows how to simply incorporate the symmetries of the standard 
model. In the heterotic string, for example, one set of vibrations exists in 
26 dimensions. However, since the superstring exists in 10 dimensions, 
we let 16 unwanted dimensions curl up like a ball. If we compactify on
the root lattice of a rank 16 Lie algebra, such as $E_8 \otimes E_8$, then we have a
theory that is large enough to accommodate $SU(3) \otimes SU(2) \otimes U(1)$.

(5) The latest version of superstring theory, called M-theory, is powerful
enough to unify the five superstring theories into a simple, coherent
framework in the eleventh dimension, where membranes and 5-branes
are present. Moreover, S-T-U dualities allow us to probe the previously
forbidden nonperturbative region of string theory, where new soliton-like
states exist, such as D-branes, black holes, and various membranes. The
powerful dualities, which link the weak coupling, perturbative region with
the strong coupling, nonperturbative region, may one day shed light on the
ultimate goal: to find the true vacuum of string theory.
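
The counting behind item (4) can be made explicit. The following is a short sketch that uses only standard Lie-algebra facts (the rank of $E_8$ is 8, and the rank of $SU(N)$ is $N - 1$). It checks the rank bookkeeping alone, which is a necessary condition for the embedding and not a proof of it.

```latex
\begin{align*}
26 - 10 &= 16
  && \text{(dimensions to be compactified)}, \\
\operatorname{rank}(E_8 \otimes E_8) &= 8 + 8 = 16
  && \text{(one lattice direction per compactified dimension)}, \\
\operatorname{rank}\bigl(SU(3) \otimes SU(2) \otimes U(1)\bigr) &= 2 + 1 + 1 = 4 \le 16
  && \text{(the standard model group fits by rank)}.
\end{align*}
```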

In spite of the simplicity of string theory and the success of string
phenomenology, there are, of course, several formidable obstacles, both
experimental and theoretical, that must be overcome before the theory can be
accepted.

Experimentally, unification in superstring theory takes place at the Planck
energy ($10^{19}$ GeV), and hence experimental verification of this theory remains
problematic. Even the largest particle accelerators or observations from cosmic
ray detectors and satellites will, at best, be able to probe only indirect signals
emerging from the Planck scale. One hope is that the Large Hadron Collider
(LHC) will observe supersymmetric partners of the lower lying particles and
hence give us indirect clues to the superstring itself. However, to probe energies
at the Planck scale is impossible with conceivable technology.
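
To give a sense of the scales involved, here is a minimal sketch that computes the Planck energy and the Planck length from the fundamental constants. The formulas $E_P = \sqrt{\hbar c^5 / G}$ and $\ell_P = \sqrt{\hbar G / c^3}$ are the standard definitions. The numerical constants are rounded CODATA values, and the function names are illustrative, not taken from the text.

```python
import math

# Physical constants in SI units (rounded CODATA values; assumed inputs).
HBAR = 1.054571817e-34            # reduced Planck constant, J s
C = 2.99792458e8                  # speed of light, m / s
G = 6.67430e-11                   # Newton's constant, m^3 / (kg s^2)
JOULES_PER_GEV = 1.602176634e-10  # energy of 1 GeV expressed in joules


def planck_energy_gev() -> float:
    """Planck energy E_P = sqrt(hbar * c^5 / G), converted to GeV."""
    energy_joules = math.sqrt(HBAR * C**5 / G)
    return energy_joules / JOULES_PER_GEV


def planck_length_m() -> float:
    """Planck length l_P = sqrt(hbar * G / c^3), in meters."""
    return math.sqrt(HBAR * G / C**3)


if __name__ == "__main__":
    print(f"Planck energy: {planck_energy_gev():.3e} GeV")
    print(f"Planck length: {planck_length_m():.3e} m")
```

The energy comes out at roughly $10^{19}$ GeV and the length at roughly $10^{-35}$ m, which is why the same scale can be quoted either as an energy or as a length.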

The origin of this problem is that superstring theory (or any theory that 
claims to unite all four forces) is inherently a theory of creation, where all four 
forces were united into a single superforce. Thus, to experimentally verify 
superstring theory means recreating creation in the laboratory. 

Although this seemingly insurmountable experimental barrier is usually
cited as the chief criticism of superstring theory, our philosophy is very
different. We feel that the main problem facing superstring theory is theoretical,
not experimental. If we could fathom the principles underlying the theory, then
we would be able to calculate the physics of our low-energy world, and settle,
once and for all, whether superstring theory is correct or not. Since the
basic equations of superstring theory (at least perturbatively) are well known, it
means that we are not clever enough, with our limited mathematical skills, to
solve the theory.

If superstring theory, as some have claimed, is twenty-first-century physics
that fell accidentally into the twentieth century, then the fundamental problem
facing us is that twenty-first-century mathematics powerful enough to solve
twenty-first-century physics has not been discovered.

As Richard Feynman once said, one of the goals of a theoretical physicist is 
to “prove yourself wrong as soon as possible.” Thus, the real problem facing 
us, in our opinion, is to theoretically settle the following question as quickly as 
possible: what is the true vacuum (ground state) of superstring theory? Since 
the ground state should correspond to our physical universe, if the true vacuum 
could be discovered, we might be able to decisively settle whether superstring 
theory is a theory of the universe or just the latest in a series of failed efforts 
to discover the Holy Grail of physics, the unified field theory. 





Once the true ground state of the superstring theory is found, we should be 
able to calculate the quantum numbers and physical properties of the various 
particles found in nature and compare them with actual experimental data. In 
theory, starting from the superstring action and extracting its ground state, we 
should be able to calculate all the basic parameters of the universe from first 
principles. 

The search for the true vacuum of string theory is therefore the central 
theme of this book. It is the theme that binds the various chapters together and 
determines the structure of this book. It is also the theme that drives almost all 
current research in superstring theory. 

As a consequence, there are two parts to this book: 

(1) perturbation theory and conformal field theory; and 

(2) nonperturbative methods. 

In the first part, we focus on the perturbative vacuum, which can be described 
using conformal field theory. The basic strategy for the first part of this book is 
therefore to classify all possible conformal field theories. Although a complete 
classification of conformal field theories that reveals the deeper relationship 
between perturbative vacuums still lies beyond our grasp, we will review the 
enormous progress made in this direction. 

One of the successes of the conformal field theory approach is that, with
only a few mild constraints, one finds reasonably acceptable candidates for
practical phenomenology. For example, the heterotic string contains the gauge
group $E_8 \otimes E_8$. By making a few reasonable assumptions about its broken
phase, it is possible to break this group down to $E_8 \otimes E_6$ and finally to $E_6$,
which contains the standard model’s gauge group, $SU(3) \otimes SU(2) \otimes U(1)$.
The basic fermion multiplet naturally occurs in the 27 multiplet of $E_6$, which
is consistent with known grand unified theory (GUT) phenomenology.

Thus, it is surprising that, with very few minimal assumptions, we are
naturally led to the following symmetry-breaking scheme

$$E_8 \otimes E_8 \rightarrow E_8 \otimes E_6 \rightarrow E_6 \rightarrow SU(3) \otimes SU(2) \otimes U(1). \tag{1.0.1}$$

Not only is the theory internally self-consistent after unifying quantum
mechanics with general relativity, the theory also goes far beyond the usual GUT
description of quarks and leptons. The gauge hierarchy problem, for example,
which is impossible to solve within the framework of the standard GUT,
is naturally explained by the supersymmetry (SUSY) of the superstring.
Furthermore, the generation problem, which has plagued all GUTs, is naturally
explained in string theory via topological arguments. For example, it is now
relatively easy to construct phenomenological models that contain only the
three experimentally observed fermion generations.

In spite of the early successes of the conformal field theory approach to the 
perturbative vacuum, there are several fundamental flaws with this approach 
that force us to study nonperturbative effects. 





(1) Millions of conformal field theories have been discovered, each one
representing a possible (perturbative) ground state of the theory. Which one
is correct? No single scheme has been found that can classify them in
a coherent fashion. In Chapter 7, we will study some of the more powerful
classification schemes so far proposed, but all fall far short of an
all-embracing classification scheme. This, in turn, has made it difficult, if
not impossible, to decide which of the millions of conformal field theories
describes our known universe.

(2) String phenomenology, although surprisingly successful, suffers from a
fatal defect. We can show to all orders in perturbation theory that
supersymmetry is unbroken. However, our low-energy world does not manifest
supersymmetry, which must therefore be dynamically broken. The low-energy
spectrum of string theory thus cannot give us a correct description
of the physical spectrum of particles. For example, it includes massless
particles, such as the dilaton, which have not been seen in nature.

(3) More important, 10-dimensional space-time also seems perfectly stable 
in perturbation theory. At present, the breaking of 10-dimensional space 
down to four dimensions is purely an ad hoc assumption that is built into 
conformal field theory from the very beginning, not a consequence of string 
dynamics. Until this fundamental problem is solved nonperturbatively, all 
the predictions of perturbative string theory are suspect. 

(4) Conformal field theories may also be unstable at the Planck energy level.
Studies of the higher loops of bosonic theory show that the theory is not
Borel summable, and hence, the theory is not stable when expanded in a
power series around a conformal field theory. [We should note
that quantum electrodynamics (QED), quantum chromodynamics (QCD),
and gauge theories in general are also not Borel summable, but we implicitly
assume that they can be embedded into a higher theory that is finite.
However, superstring theory, making the claim of being the final theory,
cannot therefore be embedded into a higher theory.] This instability seems
to be independent of the existence of the tachyon and hence may persist
even in the entire superstring theory.

If perturbation theory is flawed because it fails to precisely describe our 
low-energy universe and is unstable, then we must resort to nonperturbative 
calculations, which takes us to the second part of this book. 

Although nonperturbative calculations are notoriously difficult to perform, 
even in point-particle theories, in the second part of this book we review what 
is known about nonperturbative approaches to string theory. Although most 
superstring research is presently confined to perturbation theory, the future of 
superstring research in the coming years may lie in the nonperturbative realm. 

We review all the major approaches to nonperturbative string theory. String
field theory is a difficult but promising formalism. String field theory
successfully unifies all known information about string theory in a simple, second
quantized action, rather than appealing to folklore and rules of thumb.





String field theory, like other nonperturbative approaches, is still too difficult
to solve. Systems with an infinite number of degrees of freedom have
traditionally been difficult to solve, and string field theory is no exception. Recent
research on much simpler “toy” models, with finite degrees of freedom, has
enjoyed some success.

Some of the most important systems with finite degrees of freedom are 
topological models. In fact, topology will prove to be one of the most powerful 
tools at our disposal to calculate the properties of the vacuum of string theory.
Topology enters into string theory in at least three ways. 

First, topological arguments are crucial in determining the phenomenological
predictions of the theory once we have compactified the theory down to four
dimensions. In particular, topological invariants defined on six-dimensional
manifolds are the key ingredients necessary to solve for the properties of
nature. For example, the Euler number is related to the generation number found
in GUTs.

Second, topology is useful in classifying the possible vacuums of the theory, 
that is, the various conformal field theories. In particular, knot theory, which 
until recently had no practical application in theoretical physics, is now known 
to be crucial in classifying the so-called rational conformal field theories. We 
will find that the “Clebsch-Gordan coefficients” for the conformal group can 
be determined by the braiding operations performed on knots, and that we 
can calculate the known knot invariants and even infinite classes of new ones 
directly from physics. 

Third, topology has played a key role in solving string theory in D < 1. 
Certain topological field theories, in fact, can be shown to be equivalent to the 
matrix model formulation of string theory in low dimensions, which gives us a 
complete nonperturbative solution to the Green’s functions to all orders in the 
genus. In these D < 1 models, the theory has only a finite number of degrees of 
freedom, and hence topological arguments can be used to solve them exactly. 

However, of all the nonperturbative formalisms that have been proposed, the 
only one which successfully penetrates into the nonperturbative realm in four 
dimensions and beyond is M-theory. Not only can M-theory combine all five 
superstring theories into a single theory, it can shed light on the nonperturbative 
region of string theory via duality. Dual relations can be written between string 
theories in the weak and strong coupling region, which also reveal the existence 
of a new class of previously neglected soliton-like objects, called p-branes. 
Unlike ordinary point-particle field theories in four dimensions, which are 
probably too complicated to solve analytically, supersymmetric field theories 
are sometimes exactly soluble, with dualities which connect the perturbative 
and nonperturbative region. This may be the key to ultimately finding the true 
vacuum of string theory. 

We begin the first part of this book with a discussion of conformal field
theory, which has emerged as a powerful tool by which to probe the perturbative
phenomenology of the theory. It is conjectured that each conformal field theory
corresponds to a possible string vacuum. Thus, by classifying all possible
conformal field theories, we hope to exhaust all possible vacuums allowed by
superstring theory. Perhaps one of them describes the physical universe.

This chapter is self-contained and will hopefully serve as a brief introduction
to superstring theory. Students familiar with elementary string theory
may skip this chapter. However, the reader is encouraged to consult
Introduction to Superstrings and M-Theory for certain concepts that are explained
there at greater length and depth.


1.1 Quantizing the Relativistic String 

To begin our discussion of string theory and conformal field theory, let us 
study the first quantized action of the relativistic string. If a point-particle 
moves in space-time, it sweeps out a one-dimensional world line. Likewise, 
as a string moves in space-time, it sweeps out a two-dimensional world sheet. 
Let $X_\mu(\sigma, \tau)$ represent a vector defined in D-dimensional space-time that
begins at the origin of our coordinate system and ends at some point along
the two-dimensional string world sheet, labeled by $\{\sigma, \tau\}$. Let $\eta_{\mu\nu} =
(-, +, +, +, \cdots)$ be the flat metric in D-dimensional space-time, where $\mu =
0, 1, 2, \ldots, D - 1$. [We will use Roman letters ($a, b, c, \ldots$) to represent
two-dimensional world sheet indices, and Greek letters ($\mu, \nu, \ldots$) to represent
D-dimensional space-time indices.]

Let us take our action to be the area of the world sheet swept out by 
the string [8-9]. Because the area of the world sheet is independent of the 
two-dimensional coordinates used to measure the area, it must be a generally 
covariant scalar in two dimensions. 

There are several ways in which we can write the area of the string world 
sheet. The simplest is to introduce the tensor $g_{ab}$, which represents a
two-dimensional metric defined on the surface.

Our action can be written as [10] 

$$S = -\frac{1}{4\pi\alpha'} \int d^2\xi \, \sqrt{g}\, g^{ab}\, \partial_a X_\mu \, \partial_b X_\nu \, \eta^{\mu\nu}, \tag{1.1.1}$$

where $\alpha' = \frac{1}{2}$ for open strings and $\alpha' = \frac{1}{4}$ for closed strings. The action is
manifestly reparametrization invariant. If we reparametrize the two-dimensional
world sheet according to

$$\sigma \to \tilde\sigma(\sigma, \tau), \qquad \tau \to \tilde\tau(\sigma, \tau), \tag{1.1.2}$$

then the action is invariant under this two-dimensional general coordinate
transformation if

$$g_{ab}(x) = \frac{\partial \tilde x^c}{\partial x^a} \frac{\partial \tilde x^d}{\partial x^b}\, \tilde g_{cd}(\tilde x), \tag{1.1.3}$$

where $x^a = \{\sigma, \tau\}$. Under this transformation, the action is manifestly
coordinate invariant (because the transformation of $\sqrt{g}$ cancels against the
transformation of the two-dimensional measure).

If we take an infinitesimal transformation, then the transformation of the 
fields becomes 

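$$\delta g^{ab} = \epsilon^c \partial_c g^{ab} - g^{ac} \partial_c \epsilon^b - g^{bc} \partial_c \epsilon^a, \qquad \delta X^\mu = \epsilon^a \partial_a X^\mu. \tag{1.1.4}$$

The action is also trivially invariant under local scale transformations

$$g_{ab} \to e^{\phi} g_{ab}. \tag{1.1.5}$$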

(When we quantize the system, we will find that the classical scale invariance 
of the system actually breaks down, and the metric tensor obeys the equations 
of Liouville theory. This conformal anomaly disappears in 26 dimensions for 
bosonic strings and in 10 dimensions for the superstring. In later chapters, 
we will discuss how to quantize the two-dimensional gravitational system in 
dimensions other than 10 and 26, where the Liouville mode plays a crucial 
role.) 

The two-dimensional metric in the action does not have any derivatives, and 
hence we may classically eliminate it via its equations of motion. Then, we 
find 


$$g_{ab} \sim \partial_a X_\mu \, \partial_b X^\mu. \tag{1.1.6}$$

Reinserting this value of the metric tensor back into the action, we find the 
Nambu-Goto action, a nonlinear action written totally in terms of the string 
variable [8, 9] 

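$$S = -\frac{1}{2\pi\alpha'} \int d\sigma \, d\tau \, \sqrt{(\dot X_\mu X'^\mu)^2 - \dot X_\mu^2 \, X'^2_\nu}, \tag{1.1.7}$$

where $\dot X_\mu$ equals $\partial_\tau X_\mu$ and $X'_\mu$ equals $\partial_\sigma X_\mu$. Notice that the above expression
is proportional to the area of the world sheet, given by the integral of $\sqrt{\det g_{ab}}$.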

The action is now written as the determinant of the metric tensor of the world 
sheet, which shows that the action is proportional to the two-dimensional area 
swept out by the string. It is remarkable that string theory, which provides 
a comprehensive scheme in which to unite general relativity with quantum 
mechanics and all known physical forces, begins with this simple statement: 
the action is proportional to the area of the string world sheet. 
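
The elimination of the metric can be checked numerically at a single world-sheet point. The following Python sketch is only an illustration under stated assumptions: the two tangent vectors are arbitrary sample values (a timelike $\dot X$ and a spacelike $X'$ in four dimensions), and the function names are hypothetical. It evaluates the integrand $\sqrt{|g|}\, g^{ab} \partial_a X_\mu \partial_b X^\mu$ with $g_{ab}$ set equal to the induced metric and compares it with the Nambu-Goto area element.

```python
import math


def minkowski_dot(u, v):
    """Inner product with the mostly-plus flat metric (-, +, +, ...)."""
    return -u[0] * v[0] + sum(a * b for a, b in zip(u[1:], v[1:]))


def induced_metric(x_tau, x_sigma):
    """2x2 induced metric h_ab = d_a X . d_b X built from the tangent vectors."""
    return [
        [minkowski_dot(x_tau, x_tau), minkowski_dot(x_tau, x_sigma)],
        [minkowski_dot(x_sigma, x_tau), minkowski_dot(x_sigma, x_sigma)],
    ]


def nambu_goto_density(x_tau, x_sigma):
    """Area element sqrt((Xdot.X')^2 - Xdot^2 X'^2), which equals sqrt(-det h)."""
    h = induced_metric(x_tau, x_sigma)
    return math.sqrt(h[0][1] ** 2 - h[0][0] * h[1][1])


def polyakov_density(x_tau, x_sigma):
    """sqrt(|det g|) g^{ab} h_ab evaluated with g_ab = h_ab."""
    h = induced_metric(x_tau, x_sigma)
    det = h[0][0] * h[1][1] - h[0][1] * h[1][0]
    inverse = [
        [h[1][1] / det, -h[0][1] / det],
        [-h[1][0] / det, h[0][0] / det],
    ]
    trace = sum(inverse[a][b] * h[a][b] for a in range(2) for b in range(2))
    return math.sqrt(abs(det)) * trace


# Illustrative tangent vectors (assumed values, not taken from the text).
X_TAU = (1.0, 0.2, 0.0, 0.0)     # timelike
X_SIGMA = (0.1, 0.0, 1.0, 0.3)   # spacelike
```

Because $g^{ab} g_{ab} = 2$ in two dimensions, the metric form of the integrand is exactly twice the area element, so the two actions differ only in the normalization of their prefactors.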

In order to extract the physical predictions of the theory, we must first
quantize the theory and extract its physical states. The quantization of the string
action is nontrivial, however, because the system has a powerful symmetry and
is hence highly redundant in its degrees of freedom.

Over the years, three equivalent quantization schemes have been developed:
(1) Gupta-Bleuler quantization; (2) light cone quantization; and (3)
Becchi-Rouet-Stora-Tyutin (BRST) quantization. Each method has its own
advantages and disadvantages.





Gupta-Bleuler Quantization 

Because the action possesses local reparametrization invariance (with two
parameters) and scale invariance (with one parameter), we are allowed to impose
a total of three constraints on the metric tensor. This will break the
reparametrization invariance and will allow ghosts to propagate in the theory because
of the wrong sign of the time component of the string field in the propagator.
However, these ghost states will be eliminated by directly applying constraints
on the states of the theory.

Let us choose the conformal gauge, in which all components of the metric 
tensor are set to constants: 

$$\text{conformal gauge}: \quad g_{ab} = \delta_{ab}. \tag{1.1.8}$$

Then, our Lagrangian linearizes to the following [11]:

$$L = \frac{1}{4\pi\alpha'} \left[ (\dot X_\mu)^2 + (X'_\mu)^2 \right] = \frac{1}{\pi\alpha'}\, \partial_z X_\mu \, \partial_{\bar z} X^\mu, \tag{1.1.9}$$

where

$$z = \sigma + i\tau, \tag{1.1.10}$$

where we have performed a Wick rotation so the system is conformally
invariant, and where the equations of motion become

$$\left( \frac{\partial^2}{\partial \sigma^2} + \frac{\partial^2}{\partial \tau^2} \right) X_\mu = 0. \tag{1.1.11}$$

The equations of motion can now be trivially solved in terms of functions

$$X_\mu(\sigma \pm i\tau), \tag{1.1.12}$$

which in turn can be decomposed into cosine modes (for open strings) or into
cosine and sine modes (for closed strings).

Notice that the action, while no longer locally reparametrization invariant,
is still globally invariant under conformal transformations [11]

$$z \to f(z), \qquad \bar z \to \bar f(\bar z). \tag{1.1.13}$$

Under conformal transformations, the string transforms as

$$\delta X_\mu(z, \bar z) = \epsilon(z)\, \partial_z X_\mu + \bar\epsilon(\bar z)\, \partial_{\bar z} X_\mu. \tag{1.1.14}$$

Written in the conformal gauge, the action can be trivially quantized because 
it has been reduced to a set of free harmonic oscillators. If we define the 
canonical momentum to the string variable as 

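$$P_\mu = \frac{\delta L}{\delta \dot X^\mu}, \qquad [P_\mu(\sigma), X_\nu(\sigma')] = -i \eta_{\mu\nu}\, \delta(\sigma - \sigma'), \tag{1.1.15}$$

then we can decompose the string variable into harmonic oscillators [12]

$$X_\mu(\sigma) = x_\mu + i \sum_{n=1}^{\infty} \frac{1}{\sqrt{n}} \left( a_{n\mu} - a_{-n\mu} \right) \cos n\sigma, \qquad P_\mu(\sigma) = \frac{1}{\pi} \left[ p_\mu + \sum_{n=1}^{\infty} \sqrt{n} \left( a_{n\mu} + a_{-n\mu} \right) \cos n\sigma \right], \tag{1.1.16}$$

where the commutation relations are satisfied if

$$[a_{n\mu}, a^\dagger_{m\nu}] = \eta_{\mu\nu}\, \delta_{nm}, \qquad [p_\mu, x_\nu] = -i \eta_{\mu\nu}, \tag{1.1.17}$$

with $a_{-n\mu} \equiv a^\dagger_{n\mu}$ for $n > 0$.

Then, the spectrum of string theory is given by the eigenvalues of the
Hamiltonian:

$$H = \int_0^\pi d\sigma \left( \pi\alpha' P_\mu^2 + \frac{X'^2_\mu}{4\pi\alpha'} \right) = \sum_{n=1}^{\infty} n\, a^\dagger_{n\mu} a_n^\mu + \alpha' p_\mu^2. \tag{1.1.18}$$

Since the Hamiltonian is just the sum over an infinite set of uncoupled, free
harmonic oscillators, the spectrum consists of the sum total of all possible
products of harmonic oscillator states [12]

$$\prod_{n, \mu} \{ a^\dagger_{n\mu} \} \, |0\rangle. \tag{1.1.19}$$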

To analyze the symmetries of the system, let us calculate the energy- 
momentum tensor corresponding to the action. The energy-momentum tensor 
is defined as 

$$T_{ab} = -\frac{4\pi\alpha'}{\sqrt{g}} \frac{\delta S}{\delta g^{ab}}. \tag{1.1.20}$$

This, in turn, can be shown to equal

$$T_{ab} = \partial_a X_\mu \, \partial_b X^\mu - \frac{1}{2}\, g_{ab}\, g^{cd}\, \partial_c X_\mu \, \partial_d X^\mu. \tag{1.1.21}$$

We notice several important features of the energy-momentum tensor, that
is, it satisfies

$$\partial_b T^{ab} = 0, \qquad \mathrm{Tr}\, T_{ab} = 0. \tag{1.1.22}$$

The first statement simply states that the energy-momentum tensor is
conserved. The second statement states that it is traceless, which reflects
scale invariance.

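For us, perhaps the most important feature of the energy-momentum tensor
is that it forms an algebra that closes. Let us first define the Virasoro generators
$L_n$ as the Fourier moments of the energy-momentum tensor [13]

$$L_m = \frac{1}{4\pi\alpha'} \int_0^\pi d\sigma \left[ e^{im\sigma} (T_{00} + T_{01}) + e^{-im\sigma} (T_{00} - T_{01}) \right] = \frac{1}{8\pi\alpha'} \int_{-\pi}^{\pi} d\sigma \, e^{im\sigma} \left( \dot X_\mu + X'_\mu \right)^2 = \frac{1}{2} \sum_{n=-\infty}^{\infty} \alpha_{m-n} \cdot \alpha_n. \tag{1.1.23}$$

Here $\alpha_n = \sqrt{n}\, a_n$ and $\alpha_{-n} = \sqrt{n}\, a^\dagger_n$ for $n > 0$, and $\alpha_0 = \sqrt{2\alpha'}\, p$.
These generators, $L_n$, which will appear throughout this book, in turn form
a closed algebra, the Virasoro algebra (which generates the conformal group)

$$[L_n, L_m] = (n - m) L_{n+m} + \frac{c}{12}\, n(n+1)(n-1)\, \delta_{n,-m}, \tag{1.1.24}$$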

where c is a constant. 
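
The form of the central term is not arbitrary: it must be compatible with the Jacobi identity. The following Python sketch (a minimal check, with hypothetical function names) verifies with exact rational arithmetic that the term $\frac{c}{12} n(n+1)(n-1)\delta_{n,-m}$ satisfies the central part of the Jacobi identity for $L_n$, $L_m$, $L_p$ with $n + m + p = 0$, namely $(n-m)A(p) + (m-p)A(n) + (p-n)A(m) = 0$, where $A(n)$ denotes the coefficient of the Kronecker delta.

```python
from fractions import Fraction


def central_term(n, c=Fraction(1)):
    """Coefficient of delta_{n,-m} in [L_n, L_m]: (c/12) n (n+1)(n-1)."""
    return c * Fraction(n * (n + 1) * (n - 1), 12)


def jacobi_defect(n, m, c=Fraction(1)):
    """Central part of the Jacobi identity for L_n, L_m, L_p with p = -n - m."""
    p = -n - m
    return ((n - m) * central_term(p, c)
            + (m - p) * central_term(n, c)
            + (p - n) * central_term(m, c))


def max_jacobi_defect(limit=12, c=Fraction(1)):
    """Largest violation found for all |n|, |m| <= limit (zero if consistent)."""
    return max(abs(jacobi_defect(n, m, c))
               for n in range(-limit, limit + 1)
               for m in range(-limit, limit + 1))
```

Note also that the central term vanishes for $n = 0, \pm 1$, so the subalgebra generated by $L_0$ and $L_{\pm 1}$ is free of the anomaly.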

In the Gupta-Bleuler quantization scheme, we allow ghosts to propagate 
in the theory. However, unitarity is reestablished by applying the gauge 
constraints directly on the Fock space. Thus, we apply 


$$L_n |R\rangle = 0, \quad n > 0, \qquad (L_0 - 1) |R\rangle = 0, \tag{1.1.25}$$

where the second condition is the mass-shell condition. The solution of these
constraints is the set of real states $|R\rangle$ of the theory. (The actual proof, however,
that these conditions are sufficient to eliminate all ghost states is quite involved
[14, 15]. The proof shows that string theory is ghost-free if the dimension of
space-time is 26 and the intercept, or the highest spin of the massless sector
of the theory, is equal to 1 for open strings and 2 for closed strings.)

Spurious states are those that do not couple to the real states, that is,

$$\langle S | R \rangle = 0. \tag{1.1.26}$$

This equation, in turn, implies that any spurious state contains Virasoro
generators

$$|S\rangle = L_{-k_1} L_{-k_2} \cdots L_{-k_n} |R\rangle. \tag{1.1.27}$$

The advantage of the Gupta-Bleuler method is that Lorentz covariance 
is maintained throughout. However, the proof that unphysical states are 
eliminated by the Virasoro generators is highly nontrivial. Hence, Lorentz 
covariance is manifest in the Gupta-Bleuler formalism but unitarity is not. 

We now turn to another quantization scheme where only the physical states 
are present and unitarity is manifest. 


Light Cone Quantization 

The advantage of the light cone quantization method, as in the case of ordinary
point-particle gauge theories, is that the theory is manifestly ghost-free, and
hence, only physical states propagate. All gauge constraints have been
eliminated explicitly, leaving only the physical Hilbert space of transverse modes.
The essence of the light cone quantization method lies in eliminating the
redundant longitudinal modes by gauge fixing two degrees of freedom and solving
the constraints.



We will define the light cone coordinates as

$$X^+ = \frac{1}{\sqrt{2}} \left( X^0 + X^{D-1} \right), \qquad X^- = \frac{1}{\sqrt{2}} \left( X^0 - X^{D-1} \right), \tag{1.1.28}$$

and fix the gauge as [16]:

$$X^+(\sigma, \tau) = p^+ \tau. \tag{1.1.29}$$

The momentum canonical to $X_\mu$ is $P^\mu$, whose components are not all
independent. If we take Eq. (1.1.7) as our action and then take derivatives with
respect to $\dot X_\mu$ to form $P^\mu$, we find that the resulting momenta, which are
complicated nonlinear objects, are not independent but obey the following simple
identities

$$P_\mu^2 + \frac{1}{(2\pi\alpha')^2}\, X'^2_\mu = 0, \qquad P_\mu X'^\mu = 0. \tag{1.1.30}$$

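The light cone quantization program begins by solving these constraints to
eliminate the redundant longitudinal modes. In the gauge (1.1.29), where
$P^+ = p^+/(2\pi\alpha')$, the constraints (1.1.30) give

$$P^- = \frac{\pi\alpha'}{p^+} \left( P_i^2 + \frac{X_i'^2}{(2\pi\alpha')^2} \right), \qquad X'^- = \frac{2\pi\alpha'}{p^+}\, P_i X'_i, \tag{1.1.31}$$

where $i$ runs over the $D - 2$ transverse directions.

The Hamiltonian in the light cone gauge reduces to

$$H = \int_0^\pi d\sigma \left( \pi\alpha' P_i^2 + \frac{X_i'^2}{4\pi\alpha'} \right) = \sum_{n=1}^{\infty} n\, a^\dagger_{ni} a_{ni} + \alpha' p_i^2, \tag{1.1.32}$$

so that the physical Hilbert space corresponds to the set of all transverse
harmonic oscillator states.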

The disadvantage of the light cone formalism, of course, is that Lorentz 
covariance is broken and, in fact, must be reestablished at each step of the way. 
We check Lorentz invariance by constructing the Lorentz generators 


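$$M^{\mu\nu} = \int_0^\pi d\sigma \left( X^\mu P^\nu - X^\nu P^\mu \right) = x^\mu p^\nu - x^\nu p^\mu - i \sum_{n=1}^{\infty} \frac{1}{n} \left( \alpha^\mu_{-n} \alpha^\nu_n - \alpha^\nu_{-n} \alpha^\mu_n \right). \tag{1.1.33}$$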

Checking Lorentz invariance now amounts to reinserting the constrained
values of $X^\pm$ and $P^\pm$ into the Lorentz generators. The only difficult commutator
is given by the following [16]:

$$[M^{-i}, M^{-j}] = -\frac{1}{(p^+)^2} \sum_{n=1}^{\infty} \Delta_n \left( \alpha^i_{-n} \alpha^j_n - \alpha^j_{-n} \alpha^i_n \right), \tag{1.1.34}$$

where

$$\Delta_n = \frac{n}{12} (26 - D) + \frac{1}{n} \left( \frac{D - 26}{12} + 2 - 2a \right), \tag{1.1.35}$$

where $a$ is the intercept. In order to have Lorentz invariance, we must set $\Delta_n$
equal to zero, that is,

$$D = 26, \qquad a = 1. \tag{1.1.36}$$

In the Gupta-Bleuler quantization, conformal invariance alone fixes the
dimension of space-time to 26. In the light cone case, because Lorentz
invariance and conformal invariance are now mixed nontrivially together, Lorentz
invariance fixes $D = 26$.


BRST Quantization 

The BRST quantization method [17, 18] combines the best aspects of each
of the previous quantization methods. Like the Gupta-Bleuler quantization
scheme, it is Lorentz invariant; and like the light cone quantization, it allows us to
easily extract the physical states (and the $D = 26$ constraint).

The BRST method begins with the conformal gauge, but then uses the 
Faddeev-Popov method to introduce ghost states. These ghost states allow 
us to maintain covariance throughout the calculation; the final theory remains 
unitary because these ghosts will cancel against unphysical states. 

Let us rewrite the gauge transformation of the metric as 

$$\delta g_{ab} = g_{ac}\, \partial_b \delta v^c + \partial_a \delta v^c\, g_{cb} + \delta v^c\, \partial_c g_{ab} = \nabla_a \delta v_b + \nabla_b \delta v_a. \tag{1.1.37}$$

The Faddeev-Popov determinant [19] arises because we cannot simply
insert the conformal gauge [Eq. (1.1.8)] into the functional integral, since it would
contribute an incorrect measure. In fact, the correct insertion of the conformal
gauge is given by the number one:

$$1 = \delta(g_{ab} - \delta_{ab})\, \Delta_{FP}. \tag{1.1.38}$$

This determinant $\Delta_{FP}$ can be shown to equal the determinant of the variation
of Eq. (1.1.37) with respect to $\delta v_a$. This, however, is just the determinant of
the operator $\nabla_a$.

The Faddeev-Popov determinant can thus be rewritten as

$$\Delta_{FP} = \det(\nabla_a) = \det \nabla_z \, \det \nabla_{\bar z}. \tag{1.1.39}$$

We now use the fact that any determinant can be rewritten as an exponential
integral. Normally, the functional integral over a Gaussian yields a determinant
in the denominator. Because the Faddeev-Popov determinant appears in the
numerator, rather than the denominator, we must introduce functional
integration over Grassmann variables, denoted by $b$, $c$, which are the Faddeev-Popov
ghosts.

We can rewrite the Faddeev-Popov determinant as [19]

$$\Delta_{FP} = \int Db \, D\bar b \, Dc \, D\bar c \; e^{i \int L_{bc}\, d^2 z}, \tag{1.1.40}$$

where

$$L_{bc} = \frac{1}{\pi} \left( b\, \partial_{\bar z} c + \bar b\, \partial_z \bar c \right). \tag{1.1.41}$$

This action, in turn, has an additional symmetry associated with it, called the
BRST symmetry. This symmetry is a global one, so no fields can be eliminated
through this symmetry. The generator of this global symmetry is given by [20]

$$Q = \sum_{n=-\infty}^{\infty} : c_{-n} \left( L_n^X + \tfrac{1}{2} L_n^{gh} - a\, \delta_{n,0} \right) : \; = c_0 (L_0 - a) + \sum_{n=1}^{\infty} \left( c_{-n} L_n + L_{-n} c_n \right) - \frac{1}{2} \sum_{n,m=-\infty}^{\infty} : c_{-m} c_{-n} b_{n+m} : (m - n), \tag{1.1.42}$$

where

$$\{ c_n, b_m \} = \delta_{n,-m} \tag{1.1.43}$$

and

$$Q^2 = \frac{1}{2} \sum_{m=-\infty}^{\infty} \left[ \frac{D}{12} (m^3 - m) + \frac{1}{6} (m - 13 m^3) + 2am \right] c_m c_{-m}, \tag{1.1.44}$$

which vanishes only if $D = 26$ and $a = 1$, as before. (The double dots indicate
normal ordering, that is, the creation oscillators with negative indices appear
to the left, and the annihilation oscillators with positive indices appear to the
right.)
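
The two conditions follow from requiring the bracket in Eq. (1.1.44) to vanish for every $m$, which means that the coefficients of $m^3$ and of $m$ must vanish separately. The short Python sketch below (function names are hypothetical) solves these two linear conditions exactly and then confirms that the bracket vanishes mode by mode.

```python
from fractions import Fraction


def q2_coefficient(m, D, a):
    """Bracket multiplying c_m c_{-m} in Q^2 for mode number m."""
    return (Fraction(D, 12) * (m ** 3 - m)
            + Fraction(1, 6) * (m - 13 * m ** 3)
            + 2 * a * m)


def solve_for_D_and_a():
    """Set the m^3 and m coefficients of the bracket to zero.

    m^3 term: D/12 - 13/6 = 0
    m term:  -D/12 + 1/6 + 2a = 0
    """
    D = Fraction(13, 6) * 12
    a = (D / 12 - Fraction(1, 6)) / 2
    return D, a
```

Any other choice leaves a nonzero remainder; for example, `q2_coefficient(2, 25, 1)` does not vanish.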

Although the derivation of the BRST operator $Q$ came from the conformal
gauge, its actual origin is quite general, independent of any gauge. For example,
given any Lie algebra $[\tau_a, \tau_b] = f_{ab}^{\ \ c} \tau_c$, with generators $\tau_a$, it is possible to
construct a nilpotent operator $Q$ such that $Q^2 = 0$ by introducing anticommuting
variables $b$, $c$. Notice that [21]

$$Q = \sum_{n=-\infty}^{\infty} c_{-n} \left( L_n - \tfrac{1}{2} f_{nm}^{\ \ p}\, c_{-m} b_p \right), \tag{1.1.45}$$

written here with the generators labeled $L_n$ and structure constants $f_{nm}^{\ \ p}$,
satisfies the identity $Q^2 = 0$ if the Jacobi identity is satisfied for the algebra.



The addition of the ghost states has vastly increased the Fock space of the
theory, which now consists of all possible products over the string creation
oscillators and ghost oscillators

$$\prod \{ a^\dagger_{n\mu} \} \{ b_{-m} \} \{ c_{-p} \} \, |0\rangle. \tag{1.1.46}$$

The last step in the BRST quantization program is to eliminate the unphysical
states by applying the operator $Q$ onto states

$$Q |\psi\rangle = 0. \tag{1.1.47}$$


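We do not count the states that satisfy this condition trivially, that is, those that can be
written as $|\psi\rangle = Q|\lambda\rangle$ for some $|\lambda\rangle$ (these are annihilated by $Q$ automatically
because $Q^2 = 0$).

Thus, the criterion for physical states is

$$Q |\text{physical}\rangle = 0, \qquad |\text{physical}\rangle \neq Q |\lambda\rangle. \tag{1.1.48}$$

Mathematically speaking, we say that the physical states lie within the
cohomology of the BRST operator $Q$. They are given by the kernel of the operator
$Q$, divided by the image of $Q$,

$$\frac{\{\ker Q\}}{\{\operatorname{im} Q\}}. \tag{1.1.49}$$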
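
The quotient of kernel by image can be made concrete in a finite-dimensional toy model. The Python sketch below is only an analogy, not the string BRST operator: it takes a small nilpotent matrix (an assumed example, named `TOY_Q` here) acting on column vectors and computes the dimension of its cohomology as $\dim \ker Q - \dim \operatorname{im} Q$, using exact rational Gaussian elimination for the rank.

```python
from fractions import Fraction


def matmul(a, b):
    """Product of two matrices given as lists of rows."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]


def rank(matrix):
    """Rank of a rational matrix by Gauss-Jordan elimination."""
    rows = [[Fraction(x) for x in row] for row in matrix]
    pivot_row = 0
    for col in range(len(rows[0])):
        pivot = next((r for r in range(pivot_row, len(rows))
                      if rows[r][col] != 0), None)
        if pivot is None:
            continue
        rows[pivot_row], rows[pivot] = rows[pivot], rows[pivot_row]
        for r in range(len(rows)):
            if r != pivot_row and rows[r][col] != 0:
                factor = rows[r][col] / rows[pivot_row][col]
                rows[r] = [x - factor * y
                           for x, y in zip(rows[r], rows[pivot_row])]
        pivot_row += 1
        if pivot_row == len(rows):
            break
    return pivot_row


def cohomology_dimension(q):
    """dim(ker Q) - dim(im Q) for a square matrix with Q^2 = 0."""
    if any(x != 0 for row in matmul(q, q) for x in row):
        raise ValueError("Q is not nilpotent")
    r = rank(q)
    return (len(q) - r) - r


# Toy example: Q maps e2 to e1 and annihilates e1 and e3.
TOY_Q = [[0, 1, 0],
         [0, 0, 0],
         [0, 0, 0]]
```

In this toy model the kernel is spanned by $e_1$ and $e_3$, the image by $e_1$, and the single surviving class is represented by $e_3$: the vector $e_1$ is closed but exact, which is the analogue of a state of the form $Q|\lambda\rangle$.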


1.2 Scattering Amplitudes 

This completes the discussion of the free string. In order to discuss how to
construct scattering amplitudes for strings, let us quickly review the development
of the interacting string.

String theory developed quite by accident when Veneziano [22] and Suzuki 
[23] stumbled across the Euler beta function, which seemed to satisfy all the 
properties of an S matrix (scattering matrix) for hadronic scattering. The one 
property that the beta function failed to satisfy was unitarity. 

Then, Kikkawa, Sakita, and Virasoro (KSV) [24] postulated that the beta
function should be treated as the lowest-order Born term in a Feynman-like
perturbation series involving multiloops. This conjecture was verified when Kaku,
Yu, Lovelace, and Alessandrini [25-27] actually constructed the multiloop
amplitudes by sewing together tree diagrams. The integrand of these higher-order
amplitudes was given by the solution to Laplace's equation defined on a
Riemann surface of genus g.

This perturbation series in terms of Riemann surfaces was given an elegant 
interpretation in terms of path integrals. Hsue, Sakita, and Virasoro [28] showed 
that the entire perturbation series could be written as a path integral summed 
over all conformally inequivalent Riemann surfaces of genus g (see Fig. 1.1),

$$A_N = \sum_{\text{Topologies}} \int DX_\mu \int d\mu \, \exp\left[ i \int d^2 z \, L(z) + i \sum_{i=1}^{N} k_{i,\nu} X^\nu(z_i) \right] = \sum_{\text{Topologies}} \int d\mu \left\langle \prod_{i=1}^{N} e^{i k_i \cdot X(z_i)} \right\rangle, \tag{1.2.1}$$

where $d\mu$ is a conformal measure. Because of the simplicity of the action
(which is that of a free theory), the N-point functions are all exactly calculable.

There are two direct ways in which to actually calculate the N-point function
from the functional integral. The first is to calculate the functional integral over
the complex plane, and the second is to use harmonic oscillators.

The first method performs the functional integral by shifting the $X_\mu$ variable
by a classical solution. For open strings, the region of the complex plane over
which we wish to integrate is an infinite horizontal strip, sitting on the x axis,
with a width of $\pi$. The points $z_i$, where momenta enter into the strip, are
located on the real axis. By an exponential conformal transformation, we can
map this horizontal strip to the upper half-plane. (For closed strings, the strip
will have width $2\pi$ and will be mapped to the entire complex plane. The points
$z_i$ will be located throughout the complex plane.)

Let us shift the integration variable by a solution to the classical equation

$$X_\mu \to X_{\mu,\text{classical}} + X_\mu, \tag{1.2.2}$$

where the classical solution is determined via the Green's function for
Laplace's equation on the upper half-plane

$$X_{\mu,\text{classical}} \sim \int G(z, z')\, J_\mu(z')\, d^2 z', \qquad G(z, z') = \ln |z - z'| + \ln |z - \bar z'|. \tag{1.2.3}$$



The integral is easy to perform, since it is just a Gaussian in the string
variable, so we find

$$A_N = \int \prod_{i=3}^{N-1} dz_i \prod_{2 \le i < j \le N} |z_i - z_j|^{k_i \cdot k_j}, \tag{1.2.4}$$

where we have used projective invariance SL(2, R) to fix $\infty = z_1 > z_2 =
1 > z_3 > \cdots > z_{N-1} > z_N = 0$.

The other equivalent way is to convert path integrals to harmonic oscillators
by taking vertical "time slices" along the horizontal strip. The Hamiltonian on
the conformal surface is related to $L_0$, so the propagator becomes

$$D = \int_0^\infty e^{-\tau (L_0 - 1)}\, d\tau = \frac{1}{L_0 - 1}, \tag{1.2.5}$$

while the vertex function becomes

$$V(k) = {:} e^{i k_\mu X^\mu} {:} = \exp\left( k \cdot \sum_{n=1}^{\infty} \frac{\alpha_{-n}}{n} \right) \exp\left( -k \cdot \sum_{n=1}^{\infty} \frac{\alpha_n}{n} \right). \tag{1.2.6}$$

The N-point function becomes

$$A_N = \langle 0, k_1 | V(k_2)\, D\, V(k_3) \cdots V(k_{N-1}) | 0, k_N \rangle. \tag{1.2.7}$$

When this is explicitly evaluated (using, e.g., coherent states), we find the
previous expression in Eq. (1.2.4). For N = 4, this becomes the celebrated
Veneziano formula

$$\int_0^1 dx \, x^{-\alpha(s) - 1} (1 - x)^{-\alpha(t) - 1} = \frac{\Gamma[-\alpha(s)]\, \Gamma[-\alpha(t)]}{\Gamma[-\alpha(s) - \alpha(t)]}, \tag{1.2.8}$$

where $\alpha(s) = 1 + \frac{1}{2} s$, $\alpha(t) = 1 + \frac{1}{2} t$, $s = -(k_1 + k_2)^2$, and $t = -(k_2 + k_3)^2$.

The accidental discovery of this formula in 1968 by Veneziano and Suzuki 
[22, 23], who were trying to describe the scattering matrix for hadronic 
interactions, marked the birth of what eventually became superstring theory. 
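
The equality between the integral representation and the ratio of gamma functions in Eq. (1.2.8) can be checked numerically in the region where the integral converges, that is, for $\alpha(s) < 0$ and $\alpha(t) < 0$. The Python sketch below is a rough check with hypothetical function names; it assumes the linear trajectory $\alpha(s) = 1 + \frac{1}{2}s$ and uses a simple midpoint rule, so the agreement is only approximate.

```python
import math


def alpha(s):
    """Open-string Regge trajectory, assuming slope 1/2 and intercept 1."""
    return 1.0 + 0.5 * s


def veneziano_gamma_form(alpha_s, alpha_t):
    """Right-hand side: Gamma(-a_s) Gamma(-a_t) / Gamma(-a_s - a_t)."""
    return (math.gamma(-alpha_s) * math.gamma(-alpha_t)
            / math.gamma(-alpha_s - alpha_t))


def veneziano_integral(alpha_s, alpha_t, steps=20000):
    """Midpoint-rule estimate of the left-hand side on [0, 1].

    Only meaningful when alpha_s < 0 and alpha_t < 0, where the
    integrand is integrable at both endpoints.
    """
    h = 1.0 / steps
    total = 0.0
    for k in range(steps):
        x = (k + 0.5) * h
        total += x ** (-alpha_s - 1.0) * (1.0 - x) ** (-alpha_t - 1.0)
    return total * h
```

For example, $s = -5$ and $t = -7$ give $\alpha(s) = -\frac{3}{2}$ and $\alpha(t) = -\frac{5}{2}$, and both sides can then be compared. Outside the convergence region the amplitude is defined by the gamma functions, whose poles at $\alpha(s) = 0, 1, 2, \ldots$ correspond to the resonances in the s channel.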

We should also emphasize that strings come in two types, open and closed. 
Closed strings are described in almost exactly the same terms as open strings, 
except the number of oscillators is doubled, and their amplitudes are defined 
in the entire complex plane, not just the upper half-plane. 

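We can decompose the string variable in terms of two sets of commuting
harmonic oscillators

$$X_\mu(\sigma) = x_\mu + \left( \frac{\alpha'}{2} \right)^{1/2} \sum_{n=1}^{\infty} \frac{1}{\sqrt{n}} \left( a_n e^{-in\sigma} + \tilde a_n e^{in\sigma} + a_n^\dagger e^{in\sigma} + \tilde a_n^\dagger e^{-in\sigma} \right)_\mu,$$

$$P_\mu(\sigma) = \frac{p_\mu}{2\pi} + \frac{1}{2\pi \sqrt{2\alpha'}} \sum_{n=1}^{\infty} \sqrt{n} \left( -i a_n e^{-in\sigma} - i \tilde a_n e^{in\sigma} + i a_n^\dagger e^{in\sigma} + i \tilde a_n^\dagger e^{-in\sigma} \right)_\mu, \tag{1.2.9}$$

where $X_\mu(0) = X_\mu(2\pi)$.

The Hamiltonian now also has doubled the number of oscillators

$$H = \int_0^{2\pi} d\sigma \left( \pi\alpha' P_\mu^2 + \frac{X'^2_\mu}{4\pi\alpha'} \right) = \sum_{n=1}^{\infty} \left( n\, a_n^\dagger a_n + n\, \tilde a_n^\dagger \tilde a_n \right) + \alpha' p_\mu^2. \tag{1.2.10}$$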

The important feature of the closed string spectrum is that it possesses a 
massless spin-2 particle, that is, the graviton. In fact, this is a general feature 
of string theory; the graviton and hence relativity are unavoidable parts of the 
spectrum. This, in fact, is perhaps the most attractive, and most mysterious, 
feature of string theory, that relativity is an essential part of the theory. While 
other point-particle theories try to avoid including the graviton, string theory 
views gravity as an inseparable part of its formulation. 

The spectrum, at its lowest level, now consists of the tachyon, represented
by the vacuum $|0\rangle$, and the graviton $a^{\dagger\mu}_1 \tilde a^{\dagger\nu}_1 |0\rangle$. We will, by convention, set the
slope $\alpha'$ to be $\frac{1}{2}$ and the slope of the closed strings to be $\frac{1}{4}$. This means that the
tachyon appears at $s = -8$.

The propagator for closed strings is similar to the open string propagator, 
except for one difference: there is an extra rotation factor that guarantees that 
the final result is not dependent on the origin of the parametrization. Thus, the 
propagator is 

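$$D = \frac{1}{L_0 + \tilde L_0 - 2}\, P, \tag{1.2.11}$$

where

$$P = \frac{1}{2\pi} \int_0^{2\pi} d\theta \, e^{i\theta (L_0 - \tilde L_0)}. \tag{1.2.12}$$

The propagator can be written in an equivalent way,

$$D = \frac{1}{2\pi} \int_{|z| \le 1} z^{L_0 - 2}\, \bar z^{\tilde L_0 - 2}\, d^2 z = \frac{\sin \pi (L_0 - \tilde L_0)}{\pi (L_0 - \tilde L_0)}\, \frac{1}{L_0 + \tilde L_0 - 2}. \tag{1.2.13}$$

Notice that the operator $P$ can also be interpreted as a projection operator,
which forces $L_0 - \tilde L_0$ acting on the states to be zero. Thus, the spectrum of
the closed string model is now determined by the following constraints (for
$n > 0$):

$$L_n |\phi\rangle = \tilde L_n |\phi\rangle = 0, \qquad (L_0 + \tilde L_0 - 2) |\phi\rangle = 0, \qquad (L_0 - \tilde L_0) |\phi\rangle = 0, \tag{1.2.14}$$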

where the last constraint is due to the fact that the states should be independent 
of where we chose the origin of our parametrization. 
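
The projection property of the rotation factor can be illustrated on states with definite levels. The Python sketch below assumes the normalization $P = \frac{1}{2\pi}\int_0^{2\pi} d\theta\, e^{i\theta(L_0 - \tilde L_0)}$ and replaces the operators by their integer eigenvalues on a state (called `n_left` and `n_right` here, hypothetical names). The integral is approximated by a rectangle rule, which is effectively exact for integer level differences smaller in magnitude than the number of steps.

```python
import cmath
import math


def level_matching_projector(n_left, n_right, steps=360):
    """Approximate (1/2pi) * int_0^{2pi} exp(i theta (n_left - n_right)) d theta."""
    diff = n_left - n_right
    h = 2.0 * math.pi / steps
    total = sum(cmath.exp(1j * k * h * diff) for k in range(steps))
    return total * h / (2.0 * math.pi)
```

The result is 1 when the two levels agree and 0 otherwise, so only states with equal left-moving and right-moving excitation levels propagate.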

The N-point function can now be calculated in two equivalent ways, by the
oscillator method and the functional method. The oscillator expression for the
N-point function is still of the form:

$$A_N = \sum_{\text{perm}} \langle k_1, 0 | V(k_2)\, D\, V(k_3) \cdots D\, V(k_{N-1}) | k_N, 0 \rangle, \tag{1.2.15}$$

where the new feature is that we have to sum over all permutations of the order
of the external lines (since all of them are defined on the sphere).

For the functional method, the integral for N-point functions is now defined
over the entire complex plane, rather than just the upper half-plane. Performing
the functional integral, we obtain

$$A_N = \int d\mu \prod_{2 \le i < j \le N} |z_i - z_j|^{(1/2) k_i \cdot k_j}, \tag{1.2.16}$$

where the measure is given by

$$d\mu = |z_a - z_b|^2\, |z_b - z_c|^2\, |z_c - z_a|^2\, \frac{\prod_{i=1}^{N} d^2 z_i}{d^2 z_a \, d^2 z_b \, d^2 z_c}. \tag{1.2.17}$$

Setting N = 4, we find the amplitude $A(s, t, u)$ for the Shapiro-Virasoro
model [29, 30]

$$A_4(s, t, u) = \frac{\Gamma[-\alpha(s)/2]\, \Gamma[-\alpha(t)/2]\, \Gamma[-\alpha(u)/2]}{\Gamma\{-[\alpha(s) + \alpha(t)]/2\}\, \Gamma\{-[\alpha(t) + \alpha(u)]/2\}\, \Gamma\{-[\alpha(s) + \alpha(u)]/2\}}. \tag{1.2.18}$$

In contrast to the open string case, where the amplitude $A(s, t)$ only had poles in
two channels at a time, the amplitude $A(s, t, u)$ has poles in all three channels
simultaneously, that is, whenever $z_i$ comes close to $z_j$, the amplitude has a
pole.

Last, we mention that although KSV introduced the multiloop interpretation 
of string theory in order to obtain unitarity, the perturbation theory was not 
manifestly unitary. Furthermore, it was not clear how to determine the weights 
of each of these multiloop diagrams. The origin of this problem was that the 
multiloop series was not derived from a Hermitian Hamiltonian, but was simply 
postulated via Eq. (1.2.1). 

Mandelstam [31] gave the solution to this problem by making a conformal 
transformation on the Riemann surfaces of genus g, demonstrating that in the 
light cone gauge the world sheet was equivalent to a string picture in which 
strings split into smaller strings or joined to form larger ones. Because this 
formalism eliminated all redundant modes from the very start and could be 
shown to be Lorentz invariant, the theory was unitary from the beginning. 

Then, Kaku and Kikkawa [32] showed that the theory could be expressed
as a genuine field theory of strings, where unitarity was trivially implemented
by an explicit interacting Hamiltonian. Not only did the field theory of strings
solve the problem of unitarity and fix the weights of the diagrams appearing in
the perturbation series, it gave the possibility of writing a nonperturbative
formalism in which certain symmetries (supersymmetry, 10-dimensional Lorentz
invariance, etc.) could be broken down to obtain realistic phenomenology. We
will elaborate more on the field theory of strings in the second part of this book.


1.3 Supersymmetry 


The bosonic string, as we have seen, can be derived by postulating a simple set 
of assumptions: that the first quantized theory is given by the two-dimensional 
area swept out by the string and that we sum over all conformally inequivalent 
topologies [which generates a set of Feynman diagrams defined on Riemann 
manifolds]. 

However, the physical spectrum of the bosonic theory cannot accommodate 
fermions. One crucial feature of the string model is that it can be generalized 
to include fermion fields defined along the string, which in turn allows us to 
define a new symmetry, supersymmetry, between the bosons and fermions.
Indeed, the discovery of supersymmetry took place first in the string model. 

We must be careful, however, to stress that there are two types of
supersymmetry on the string. The first is world sheet supersymmetry, which
is an unphysical supersymmetry between fermions and bosons defined
in two-dimensional space. World sheet supersymmetry is manifest in the
Neveu-Schwarz-Ramond model [33, 34].

The second is space-time supersymmetry, which is defined in
10-dimensional space-time and corresponds to the physical supersymmetry
defined in space-time. This physical symmetry is manifest in the Green-Schwarz
model [35].

We will begin with world sheet supersymmetry in the Neveu-Schwarz- 
Ramond model and discuss space-time supersymmetry later. 

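Let us introduce a new fermion field $\psi_\mu$, which is a vector in space-time but
transforms as a two-dimensional spinor on the two-dimensional world sheet.
Gervais and Sakita showed that the Neveu-Schwarz-Ramond (NS-R)
model could be derived from a new symmetry, called supersymmetry. They
introduced the Lagrangian [36]

$$L = -\frac{1}{4\pi\alpha'} \left( \partial_a X_\mu \, \partial^a X^\mu - i \bar\psi^\mu \rho^a \partial_a \psi_\mu \right), \tag{1.3.1}$$

where

$$\rho^0 = \begin{pmatrix} 0 & -i \\ i & 0 \end{pmatrix} \tag{1.3.2}$$

and

$$\rho^1 = \begin{pmatrix} 0 & i \\ i & 0 \end{pmatrix}, \tag{1.3.3}$$

with the metric $\{\rho^a, \rho^b\} = -2\eta^{ab}$, where $\eta$ is given by $(-1, +1)$.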
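
The anticommutation relation of the two-dimensional Dirac matrices can be verified directly. The Python sketch below assumes the standard Majorana-basis choice $\rho^0 = \begin{pmatrix} 0 & -i \\ i & 0 \end{pmatrix}$, $\rho^1 = \begin{pmatrix} 0 & i \\ i & 0 \end{pmatrix}$ (an assumption if a different basis is intended) and checks $\{\rho^a, \rho^b\} = -2\eta^{ab}$ with $\eta = \mathrm{diag}(-1, +1)$. The helper names are hypothetical.

```python
RHO = [
    [[0, -1j], [1j, 0]],   # rho^0 (assumed Majorana basis)
    [[0, 1j], [1j, 0]],    # rho^1 (assumed Majorana basis)
]
ETA = [[-1, 0], [0, 1]]    # two-dimensional flat metric (-1, +1)


def matmul2(a, b):
    """Product of two 2x2 matrices."""
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]


def anticommutator(a, b):
    """{a, b} = ab + ba for 2x2 matrices."""
    ab, ba = matmul2(a, b), matmul2(b, a)
    return [[ab[i][j] + ba[i][j] for j in range(2)] for i in range(2)]


def satisfies_clifford_algebra():
    """True if {rho^a, rho^b} = -2 eta^{ab} times the unit matrix."""
    for a in range(2):
        for b in range(2):
            result = anticommutator(RHO[a], RHO[b])
            for i in range(2):
                for j in range(2):
                    expected = -2 * ETA[a][b] * (1 if i == j else 0)
                    if abs(result[i][j] - expected) > 1e-12:
                        return False
    return True
```

With this choice $(\rho^0)^2 = 1$ and $(\rho^1)^2 = -1$, and the product $\rho^0 \rho^1$ is diagonal, which is what splits the fermion kinetic term into separate left-moving and right-moving pieces.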



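Written explicitly, this equals

$$L = \frac{1}{4\pi\alpha'} \left[ \dot X_\mu \dot X^\mu - X'_\mu X'^\mu + i \psi_0 (\partial_\tau + \partial_\sigma) \psi_0 + i \psi_1 (\partial_\tau - \partial_\sigma) \psi_1 \right]. \tag{1.3.4}$$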

This Lagrangian is explicitly invariant under the following: 

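$$\delta X^\mu = \bar\epsilon \psi^\mu, \qquad \delta \psi^\mu = -i \rho^a \partial_a X^\mu \, \epsilon. \tag{1.3.5}$$

The energy-momentum tensor can be written as

$$T_{ab} = \partial_a X_\mu \, \partial_b X^\mu + \frac{i}{4} \bar\psi^\mu \rho_a \partial_b \psi_\mu + \frac{i}{4} \bar\psi^\mu \rho_b \partial_a \psi_\mu - (\text{Trace}). \tag{1.3.6}$$

Generally, in field theory, there is a conserved current associated with every
symmetry, given by

$$J^a = \frac{\delta L}{\delta\, \partial_a \phi}\, \delta\phi, \qquad \partial_a J^a = 0. \tag{1.3.7}$$

For our case, the current associated with the two-dimensional superconformal
symmetry of Eq. (1.3.5) is given by

$$J_a = \tfrac{1}{2} \rho^b \rho_a \psi^\mu \, \partial_b X_\mu. \tag{1.3.8}$$

We can rewrite the superconformal current $J_a$ as

$$T_F = -\tfrac{1}{2} \psi_\mu \, \partial_z X^\mu \tag{1.3.9}$$

and its Fourier moments as

$$G_n = 2 \oint \frac{dz}{2\pi i}\, z^{n + (1/2)}\, T_F(z). \tag{1.3.10}$$

To quantize the system, we find that the fermion is self-conjugate and that

$$\{ \psi^\mu_a(\sigma, \tau), \psi^\nu_b(\sigma', \tau) \} = \pi\, \delta_{ab}\, \delta(\sigma - \sigma')\, \eta^{\mu\nu}. \tag{1.3.11}$$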

With these commutation relations, the superconformal algebra becomes

$$[L_m, L_n] = (m - n) L_{m+n} + \frac{\hat c}{8} (m^3 - m)\, \delta_{m+n,0},$$

$$[L_m, G_r] = \left( \frac{m}{2} - r \right) G_{m+r}, \tag{1.3.12}$$

$$\{ G_r, G_s \} = 2 L_{r+s} + \frac{\hat c}{2} \left( r^2 - \frac{1}{4} \right) \delta_{r+s,0},$$

where $\hat c = 2c/3$ and where, if $G_r$ is integral moded, we have the Ramond (R)
algebra, and if $G_r$ is half-integral moded, then we have the Neveu-Schwarz
(NS) algebra.

This is most easily implemented by forcing the fermion field to have periodic
(R) or antiperiodic (NS) boundary conditions

$$\text{R}: \ \psi_0(\pi, \tau) = \psi_1(\pi, \tau), \qquad \text{NS}: \ \psi_0(\pi, \tau) = -\psi_1(\pi, \tau). \tag{1.3.13}$$



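With these boundary conditions, the harmonic oscillator decomposition is
given by

$$\text{R}: \ \psi^\mu_{0,1} = \frac{1}{\sqrt{2}} \sum_{n=-\infty}^{\infty} d^\mu_n \, e^{-in(\tau \pm \sigma)}, \qquad \text{NS}: \ \psi^\mu_{0,1} = \frac{1}{\sqrt{2}} \sum_{r \in Z + 1/2} b^\mu_r \, e^{-ir(\tau \pm \sigma)}, \tag{1.3.14}$$

where we associate 0 (1) with the + (−) sign, and where we have the
anticommutation relation among oscillators

$$\text{R}: \ \{ d^\mu_n, d^\nu_m \} = \eta^{\mu\nu} \delta_{n,-m}, \qquad \text{NS}: \ \{ b^\mu_r, b^\nu_s \} = \eta^{\mu\nu} \delta_{r,-s}. \tag{1.3.15}$$

The states (including ghosts) of the theory, as usual, are given by the
complete set of harmonic oscillator states

$$\text{R}: \ \{ a^{\dagger\mu}_n \} \{ d^\nu_{-r} \} |0\rangle u_\alpha, \qquad \text{NS}: \ \{ a^{\dagger\mu}_n \} \{ b^\nu_{-r} \} |0\rangle, \tag{1.3.16}$$

where $u_\alpha$ is a 10-dimensional (32-component) Dirac spinor.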

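With this decomposition in terms of oscillators, we can now give an explicit
representation of the generators of the superconformal group. For the NS sector,
the generators are given by

$$L_m = \frac{1}{2} \sum_{n=-\infty}^{\infty} : \alpha_{-n} \cdot \alpha_{m+n} : + \frac{1}{2} \sum_{r=-\infty}^{\infty} \left( r + \tfrac{1}{2} m \right) : b_{-r} \cdot b_{m+r} :, \qquad G_r = \sum_{n=-\infty}^{\infty} \alpha_{-n} \cdot b_{r+n}. \tag{1.3.17}$$

For the R sector, the generators are given by

$$L_m = \frac{1}{2} \sum_{n=-\infty}^{\infty} : \alpha_{-n} \cdot \alpha_{m+n} : + \frac{1}{2} \sum_{n=-\infty}^{\infty} \left( n + \tfrac{1}{2} m \right) : d_{-n} \cdot d_{m+n} :, \qquad G_m = \sum_{n=-\infty}^{\infty} \alpha_{-n} \cdot d_{m+n}. \tag{1.3.18}$$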


Finally, let us define the operator $Q$, which can be easily derived by using the
previous expression for $Q$ in Eq. (1.1.45) in terms of any Lie algebra. We find
that the Faddeev-Popov ghost factor can be written in terms of two commuting
ghosts $\beta$, $\gamma$ as

$$L = \beta\, \partial_{\bar z} \gamma + \text{c.c.}, \tag{1.3.19}$$

where c.c. = complex conjugate. The superconformal generators must
therefore be rewritten to include this new factor coming from the $b$, $c$ and $\beta$, $\gamma$
ghosts:

$$L_m^{\text{ghost}} = \sum_{n=-\infty}^{\infty} (m + n) : b_{m-n} c_n : + \sum_{n=-\infty}^{\infty} \left( \tfrac{1}{2} m + n \right) : \beta_{m-n} \gamma_n :,$$

$$G_m^{\text{ghost}} = -2 \sum_{n=-\infty}^{\infty} b_{-n} \gamma_{m+n} + \sum_{n=-\infty}^{\infty} \left( \tfrac{1}{2} n - m \right) c_{-n} \beta_{m+n}. \tag{1.3.20}$$

Finally, $Q$ can be written as

$$Q = \sum_{n=-\infty}^{\infty} \left( L_{-n} c_n + G_{-n} \gamma_n \right) - \frac{1}{2} \sum_{m,n=-\infty}^{\infty} (m - n) : c_{-m} c_{-n} b_{m+n} : + \sum_{m,n=-\infty}^{\infty} \left( \tfrac{3}{2} n + m \right) : c_{-n} \beta_{-m} \gamma_{m+n} : - \sum_{m,n=-\infty}^{\infty} \gamma_{-m} \gamma_{-n} b_{m+n} - a c_0. \tag{1.3.21}$$

As usual, we can check for the vanishing of $Q^2$, and we find the constraints

$$D = 10, \qquad a = \begin{cases} \frac{1}{2} & (\text{NS}), \\ 0 & (\text{R}). \end{cases} \tag{1.3.22}$$
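
The critical dimensions can also be understood by a simple counting of central charges. The Python sketch below is an illustration that relies on standard values not derived in this section, so they should be read as assumptions of the example: each boson $X^\mu$ contributes 1, each world sheet Majorana fermion $\psi^\mu$ contributes $\frac{1}{2}$, the anticommuting $b$, $c$ ghosts contribute $-26$, and the commuting $\beta$, $\gamma$ ghosts contribute $+11$. The function names are hypothetical.

```python
from fractions import Fraction

# Standard central charges (assumed here, not derived in this section).
C_BOSON = Fraction(1)          # one free boson X
C_MAJORANA = Fraction(1, 2)    # one free Majorana fermion psi
C_BC_GHOSTS = Fraction(-26)    # anticommuting b, c ghosts
C_BETA_GAMMA = Fraction(11)    # commuting beta, gamma ghosts


def total_c_bosonic(D):
    """Matter plus ghost central charge for the bosonic string."""
    return D * C_BOSON + C_BC_GHOSTS


def total_c_nsr(D):
    """Matter plus ghost central charge for the NS-R string."""
    return D * (C_BOSON + C_MAJORANA) + C_BC_GHOSTS + C_BETA_GAMMA


def critical_dimensions(total_c, candidates=range(1, 41)):
    """Dimensions in the candidate range where the total central charge vanishes."""
    return [D for D in candidates if total_c(D) == 0]
```

The total vanishes at $D = 26$ for the bosonic string and at $D = 10$ once the fermions and the commuting ghosts are included, in agreement with the nilpotency conditions on $Q$.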


1.4 2D SUSY Versus 10D SUSY 

In previous sections, we presented the gauge-fixed NS-R action, where the 
action was only invariant under global, not local, superconformal invariance. 
This was to show how the theory could be quantized and to show the nature of 
its spectrum. 

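We will now present the full NS-R action, with all its invariances intact. The
key will be to introduce a zweibein $e^a_\alpha$ and its supersymmetric partner $\chi_\alpha$, which
is a two-dimensional world sheet spinor (whose indices we shall suppress). We
shall use $a, b, c$ to denote flat two-dimensional indices and $\alpha, \beta, \gamma$ to denote
curved two-dimensional indices. Then, the action is [37, 38]

$$L = -\frac{1}{4\pi\alpha'} \sqrt{g} \left( g^{\alpha\beta} \partial_\alpha X_\mu \, \partial_\beta X^\mu - i \bar\psi^\mu \rho^\alpha \nabla_\alpha \psi_\mu + 2 \bar\chi_\alpha \rho^\beta \rho^\alpha \psi^\mu \, \partial_\beta X_\mu + \tfrac{1}{2} \bar\psi_\mu \psi^\mu \, \bar\chi_\alpha \rho^\beta \rho^\alpha \chi_\beta \right), \tag{1.4.1}$$

where $\rho^\alpha$ is not a constant matrix, but is multiplied by the two-dimensional
zweibein

$$\rho^\alpha = e^\alpha_a \, \rho^a. \tag{1.4.2}$$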



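The action is invariant under

$$\delta X^\mu = \bar\epsilon \psi^\mu, \qquad \delta \psi^\mu = -i \rho^\alpha \epsilon \left( \partial_\alpha X^\mu - \bar\psi^\mu \chi_\alpha \right), \qquad \delta e^a_\beta = -2i \bar\epsilon \rho^a \chi_\beta, \qquad \delta \chi_\alpha = \nabla_\alpha \epsilon, \tag{1.4.3}$$

as well as under local Weyl rescaling

$$\delta X^\mu = 0, \qquad \delta \psi^\mu = -\tfrac{1}{2} \sigma \psi^\mu, \qquad \delta e^a_\beta = \sigma e^a_\beta, \qquad \delta \chi_\alpha = \tfrac{1}{2} \sigma \chi_\alpha, \tag{1.4.4}$$

as well as

$$\delta \chi_\alpha = i \rho_\alpha \eta, \qquad \delta e^a_\beta = \delta \psi^\mu = \delta X^\mu = 0. \tag{1.4.5}$$

We have enough gauge constraints to place the following conditions on the
zweibein and the gravitino:

$$e^a_\alpha = \delta^a_\alpha, \qquad \chi_\alpha = 0. \tag{1.4.6}$$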

With these constraints, the action reduces to the linear NS-R action studied 
earlier. 

Although the NS-R formalism is both simple and elegant and can be effortlessly
quantized both covariantly and canonically, there is one extreme
drawback: it lacks genuine 10-dimensional space-time supersymmetry. Notice
that the NS-R action only has supersymmetry of the world sheet, that is, it
interchanges $X^\mu$ with $\psi^\mu$, but it does not interchange space-time bosons with
space-time fermions.

By hand, one can impose the Gliozzi-Scherk-Olive (GSO) projection [39] 
on the states of the NS-R model, and the resulting fermionic and bosonic 
states become space-time supersymmetric. The superstring, therefore, possesses
two-dimensional superconformal symmetry at the Lagrangian level and
space-time supersymmetry at the Fock space level. However, the origin of this 
space-time supersymmetry is very obscure and still not well understood. 

Space-time supersymmetry is a physical symmetry that lies at the heart of 
many of the near-miraculous properties of the string model, including its finiteness
and lack of anomalies, so we need another formalism in which space-time
supersymmetry is manifest. This second formalism, equivalent to the NS-R
formalism after the GSO projection, is called the Green-Schwarz (GS) formalism
[35] and explicitly contains space-time supersymmetry by introducing a
32-component, 10-dimensional space-time spinor. The advantage of the GS 
formalism is that space-time supersymmetry is built-in from the start and can 
be used to analyze the cancellation of divergences and anomalies in the theory. 



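The GS action introduces two space-time spinors $\theta^A$, $A = 1, 2$ (we will
suppress the spinorial index of the spinor). The action is

$$
S = -\frac{1}{4\pi\alpha'}\int d\sigma\, d\tau \left( \sqrt{g}\, g^{\alpha\beta}\, \Pi_\alpha \cdot \Pi_\beta + 2i\epsilon^{\alpha\beta}\partial_\alpha X^\mu \left(\bar\theta^1 \Gamma_\mu \partial_\beta \theta^1 - \bar\theta^2 \Gamma_\mu \partial_\beta \theta^2\right) - 2\epsilon^{\alpha\beta}\,\bar\theta^1 \Gamma^\mu \partial_\alpha \theta^1\, \bar\theta^2 \Gamma_\mu \partial_\beta \theta^2 \right),
\tag{1.4.7}
$$

where

$$
\Pi^\mu_\alpha = \partial_\alpha X^\mu - i\bar\theta^A \Gamma^\mu \partial_\alpha \theta^A,
\tag{1.4.8}
$$

where $\Gamma^\mu$ are 10-dimensional Dirac matrices, and $\alpha, \beta$ are local, two-dimensional
world sheet indices. The $A$ index, however, labels two distinct
world sheet scalars and not a two-component world sheet spinor.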

The action is explicitly invariant under 

$$
\delta\theta^A = \epsilon^A, \qquad \delta X^\mu = i\bar\epsilon^A \Gamma^\mu \theta^A.
\tag{1.4.9}
$$

In the proof of the invariance of the action, we will use the following identity 
for a 10-dimensional spinor 


$$
\Gamma_\mu \psi_{[1}\, \bar\psi_2 \Gamma^\mu \psi_{3]} = 0,
\tag{1.4.10}
$$


which is only true for the following: 

(1) $D = 3$ and $\psi$ is a Majorana fermion;

(2) $D = 4$ and $\psi$ is a Majorana or Weyl fermion;

(3) $D = 6$ and $\psi$ is a Weyl fermion; and

(4) $D = 10$ and $\psi$ is both a Majorana and a Weyl fermion.

We will, of course, take the case when $D = 10$, so that the spinors are both
Majorana and Weyl.

In general, a 10-dimensional complex Dirac spinor has 32 complex components.
Imposing that it be real (Majorana) and that it have a definite eigenvalue
under the chiral operator:

$$
\tfrac{1}{2}\left(1 \pm \Gamma_{D+1}\right), \qquad \Gamma_{D+1} = \Gamma_0 \Gamma_1 \Gamma_2 \cdots \Gamma_{D-1},
\tag{1.4.11}
$$

reduces the number of components by half each time, so that a Majorana-Weyl 
spinor in 10 dimensions has 16 real components. 
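The halving argument can be written as a small bookkeeping sketch. It assumes the standard fact that a Dirac spinor in even dimension $D$ has $2^{D/2}$ complex components; the function name `spinor_components` is hypothetical, and the sketch does not check whether the Majorana and Weyl conditions are compatible in a given dimension (the text states that they are for $D = 10$).

```python
def spinor_components(dim, majorana=False, weyl=False):
    """Count real components of a spinor in even dimension `dim`.

    Assumes a Dirac spinor has 2**(dim // 2) complex components and that each
    of the Majorana (reality) and Weyl (chirality) conditions halves the count.
    Compatibility of the two conditions in `dim` dimensions is NOT checked.
    """
    if dim % 2 != 0:
        raise ValueError("this sketch only covers even dimensions")
    real = 2 * 2 ** (dim // 2)   # each complex component counts as two real ones
    if majorana:
        real //= 2
    if weyl:
        real //= 2
    return real


dirac_real = spinor_components(10)                                  # 32 complex components
majorana_weyl_real = spinor_components(10, majorana=True, weyl=True)
print(dirac_real, majorana_weyl_real)
```

The first number counts the 32 complex components of the Dirac spinor as real ones; the second is the Majorana-Weyl count quoted in the text.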

To show invariance under space-time supersymmetry, we need the following
definitions. Let us define a spinorial parameter $\kappa^{A\alpha a}$, where $A = 1, 2$, $\alpha$ is a
local, two-dimensional world sheet index, and $a$ is a spinorial index, which we
will often suppress. Also, we now introduce the operator $P_\pm^{\alpha\beta}$, which projects
onto self-dual and anti-self-dual pieces of a two-dimensional vector:

$$
P_\pm^{\alpha\beta} = \tfrac{1}{2}\left(g^{\alpha\beta} \pm \epsilon^{\alpha\beta}/\sqrt{g}\right),
\tag{1.4.12}
$$



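which has the following important properties:

$$
\begin{aligned}
P_\pm^{\alpha\beta}\, g_{\beta\gamma}\, P_\pm^{\gamma\delta} &= P_\pm^{\alpha\delta}, \\
P_\pm^{\alpha\beta}\, g_{\beta\gamma}\, P_\mp^{\gamma\delta} &= 0, \\
\kappa^{1\alpha} &= P_-^{\alpha\beta} \kappa^1_\beta, \\
\kappa^{2\alpha} &= P_+^{\alpha\beta} \kappa^2_\beta.
\end{aligned}
\tag{1.4.13}
$$

Then, the GS action is invariant under

$$
\begin{aligned}
\delta\theta^A &= 2i\,\Gamma \cdot \Pi_\alpha\, \kappa^{A\alpha}, \\
\delta X^\mu &= i\bar\theta^A \Gamma^\mu \delta\theta^A, \\
\delta\left(\sqrt{g}\, g^{\alpha\beta}\right) &= -16\sqrt{g}\left(P_-^{\alpha\gamma}\, \bar\kappa^{1\beta} \partial_\gamma \theta^1 + P_+^{\alpha\gamma}\, \bar\kappa^{2\beta} \partial_\gamma \theta^2\right).
\end{aligned}
\tag{1.4.14}
$$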
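The projector properties quoted in Eq. (1.4.13) can be checked numerically for a sample metric. The sketch below is only an illustration under stated assumptions: the world sheet metric is taken to be Lorentzian, so that $\sqrt{g}$ is read as $\sqrt{-\det g}$, and the convention $\epsilon^{01} = +1$ is assumed. The helper names (`projectors`, `matmul`, `max_abs_diff`) are hypothetical.

```python
import math

EPS = [[0.0, 1.0], [-1.0, 0.0]]   # assumed convention: epsilon^{01} = +1


def matmul(a, b):
    """Product of two 2x2 matrices given as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]


def projectors(g):
    """Return (P_plus, P_minus), upper indices, for a Lorentzian metric g_{ab}."""
    det = g[0][0] * g[1][1] - g[0][1] * g[1][0]
    if det >= 0:
        raise ValueError("expected a Lorentzian metric (negative determinant)")
    root = math.sqrt(-det)                      # sqrt(g) read as sqrt(-det g)
    g_up = [[g[1][1] / det, -g[0][1] / det],    # inverse metric g^{ab}
            [-g[1][0] / det, g[0][0] / det]]
    plus = [[0.5 * (g_up[i][j] + EPS[i][j] / root) for j in range(2)]
            for i in range(2)]
    minus = [[0.5 * (g_up[i][j] - EPS[i][j] / root) for j in range(2)]
             for i in range(2)]
    return plus, minus


def max_abs_diff(a, b):
    """Largest entrywise difference between two 2x2 matrices."""
    return max(abs(a[i][j] - b[i][j]) for i in range(2) for j in range(2))


g = [[-2.0, 0.5], [0.5, 1.0]]                   # an arbitrary Lorentzian sample metric
p_plus, p_minus = projectors(g)
zero = [[0.0, 0.0], [0.0, 0.0]]

# P_+ g P_+ = P_+,  P_- g P_- = P_-,  P_+ g P_- = 0
err_pp = max_abs_diff(matmul(matmul(p_plus, g), p_plus), p_plus)
err_mm = max_abs_diff(matmul(matmul(p_minus, g), p_minus), p_minus)
err_pm = max_abs_diff(matmul(matmul(p_plus, g), p_minus), zero)
print(err_pp, err_mm, err_pm)
```

All three residuals should vanish up to floating-point rounding. With a Euclidean metric the same expressions would need a factor of $i$ in front of $\epsilon^{\alpha\beta}$, which is why the signature is stated explicitly here.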


Although the GS action is symmetric under a wide variety of invariances, 
including space-time supersymmetry, the problem is that covariant quantization
of this action is exceedingly difficult. This is because the theory is
highly nonlinear, and the momenta associated with coordinates do not have 
simple properties. For example, one might naively construct the momenta 
corresponding to the spinor: 


XA = -T-~ir»8 a Xy L e A . (1.4.15) 

se A 

We find that the commutation relations are proportional to the inverse of the 
constraints. Thus, the commutators diverge, and the naive approach makes no 
sense. 

The problem is that the first-class and second-class constraints of the theory
are mixed together in a way that cannot be separated. The reason for this is
actually quite easy to see.

We saw that a Majorana-Weyl spinor has 16 real components. This is
the smallest spinorial representation of the 10-dimensional Lorentz group
$SO(9, 1)$. However, if we were to apply all the covariant constraints on
the spinor, we would have to place one additional constraint, reducing the
spinor to eight components. But there are no eight-dimensional spinorial
representations of $SO(9, 1)$. Thus, naive quantization of the theory is impossible.
Only recently, with advances in the covariant quantization method
of Batalin-Vilkovisky (BV), has the covariant gauge-fixed GS action been
quantized.

However, although the covariant quantization of the theory is quite difficult, 
the light cone quantization of the theory is quite easy (although covariance is 
completely lost). Let us apply the light cone gauge conditions 

$$
\Gamma^+ \theta^{1,2} = 0, \qquad \Gamma^\pm = 2^{-1/2}\left(\Gamma^0 \pm \Gamma^9\right),
\tag{1.4.16}
$$





so that the Dirac matrices satisfy 

$$
(\Gamma^+)^2 = (\Gamma^-)^2 = 0.
\tag{1.4.17}
$$

Then the nonlinear terms in the action in Eq. (1.4.7) disappear, leaving us with 
the simple action 

$$
S = -\frac{1}{4\pi\alpha'}\int d\sigma\, d\tau \left(\partial_a X^i \partial^a X^i - i\bar S \rho^b \partial_b S\right),
\tag{1.4.18}
$$

where $a, b$ are flat world sheet indices and $i, j$ are eight-dimensional transverse
indices and where we have substituted $\sqrt{p^+}\,\theta$ with $S$. Notice that the two space-time
spinors $\theta^A$, which were scalars on the world sheet (and not world sheet
spinors), have strangely transformed into genuine world sheet spinors $S$.

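The light cone action has a very simple global space-time supersymmetry
associated with it. The supersymmetry transformation is given by

$$
\begin{aligned}
\delta S^a &= (2p^+)^{1/2}\, \eta^a, \\
\delta X^i &= 0, \\
\delta S^a &= -i\,(p^+)^{-1/2}\, \rho^b \partial_b X^i\, (\gamma^i)^{a\dot a}\, \epsilon^{\dot a}, \\
\delta X^i &= (p^+)^{-1/2}\, (\gamma^i)^{a\dot a}\, \bar\epsilon^{\dot a} S^a,
\end{aligned}
\tag{1.4.19}
$$

where $(\gamma^i)^{a\dot a}$ are the Dirac matrices that generate a representation of $SO(8)$,
and the parameters of the symmetry are given by $\eta^a$ and $\epsilon^{\dot a}$.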

To calculate the generators of this symmetry, let us first quantize the model. 
Since the action is now linear, this is trivial. We find 

$$
\left\{S^{Aa}(\sigma, \tau),\, S^{Bb}(\sigma', \tau)\right\} = \pi\, \delta^{ab}\, \delta^{AB}\, \delta(\sigma - \sigma')
\tag{1.4.20}
$$

and there is only one choice of boundary conditions for open strings 


$$
S^{1a}(0, \tau) = S^{2a}(0, \tau), \qquad S^{1a}(\pi, \tau) = S^{2a}(\pi, \tau),
\tag{1.4.21}
$$

so that the strings have the same $SO(8)$ chirality.

The generator of these symmetries is given by 

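$$
Q^a = (2p^+)^{1/2} S_0^a, \qquad
Q^{\dot a} = (p^+)^{-1/2}\, (\gamma^i)^{\dot a a} \sum_{n=-\infty}^{\infty} S^a_{-n}\, \alpha^i_n.
\tag{1.4.22}
$$

The generators, in turn, form the supersymmetric algebra

$$
\begin{aligned}
\{Q^a, Q^b\} &= 2p^+ \delta^{ab}, \\
\{Q^a, Q^{\dot a}\} &= \sqrt{2}\, (\gamma^i)^{a\dot a}\, p^i, \\
\{Q^{\dot a}, Q^{\dot b}\} &= 2H \delta^{\dot a \dot b},
\end{aligned}
\tag{1.4.23}
$$

where

$$
H = \frac{1}{p^+}\left[\sum_{n=1}^{\infty}\left(\alpha^i_{-n}\alpha^i_n + n S^a_{-n} S^a_n\right) + \tfrac{1}{2} p_i^2\right].
\tag{1.4.24}
$$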





In contrast to the covariant quantization of the GS string, which is quite difficult 
and involved, the light cone quantization of the GS string yields a simple, free 
supersymmetric theory. 


1.5 Types of Strings 

At this point, we may ask, what are the various types of string theories one 
can write that are supersymmetric, ghost-free, and anomaly-free? The easiest 
way to catalog the various possibilities is through the light cone quantization 
of the GS string, since all ghosts have been removed and the theory is globally 
supersymmetric in space-time. 

The list of totally self-consistent string theories consists of: 

(1) type I; 

(2) type IIA; 

(3) type IIB; 

(4) $E_8 \otimes E_8$ heterotic; and

(5) $SO(32)$ heterotic.

It may seem surprising that there are so few self-consistent string theories, 
while there are an infinite number of possible theories of point-particles. The 
reason for this is that the Feynman diagrams of a point-particle are based on 
one-dimensional graphs, upon which we can impose any number of Lorentz 
covariant vectors and spinors (e.g., Feynman’s rules). However, the Feynman 
diagrams of string theory are two-dimensional manifolds, obeying strict
self-consistency constraints, so it is not surprising that we only find five
self-consistent string theories.

Type I 

The first string theory is called Type I, in which, as we have seen, the spinors
$S^1$ and $S^2$ have the same $SO(8)$ chirality. The theory of open strings, by itself,
however, is incomplete (because the nonplanar one-loop diagram has a pole,
which corresponds to a closed string). Because the closed string emerges as a
bound state of open string graphs, we must add the closed string sector to the
open string in order to maintain unitarity.

Gauge invariance can be added into the theory by multiplying the $N$-point
function with appropriate traces over the generators of some Lie algebra (called
Chan-Paton factors). The gauge group must be $SO(32)$ in order to cancel all
anomalies.


Type IIA 

For closed strings, there is a choice as to how to choose the chiralities of $S^1$
and $S^2$. If we choose them to be of opposite chirality, then we have the Type
IIA string. Type IIA closed string theory is appealing because it has no chiral
anomalies from the very beginning (since the two chiral sectors cancel against
each other). In the zero slope limit, when only the massless sector of the theory
survives, the theory reduces to the point-particle $N = 2$, $D = 10$ supergravity
theory.

Type IIB 

For closed strings, if we take the choice where $S^1$ and $S^2$ have the same chirality,
then we have the Type IIB superstring. However, in the zero slope limit, when 
we analyze the massless sector, we find that there does not exist any known 
covariant version of this theory. Its light cone reduction is well defined, but 
its covariant precursor apparently cannot be written. (This may be because of 
our limited understanding of how to construct point-particle supersymmetric 
theories in 10 dimensions.) 


Heterotic String 

The string theory that comes closest, perturbatively, to describing the physical
world is the heterotic string [40]. While the Type I string uses Chan-Paton factors
to introduce isospin symmetry, the heterotic string uses the 16 dimensions
left from the compactification of 26 dimensions down to 10 dimensions to introduce
a rank 16 Lie group. Since $E_8$ is a rank eight Lie group, the heterotic string
can be compactified so that its spectrum is $E_8 \otimes E_8$ [or Spin(32)/$Z_2$], which
is certainly large enough to permit a serious phenomenological investigation.
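The rank counting in this paragraph can be made explicit with a short sketch. The rank and dimension of $E_8$ (8 and 248) and the formulas for $so(n)$ are standard Lie-algebra facts that are assumed here rather than taken from the text; the names `so_n`, `E8`, and `e8_x_e8` are hypothetical.

```python
def so_n(n):
    """Rank and dimension of the Lie algebra so(n), for even n."""
    return {"rank": n // 2, "dim": n * (n - 1) // 2}


E8 = {"rank": 8, "dim": 248}        # standard values for E_8 (assumed, not from the text)

extra_dims = 26 - 10                # left-moving dimensions that are compactified
e8_x_e8 = {"rank": 2 * E8["rank"], "dim": 2 * E8["dim"]}
so32 = so_n(32)

print(extra_dims, e8_x_e8, so32)
```

Both candidate groups have rank equal to the number of compactified dimensions, and they also have the same number of generators.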

The heterotic string, however, achieves this compactification in an unorthodox
fashion. We recall that the closed string has right-moving and left-moving
oscillator modes, which, for the most part, do not interact. The heterotic string
splits these modes apart. The left-moving modes are purely bosonic and live
in a 26-dimensional space, which has been compactified to 10 dimensions,
leaving us with an $E_8 \otimes E_8$ isospin symmetry. However, the right-moving
modes only live in a 10-dimensional space and contain the supersymmetric
GS or NS-R theory. When the left-moving half (containing the isospin) and
the right-moving half (containing the supersymmetry) are put together, they
produce a self-consistent, ghost-free, anomaly-free, one-loop finite theory, the
heterotic string (meaning “hybrid vigor”).

The action for the heterotic string is therefore 

$$
S = -\frac{1}{4\pi\alpha'}\int d\sigma\, d\tau \left(\partial_a X^i \partial^a X^i + \sum_{I=1}^{16} \partial_a X^I \partial^a X^I + i\bar S \gamma^- (\partial_\tau + \partial_\sigma) S\right),
\tag{1.5.1}
$$

where $I = 1, 2, \ldots, 16$ is an isospin index and where we enforce the
constraints

$$
(\partial_\tau - \partial_\sigma) X^I = 0, \qquad \gamma^+ S = \tfrac{1}{2}(1 + \gamma_{11}) S = 0,
\tag{1.5.2}
$$

where $\gamma^+ = 2^{-1/2}(\gamma^0 + \gamma^9)$. (Some have criticized the heterotic string for being
artificial and contrived because of the way it splits the left- and right-moving
oscillators, indicating that perhaps the heterotic string, in turn, is a broken
version of an even higher string. However, attempts to embed the heterotic
string into a larger string theory have not been particularly successful.) As
we shall see in later chapters, with a mild set of assumptions, we can obtain
surprisingly realistic string theories that contain the $SU(3) \otimes SU(2) \otimes U(1)$
low-energy theory of our world.

To understand these compactifications, in the first part of this book, we will 
turn to a discussion of conformal field theory, which will hopefully give us a 
classification of all possible vacuums of the theory. Perhaps one of the millions 
of conformal field theories that have been discovered describes our universe. 

However, this perturbative approach alone can never yield totally realistic 
results. Supersymmetry and 10-dimensional space-time seem to be preserved 
to all orders in perturbation theory, so perturbation theory by itself can never 
break the symmetries of the superstring in order to yield realistic phenomenology.
In the second part of this book, we will concentrate on nonperturbative
approaches to superstring theory, especially string field theory, matrix models, 
and M-theory. 


1.6 Summary 

Superstring theory has emerged as the leading candidate for all known forces. 
Not only has the theory enough symmetries to include the four fundamental 
forces as subsets of its symmetries, it is also the only theory that can claim to 
yield a finite quantum theory of gravity. 

The bosonic Lagrangian for the string is given by 

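$$
L = -\frac{1}{4\pi\alpha'}\sqrt{g}\, g^{ab}\, \partial_a X_\mu \partial_b X_\nu\, \eta^{\mu\nu}.
\tag{1.6.1}
$$

Notice that the action is manifestly reparametrization invariant. If we
reparametrize the two-dimensional world sheet according to

$$
\sigma \to \tilde\sigma(\sigma, \tau), \qquad \tau \to \tilde\tau(\sigma, \tau),
\tag{1.6.2}
$$

then the action is invariant under this two-dimensional general coordinate
transformation if

$$
g_{ab}(x) = \frac{\partial \tilde x^c}{\partial x^a} \frac{\partial \tilde x^d}{\partial x^b}\, \tilde g_{cd}(\tilde x),
\tag{1.6.3}
$$

where $x^a = \{\sigma, \tau\}$. The theory is also scale invariant under

$$
g_{ab} \to e^{\phi} g_{ab}.
\tag{1.6.4}
$$

There are enough symmetries in the theory to select out the conformal gauge

$$
g_{ab} = \delta_{ab},
\tag{1.6.5}
$$

so the action becomes manifestly conformally invariant:

$$
L = \frac{1}{2\pi}\, \partial_z X_\mu\, \partial_{\bar z} X^\mu.
\tag{1.6.6}
$$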

In the first quantized formalism, interactions are introduced by the functional 
integral 


$$
A_N = \sum_{\text{topologies}} \int d\mu \int DX\, \exp\left[-\int d^2z\, L(z) + \sum_{i=1}^{N} i k_{i,\mu} X^\mu(z_i)\right].
\tag{1.6.7}
$$


We must sum over all conformally distinct surfaces, including Riemann surfaces
of arbitrary genus, which means that the first quantized formalism is
necessarily perturbative. This is one of the fundamental deficiencies of the 
first quantized system. 

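In the genus zero limit, we can calculate the full spectrum of the theory. To
do this, we first define the energy-momentum tensor

$$
T_{ab} = -4\pi\alpha'\, \frac{1}{\sqrt{g}}\, \frac{\delta L}{\delta g^{ab}}.
\tag{1.6.8}
$$

This, in turn, can be shown to equal

$$
T_{ab} = \partial_a X^\mu \partial_b X_\mu - \tfrac{1}{2}\, g_{ab}\, g^{cd}\, \partial_c X^\mu \partial_d X_\mu.
\tag{1.6.9}
$$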

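We define the Virasoro generators $L_n$ as the moments of the energy-momentum
tensor

$$
\begin{aligned}
L_m &= \frac{1}{4\pi\alpha'}\int_0^\pi \left[e^{im\sigma}(T_{00} + T_{01}) + e^{-im\sigma}(T_{00} - T_{01})\right] d\sigma \\
&= \frac{1}{8\pi\alpha'}\int_{-\pi}^{\pi} e^{im\sigma} (\dot X + X')^2\, d\sigma \\
&= \frac{1}{2}\sum_{n=-\infty}^{\infty} \alpha_{m-n} \cdot \alpha_n.
\end{aligned}
\tag{1.6.10}
$$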


These generators, L n , in turn form a closed algebra, the Virasoro algebra 
(which generates the conformal group): 

$$
[L_n, L_m] = (n - m) L_{n+m} + \frac{c}{12}\, n(n+1)(n-1)\, \delta_{n,-m},
\tag{1.6.11}
$$

where c is a constant. 
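As a consistency check on Eq. (1.6.11), one can verify that the algebra obeys the Jacobi identity, which is what restricts the central term to the cubic form shown. The following is a minimal sketch over a finite range of mode numbers, using exact rational arithmetic; the function names are hypothetical, and $c$ is set to 1 since it only appears as an overall factor.

```python
from fractions import Fraction


def central(n, m, c=Fraction(1)):
    """Central term of [L_n, L_m] in Eq. (1.6.11): (c/12) n(n+1)(n-1) delta_{n,-m}."""
    if n + m != 0:
        return Fraction(0)
    return c * Fraction(n * (n + 1) * (n - 1), 12)


def jacobi_operator_part(n, m, k):
    """Coefficient of L_{n+m+k} in [[L_n, L_m], L_k] + cyclic permutations."""
    return ((n - m) * (n + m - k)
            + (m - k) * (m + k - n)
            + (k - n) * (k + n - m))


def jacobi_central_part(n, m, k):
    """Central (c-number) part of [[L_n, L_m], L_k] + cyclic permutations."""
    return ((n - m) * central(n + m, k)
            + (m - k) * central(m + k, n)
            + (k - n) * central(k + n, m))


modes = range(-6, 7)
violations = [(n, m, k) for n in modes for m in modes for k in modes
              if jacobi_operator_part(n, m, k) != 0
              or jacobi_central_part(n, m, k) != 0]
print(violations)   # an empty list means the Jacobi identity holds on this range
```

The check is only over a finite window of modes, so it is evidence rather than a proof.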

In the Gupta-Bleuler quantization scheme, we allow ghosts (due to the
negative sign in the Lorentz metric) to propagate in the theory. We eliminate
them by applying the Virasoro generators on the states

$$
L_n |R\rangle = 0, \quad n > 0, \qquad (L_0 - 1)|R\rangle = 0,
\tag{1.6.12}
$$


where the second condition is the mass-shell condition. 

In the alternative BRST formalism, we allow Faddeev-Popov $b, c$ ghosts to
circulate in the theory and then cancel them by requiring the physical states to
satisfy

$$
Q|R\rangle = 0,
\tag{1.6.13}
$$


where Q is nilpotent. 

Fermions are introduced into the theory by adding supersymmetry (which 
was first discovered in string theory). 

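Let us introduce a new fermion field $\psi^\mu$, which is a vector in space-time but
transforms as a two-dimensional spinor on the two-dimensional world sheet.
The Neveu-Schwarz-Ramond action is

$$
L = -\frac{1}{4\pi\alpha'}\left(\partial_a X_\mu \partial^a X^\mu - i\bar\psi^\mu \rho^a \partial_a \psi_\mu\right),
\tag{1.6.14}
$$

where

$$
\rho^0 = \begin{pmatrix} 0 & -i \\ i & 0 \end{pmatrix}, \qquad
\rho^1 = \begin{pmatrix} 0 & i \\ i & 0 \end{pmatrix},
\tag{1.6.15}
$$

and

$$
\psi^\mu = \begin{pmatrix} \psi^\mu_0 \\ \psi^\mu_1 \end{pmatrix}, \qquad \bar\psi^\mu = \psi^\mu \rho^0,
\tag{1.6.16}
$$

with the metric $\{\rho^a, \rho^b\} = -2\eta^{ab}$, where $\eta$ is given by $(-1, +1)$.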
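The anticommutation relation quoted above can be checked directly for an explicit pair of matrices. The sketch below takes $\rho^0$ with rows $(0, -i)$ and $(i, 0)$, and assumes for $\rho^1$ the common choice with rows $(0, i)$ and $(i, 0)$; that second matrix is an assumption made for this illustration. The helper names are hypothetical.

```python
RHO = {
    0: [[0, -1j], [1j, 0]],   # rho^0
    1: [[0, 1j], [1j, 0]],    # rho^1 (assumed standard choice, see lead-in)
}
ETA = {(0, 0): -1, (0, 1): 0, (1, 0): 0, (1, 1): 1}   # eta^{ab} = diag(-1, +1)


def matmul(a, b):
    """Product of two 2x2 matrices given as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]


def anticommutator(a, b):
    """Return ab + ba for 2x2 matrices."""
    ab, ba = matmul(a, b), matmul(b, a)
    return [[ab[i][j] + ba[i][j] for j in range(2)] for i in range(2)]


def clifford_holds():
    """Check {rho^a, rho^b} = -2 eta^{ab} times the identity, for all a, b."""
    for a in (0, 1):
        for b in (0, 1):
            lhs = anticommutator(RHO[a], RHO[b])
            rhs = [[-2 * ETA[(a, b)] * (1 if i == j else 0) for j in range(2)]
                   for i in range(2)]
            if lhs != rhs:
                return False
    return True


print(clifford_holds())
```

Any other pair of matrices related to these by a change of basis would satisfy the same relation, so the check constrains the algebra rather than the specific representation.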

The N = 1 superconformal algebra is constructed by taking the moments 
of the energy-momentum tensor and the supercurrent: 

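$$
\begin{aligned}
[L_m, L_n] &= (m - n) L_{m+n} + \frac{c}{12}(m^3 - m)\, \delta_{m+n,0}, \\
[L_m, G_r] &= \left(\frac{m}{2} - r\right) G_{m+r}, \\
\{G_r, G_s\} &= 2L_{r+s} + \frac{c}{3}\left(r^2 - \frac{1}{4}\right) \delta_{r+s,0},
\end{aligned}
\tag{1.6.17}
$$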

where, if G r is integral moded, we have the Ramond algebra, and if G r is 
half-integral moded, then we have the Neveu-Schwarz algebra. By a similar 
analysis, the NS-R model is ghost-free in 10 dimensions. 

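The NS-R model’s main problem, however, is lack of space-time supersymmetry,
which can only be restored by truncating the Fock space. An
equivalent formalism, which is manifestly space-time supersymmetric, is the
Green-Schwarz string

$$
S = -\frac{1}{4\pi\alpha'}\int d\sigma\, d\tau \left[ \sqrt{g}\, g^{\alpha\beta}\, \Pi_\alpha \cdot \Pi_\beta + 2i\epsilon^{\alpha\beta}\partial_\alpha X^\mu \left(\bar\theta^1 \Gamma_\mu \partial_\beta \theta^1 - \bar\theta^2 \Gamma_\mu \partial_\beta \theta^2\right) - 2\epsilon^{\alpha\beta}\,\bar\theta^1 \Gamma^\mu \partial_\alpha \theta^1\, \bar\theta^2 \Gamma_\mu \partial_\beta \theta^2 \right],
\tag{1.6.18}
$$

where

$$
\Pi^\mu_\alpha = \partial_\alpha X^\mu - i\bar\theta^A \Gamma^\mu \partial_\alpha \theta^A,
\tag{1.6.19}
$$

where $\Gamma^\mu$ are 10-dimensional Dirac matrices, $\alpha, \beta$ are local two-dimensional
world sheet indices, and $A = 1, 2$. This $A$ index, however, labels two distinct
world sheet scalars, not a two-component world sheet spinor.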

Superstrings are enormously constrained because of the large symmetry 
group and because interactions are defined on manifolds. (By contrast, point- 
particle theories are defined on graphs, and hence an infinite number of them 
can be written.) 

So far, only five superstrings have been discovered: 

(1) For Type I superstrings, we combine open and closed superstrings. The
theory is anomaly-free for the gauge group $SO(32)$.

(2) For Type IIA superstrings, we have a closed string theory in which the 
spinors have opposite chirality. 

(3) For Type IIB superstrings, the spinors are of the same chirality. 

(4) For heterotic superstrings, we have a closed superstring theory in which
the left-moving sector is bosonic and lives in 26-dimensional space and
the right-moving sector is supersymmetric in 10 dimensions. By compactifying
the 26-dimensional left-moving sector to 10 dimensions, we obtain
the gauge group $E_8 \otimes E_8$ or Spin(32)/$Z_2$.


The action for the heterotic string is therefore 


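$$
S = -\frac{1}{4\pi\alpha'}\int d\sigma\, d\tau \left(\partial_a X^i \partial^a X^i + \sum_{I=1}^{16} \partial_a X^I \partial^a X^I + i\bar S \gamma^- (\partial_\tau + \partial_\sigma) S\right),
\tag{1.6.20}
$$

where $I$ labels the $E_8 \otimes E_8$ symmetry and where we enforce the constraints

$$
(\partial_\tau - \partial_\sigma) X^I = 0, \qquad \gamma^+ S = \tfrac{1}{2}(1 + \gamma_{11}) S = 0,
\tag{1.6.21}
$$

where $\gamma^+ = 2^{-1/2}(\gamma^0 + \gamma^9)$.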

The central theme of this book, and also the most pressing problem in superstring
research, is the search for the true vacuum of the theory. Notice that
our discussion has been mainly perturbative. However, because perturbation
theory can never compactify space-time to four dimensions or break supersymmetry,
we must turn to nonperturbative formalisms, which are discussed
in the second half of this book.

We have written the book in two parts. In the first part, we discuss conformal
field theories, which give us the complete set of possible perturbative vacuums
of string theory. However, the true vacuums of the theory must necessarily
break supersymmetry and compactify space-time to four dimensions, so in the
second part of this book, we discuss nonperturbative approaches to superstring
theory, in particular string field theory and M-theory.


References 


For introductions to string theory, see Refs. 1 and 2. 

1. M. Kaku, Introduction to Superstrings, Springer-Verlag, Berlin (1989).

2. M. B. Green, J. H. Schwarz, and E. Witten, Superstring Theory, Cambridge
University Press, Vols. 1 and 2, London (1987).

For reviews of the older, dual resonance model, see Refs. 3 to 7.

3. J. H. Schwarz, Phys. Rep. 89, 223 (1982).

4. M. Jacob, ed., Dual Theory, North-Holland, Amsterdam (1974).

5. J. H. Schwarz, ed., Superstrings: The First 15 Years of Superstring Theory, World
Scientific, Singapore (1985).

6. J. Scherk, Rev. Mod. Phys. 47, 1213 (1975).

7. P. Frampton, Dual Resonance Models, Benjamin, New York (1974).

8. Y. Nambu, Lectures at the Copenhagen Summer Symposium (1970).

9. T. Goto, Progr. Theoret. Phys. 46, 1560 (1971).

10. A. M. Polyakov, Phys. Lett. 103B, 207, 211 (1981).

11. J. L. Gervais and B. Sakita, Nucl. Phys. B34, 632 (1971); Phys. Rev. D4, 2291 
(1971); Phys. Rev. Lett. 30, 716 (1973). 

12. S. Fubini, D. Gordon, and G. Veneziano, Phys. Lett. 29B, 679 (1969). 

13. M. A. Virasoro, Phys. Rev. D1, 2933 (1970).

14. P. Goddard and C. B. Thorn, Phys. Lett. 40B, 235 (1972).

15. R. C. Brower and K. A. Friedman, Phys. Rev. D7, 535 (1973).

16. P. Goddard, J. Goldstone, C. Rebbi, and C. B. Thorn, Nucl. Phys. B56, 109
(1973).

17. C. Becchi, A. Rouet, and R. Stora, Ann. Phys. 98, 287 (1976).

18. I. V. Tyutin, Lebedev preprint, FIAN No. 39 (1975), unpublished.

19. L. D. Faddeev and V. N. Popov, Phys. Lett. 25B, 29 (1967).

20. M. Kato and K. Ogawa, Nucl. Phys. B212, 443 (1983).

21. E. S. Fradkin and G. A. Vilkovisky, Phys. Lett. 55B, 224 (1975).

22. G. Veneziano, Nuovo Cimento 57A, 190 (1968). 

23. M. Suzuki, unpublished. 

24. K. Kikkawa, B. Sakita, and M. A. Virasoro, Phys. Rev. 184, 1701 (1969).

25. M. Kaku and L. P. Yu, Phys. Lett. 33B, 166 (1970); Phys. Rev. D3, 2992, 3007,
3020 (1971); M. Kaku and J. Scherk, Phys. Rev. D3, 430 (1971); Phys. Rev. D3,
2000 (1971).

26. C. Lovelace, Phys. Lett. 32B, 703 (1970); Phys. Lett. 34B, 500 (1971).

27. V. Alessandrini, Nuovo Cimento 2A, 321 (1971).

28. C. S. Hsue, B. Sakita, and M. A. Virasoro, Phys. Rev. D2, 2857 (1970).

29. M. A. Virasoro, Phys. Rev. 177, 2309 (1970).

30. J. Shapiro, Phys. Lett. 33B, 361 (1970). 



31. S. Mandelstam, Nucl. Phys. B64, 205 (1973); Nucl. Phys. B69, 77 (1974).

32. M. Kaku and K. Kikkawa, Phys. Rev. D10, 1110, 1823 (1974).

33. A. Neveu and J. H. Schwarz, Nucl. Phys. B31, 86 (1971).

34. P. Ramond, Phys. Rev. D3, 2415 (1971).

35. M. Green and J. H. Schwarz, Phys. Lett. 136B, 367 (1984); Nucl. Phys. B198,
252, 441 (1982).

36. J. L. Gervais and B. Sakita, Nucl. Phys. B34, 632 (1971).

37. L. Brink, P. Di Vecchia, and P. Howe, Phys. Rev. D5, 988 (1972).

38. S. Deser and B. Zumino, Phys. Lett. 65B, 369 (1976).

39. F. Gliozzi, J. Scherk, and D. Olive, Nucl. Phys. B122, 253 (1977).

40. D. Gross, J. A. Harvey, E. Martinec, and R. Rohm, Phys. Rev. Lett. 54, 502
(1985); Nucl. Phys. B256, 253 (1986); B267, 75 (1986).



CHAPTER 2 


BPZ Bootstrap and 
Minimal Models 


2.1 Conformal Symmetry in D Dimensions 

In the original pioneering paper of Belavin, Polyakov, and Zamolodchikov 
(BPZ) [1], two questions were asked. Is conformal invariance by itself sufficiently
restrictive to uniquely determine all Green’s functions of a conformal
field theory? If not, then what additional conditions are necessary before we 
can solve for Green’s functions? 

These questions are not as outlandish as they may seem. For a typical quantum
field theory in higher dimensions, the space-time symmetries of the system
are not strong enough to uniquely determine all Green’s functions. However, 
we know that the case of two dimensions is special: the number of generators 
of the conformal group is infinite. As a result, the restriction of conformal 
invariance creates an infinite number of conserved currents, which are often 
sufficient to solve a two-dimensional quantum field theory [1,2]. 

In general, because of the explosion of conformal field theory solutions 
found within the last few years [3-10], we know that the conformal bootstrap
method requires new constraints that must be imposed, such as modular invariance
(as we will see in Chapter 4), to determine the correlation functions.
There is a class of conformal field theories, called the “minimal series,” for 
which Green’s functions can be computed using only the constraint of unitarity 
and a finite number of primary fields. 

Let us first begin with a general discussion of the conformal group in $D$
dimensions, and then single out why the case $D = 2$ is so special. We define
conformal transformations as those that leave the metric invariant up to a scale
change

$$
g_{\mu\nu}(x) \to g'_{\mu\nu}(x') = \Omega(x)\, g_{\mu\nu}(x).
\tag{2.1.1}
$$




Notice that this transformation preserves angles, that is, the angle

$$
\frac{x^\mu g_{\mu\nu} y^\nu}{\sqrt{x^2 y^2}}
\tag{2.1.2}
$$

between $x^\mu$ and $y^\mu$ is preserved under a conformal transformation. Let us
represent this with an infinitesimal transformation, $x^\mu \to x^\mu + \epsilon^\mu$. Then,
the infinitesimal distance $ds^2$ transforms as

$$
ds^2 \to ds^2 + \left(\partial_\mu \epsilon_\nu + \partial_\nu \epsilon_\mu\right) dx^\mu\, dx^\nu.
\tag{2.1.3}
$$

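Next, we place constraints on $\epsilon^\mu$ so that we can make it compatible with a
scale transformation on the metric. This means that the right-hand side of the
previous equation must be proportional to $\eta_{\mu\nu}$, so that

$$
\partial_\mu \epsilon_\nu + \partial_\nu \epsilon_\mu = \frac{2}{D}(\partial \cdot \epsilon)\, \eta_{\mu\nu}.
\tag{2.1.4}
$$

Let us take the trace of both sides of the equation and then compare it with the
$\Omega$ found in Eq. (2.1.1). We find

$$
\Omega = 1 + (2/D)(\partial \cdot \epsilon)
\tag{2.1.5}
$$

so that

$$
\left[\eta_{\mu\nu}\, \partial_\alpha \partial^\alpha + (D - 2)\, \partial_\mu \partial_\nu\right] \partial \cdot \epsilon = 0.
\tag{2.1.6}
$$

Notice that the constraint on $\epsilon^\mu$ for $D = 2$ is different than for $D > 2$.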

For $D$ greater than 2, let us now tabulate the components of $\epsilon^\mu$ for the
conformal group:

$\epsilon^\mu = a^\mu$ generates constant translations;

$\epsilon^\mu = \omega^\mu{}_\nu x^\nu$ generates Lorentz transformations for antisymmetric $\omega_{\mu\nu}$;

$\epsilon^\mu = \lambda x^\mu$ generates scale transformations; and

$\epsilon^\mu = b^\mu x^2 - 2x^\mu b_\nu x^\nu$ generates proper conformal transformations.

The first two are the familiar transformations of the Poincare group. The 
third is a scale transformation, and the fourth is a combination of an inversion 
and a translation. To see this, we can write the last transformation in a more
transparent way

$$
\frac{x'^\mu}{x'^2} = \frac{x^\mu}{x^2} + b^\mu,
\tag{2.1.7}
$$

that is, proper conformal transformations correspond to an inversion followed
by a translation.
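The four families of generators listed above can be tested against the constraint (2.1.4) numerically. The following is a minimal sketch in $D = 4$ with a diagonal flat metric of signature $(-, +, +, +)$, which is an assumption of the sketch; derivatives are taken by central differences, which are exact up to rounding for fields that are at most quadratic in $x$. All function names and sample parameter values are hypothetical.

```python
D = 4
ETA = [-1.0] + [1.0] * (D - 1)       # diagonal flat metric; the signature is an assumption


def dot(u, v):
    """Minkowski inner product of two vectors with upper indices."""
    return sum(ETA[i] * u[i] * v[i] for i in range(D))


def translation(a):
    return lambda x: list(a)


def lorentz(omega):
    """omega[m][n] is the antisymmetric matrix omega_{mn}, both indices lowered."""
    return lambda x: [ETA[m] * sum(omega[m][n] * x[n] for n in range(D))
                      for m in range(D)]


def dilatation(lam):
    return lambda x: [lam * xi for xi in x]


def special_conformal(b):
    return lambda x: [b[m] * dot(x, x) - 2.0 * x[m] * dot(b, x) for m in range(D)]


def killing_residual(eps, x, h=1e-3):
    """Largest violation of Eq. (2.1.4) at the point x, using central differences."""
    def d(mu, nu):                   # partial_mu eps_nu, with the index nu lowered
        xp, xm = list(x), list(x)
        xp[mu] += h
        xm[mu] -= h
        return ETA[nu] * (eps(xp)[nu] - eps(xm)[nu]) / (2.0 * h)

    grad = [[d(mu, nu) for nu in range(D)] for mu in range(D)]
    div = sum(ETA[mu] * grad[mu][mu] for mu in range(D))       # partial . eps
    return max(abs(grad[mu][nu] + grad[nu][mu]
                   - (2.0 / D) * div * (ETA[mu] if mu == nu else 0.0))
               for mu in range(D) for nu in range(D))


x0 = [0.3, -1.2, 0.7, 2.0]                                    # arbitrary sample point
omega = [[0.1 * (m - n) for n in range(D)] for m in range(D)]  # antisymmetric sample
generators = {
    "translation": translation([1.0, 2.0, 3.0, 4.0]),
    "lorentz": lorentz(omega),
    "scale": dilatation(0.7),
    "special conformal": special_conformal([0.1, 0.2, -0.3, 0.4]),
}
residuals = {name: killing_residual(eps, x0) for name, eps in generators.items()}
print(residuals)
```

Each residual should vanish up to rounding, while a generic quadratic vector field, for example one whose only nonzero component is $(x^1)^2$, fails the test.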

If $D$ is the dimension of space-time and $D > 2$, then the total number of
parameters in the conformal group is equal to $\tfrac{1}{2}(D + 1)(D + 2)$. The conformal
group is thus isomorphic to the orthogonal group on $(D+2) \times (D+2)$ matrices.



For finite, rather than infinitesimal, transformations, we have the following
transformations:

$$
\begin{aligned}
x^\mu \to x'^\mu &= x^\mu + a^\mu, \\
x^\mu \to x'^\mu &= \Lambda^\mu{}_\nu x^\nu, \\
x^\mu \to x'^\mu &= \lambda x^\mu, \\
x^\mu \to x'^\mu &= \frac{x^\mu + b^\mu x^2}{1 + 2b \cdot x + b^2 x^2}.
\end{aligned}
\tag{2.1.8}
$$

($\Lambda^\mu{}_\nu$ parametrizes Lorentz transformations.)
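The statement that a proper conformal transformation is an inversion combined with a translation can be checked on a sample point: inverting, translating by $b^\mu$, and inverting again should reproduce the last line of Eq. (2.1.8). This is a sketch in $D = 4$ with an assumed flat metric of signature $(-, +, +, +)$; the function names and sample values are hypothetical, and the point is chosen so that $x^2$ and the denominator are nonzero.

```python
D = 4
ETA = [-1.0] + [1.0] * (D - 1)       # diagonal flat metric; the signature is an assumption


def dot(u, v):
    """Minkowski inner product of two vectors with upper indices."""
    return sum(ETA[i] * u[i] * v[i] for i in range(D))


def inversion(x):
    """x^mu -> x^mu / x^2 (requires x^2 != 0)."""
    x2 = dot(x, x)
    return [xi / x2 for xi in x]


def special_conformal(x, b):
    """Finite proper conformal transformation, last line of Eq. (2.1.8)."""
    x2 = dot(x, x)
    denom = 1.0 + 2.0 * dot(b, x) + dot(b, b) * x2
    return [(x[m] + b[m] * x2) / denom for m in range(D)]


def inversion_translation_inversion(x, b):
    """Invert, translate by b, then invert again."""
    shifted = [yi + bi for yi, bi in zip(inversion(x), b)]
    return inversion(shifted)


x0 = [0.3, -1.2, 0.7, 2.0]
b0 = [0.1, 0.2, -0.3, 0.4]
direct = special_conformal(x0, b0)
composed = inversion_translation_inversion(x0, b0)
max_diff = max(abs(p - q) for p, q in zip(direct, composed))
print(max_diff)
```

The difference should vanish up to rounding, which is the finite version of the relation between $x'^\mu / x'^2$ and $x^\mu / x^2$.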

For the special case of four dimensions, the conformal group is easily constructed
from the six generators of the Lorentz group, four generators for
translations, four generators for proper conformal transformations, and one
generator for scale transformations, for a total of 15 generators. The conformal
group in four dimensions is therefore

$$
SO(2, 4) \sim SU(2, 2).
\tag{2.1.9}
$$
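The generator count generalizes to any $D > 2$ and can be compared with the dimension of an orthogonal group on $(D+2) \times (D+2)$ matrices. The sketch below is simple bookkeeping; the function names are hypothetical.

```python
def conformal_parameters(dim):
    """Count generators of the conformal group in `dim` > 2 dimensions."""
    translations = dim
    lorentz = dim * (dim - 1) // 2
    scale = 1
    proper_conformal = dim
    return translations + lorentz + scale + proper_conformal


def orthogonal_dimension(n):
    """Number of independent generators of an orthogonal group on n x n matrices."""
    return n * (n - 1) // 2


counts = {dim: conformal_parameters(dim) for dim in range(3, 11)}
print(counts[4])   # the four-dimensional case discussed in the text
```

For every $D$ in the range the count agrees with $\tfrac{1}{2}(D+1)(D+2)$ and with the dimension of the orthogonal group on $(D+2) \times (D+2)$ matrices.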

For the special case of D = 2, we find that the finite number of parameters in 
the conformal group becomes infinite. In this case, the infinitesimal conformal 
transformation becomes 


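$$
\partial_1 \epsilon_1 = \partial_2 \epsilon_2, \qquad \partial_1 \epsilon_2 = -\partial_2 \epsilon_1,
\tag{2.1.10}
$$

that is, we have precisely the Cauchy-Riemann equations for a two-dimensional
transformation. If we define $\epsilon(z) = \epsilon^1 + i\epsilon^2$, $\bar\epsilon(\bar z) = \epsilon^1 - i\epsilon^2$,
$z\,(\bar z) = x^1 + (-)\,ix^2$, then the conformal transformation becomes

$$
z \to z + \epsilon(z), \qquad \bar z \to \bar z + \bar\epsilon(\bar z).
\tag{2.1.11}
$$

If we make the following infinitesimal change given by $\epsilon(z) = -z^{n+1}$ and
$\bar\epsilon(\bar z) = -\bar z^{n+1}$, then it is easy to compute the generators of this transformation

$$
L_n = -z^{n+1}\partial_z, \qquad \bar L_n = -\bar z^{n+1}\partial_{\bar z},
\tag{2.1.12}
$$

which obey the algebraic relations

$$
[L_n, L_m] = (n - m) L_{n+m}, \qquad [\bar L_n, \bar L_m] = (n - m) \bar L_{n+m}.
\tag{2.1.13}
$$

This is called the Witt algebra. When the central term is added, it becomes the
familiar Virasoro algebra [11].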
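The commutation relations (2.1.13) can be verified by letting the differential operators $L_n = -z^{n+1}\partial_z$ act on monomials. The sketch below represents a Laurent polynomial as a dictionary from powers to integer coefficients; this representation and the function names are choices made for the illustration only.

```python
def apply_L(n, poly):
    """Act with L_n = -z**(n+1) d/dz on a Laurent polynomial {power: coefficient}."""
    result = {}
    for power, coeff in poly.items():
        value = -power * coeff                     # L_n z^k = -k z^(k+n)
        if value != 0:
            result[power + n] = result.get(power + n, 0) + value
    return {p: c for p, c in result.items() if c != 0}


def combine(a, b, scale_b):
    """Return a + scale_b * b, dropping vanishing coefficients."""
    out = dict(a)
    for power, coeff in b.items():
        out[power] = out.get(power, 0) + scale_b * coeff
    return {p: c for p, c in out.items() if c != 0}


def commutator(n, m, poly):
    """Return [L_n, L_m] acting on poly."""
    return combine(apply_L(n, apply_L(m, poly)), apply_L(m, apply_L(n, poly)), -1)


def witt_holds(n, m, poly):
    """Check [L_n, L_m] poly == (n - m) L_{n+m} poly."""
    rhs = combine({}, apply_L(n + m, poly), n - m)
    return commutator(n, m, poly) == rhs


all_ok = all(witt_holds(n, m, {k: 1})
             for n in range(-3, 4) for m in range(-3, 4) for k in range(-4, 5))
print(all_ok)
```

Since the operators are linear, checking the relation on monomials $z^k$ is enough for any Laurent polynomial built from the powers tested.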

In two dimensions, there is a qualitative change in the conformal group 
because the number of generators has suddenly become infinite. This means 
that many two-dimensional models are actually exactly soluble, contrary to the 
situation in higher dimensions, because of the presence of an infinite number 
of conserved currents. Enormous simplifications of the correlation functions 
occur only in two dimensions, sufficient to make a wide variety of models 
exactly soluble. 





2.2 Conformal Group in Two Dimensions 

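Let us now discuss how two-dimensional conformal fields transform. We say
that a conformal field has weight $h_1 + h_2$ and conformal spin $h_1 - h_2$ if it
transforms as

$$
\phi(z, \bar z) \to \left(\frac{\partial z'}{\partial z}\right)^{h_1} \left(\frac{\partial \bar z'}{\partial \bar z}\right)^{h_2} \phi(z', \bar z')
\tag{2.2.1}
$$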

under a conformal transformation. 

The full conformal transformation on $\phi(z, \bar z)$ is actually the product of two
copies of the conformal group, acting on each $z$ and $\bar z$ individually. [Since a
function $\phi(z, \bar z)$ transforms under the product of two commuting conformal
algebras, we will often delete the dependence on $\bar z$. It is an easy matter to
reinsert the dependence on $\bar z$ into all the equations.]

If we power expand this infinitesimally, we find

$$
\delta\phi(z) = \epsilon(z)\, \partial_z \phi(z) + h\, \partial_z \epsilon(z)\, \phi(z).
\tag{2.2.2}
$$

A field that transforms in such a manner is called a primary field with conformal
weight or dimension $h$ [1].

Notice that the conformal weight of the product of two fields $\phi_n$ and $\phi_m$ (with
conformal weights $n$ and $m$) is equal to the sum of the conformal weights, given
by $n + m$, that is,

$$
\phi_n \phi_m \sim \phi_{m+n}.
\tag{2.2.3}
$$

Also, notice that if a field has conformal weight 1, then its integral is actually
an invariant

$$
\delta \int dz\, \phi(z) = \int dz\, \partial_z\left[\epsilon(z)\phi(z)\right] = 0.
\tag{2.2.4}
$$

Example: Free Boson Field 

To illustrate these methods, let us study the simplest of all conformal
systems, a single free boson field $\phi$. We begin with the Lagrangian

$$
L = \frac{1}{2\pi}\, \partial\phi\, \bar\partial\phi
\tag{2.2.5}
$$

from which we can naively define the energy-momentum tensor for the free
boson field as

$$
T(z) = -\tfrac{1}{2}\, \partial_z \phi(z)\, \partial_z \phi(z).
\tag{2.2.6}
$$
When we proceed to the quantum theory, however, this previous expression 
has no meaning. There are two fields defined at the same point in space-time, 
which is formally divergent. To make some sense out of this, we therefore must 
define a method by which to subtract the divergent piece, while maintaining 
conformal invariance. 



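We can define the normal ordered product in several ways by subtracting
its divergent piece

$$
\begin{aligned}
T(z) &= -\tfrac{1}{2} :\partial\phi(z)\, \partial\phi(z): \\
&= -\tfrac{1}{2} \lim_{z \to w}\left[\partial\phi(z)\, \partial\phi(w) + \frac{1}{(z - w)^2}\right] \\
&= -\tfrac{1}{2} \lim_{z \to w}\left[\partial\phi(z)\, \partial\phi(w) - \langle \partial\phi(z)\, \partial\phi(w) \rangle\right],
\end{aligned}
\tag{2.2.7}
$$

where

$$
\langle \phi(w)\phi(z) \rangle = -\log(w - z).
\tag{2.2.8}
$$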

Yet another way to define the normal ordered product is to simply reshuffle
the operators contained within the product of two fields so that the creation
(annihilation) operators appear on the left (right), so that the resulting expression
has a zero vacuum expectation value. Because of the problem of potential
divergences of two fields multiplied at the same point, we will adopt the following
conventions in this book. If two fields are multiplied at the same point,
we will tacitly assume that the product is normal ordered. Also, when taking
the correlation function of two fields, we will tacitly assume that they are “radially
ordered,” which is the counterpart of time ordering found in ordinary
point-particle quantum theory. Radial ordering means that the products of all
fields are ordered according to their distance from the origin of the complex
plane.

Let us now calculate the conformal weight of the derivative of a free
boson field

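$$ \begin{aligned}
T(w)\,\partial\phi(z) &\sim -\tfrac12 :\partial\phi(w)\,\partial\phi(w):\,\partial\phi(z) \\
&\sim \left(-\tfrac12\right)(2)\,\partial\phi(w)\,\langle\partial\phi(w)\,\partial\phi(z)\rangle \\
&\sim \partial\phi(w)\,\frac{1}{(w-z)^2} \\
&\sim \left[\partial\phi(z) + (w-z)\,\partial^2\phi(z)\right]\frac{1}{(w-z)^2}.
\end{aligned} \tag{2.2.9} $$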

Comparing with our previous expression for the transformation of a field of
weight $h$, we find that $\partial\phi$ has weight 1 when $\phi$ has weight 0.
Next, let us compute the value of the central term for the free boson field.
The operator product expansion of two energy-momentum tensors, for the scalar
field, is given by

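$$ \begin{aligned}
T(w)T(z) &\sim 2\left(-\tfrac12\right)^2\left(\langle\partial_w\phi\,\partial_z\phi\rangle\right)^2 + 4\left(-\tfrac12\right)^2\partial_w\phi\,\langle\partial_w\phi\,\partial_z\phi\rangle\,\partial_z\phi \\
&\sim \frac12\,\frac{1}{(w-z)^4} + \partial_w\phi\,\frac{-1}{(w-z)^2}\,\partial_z\phi \\
&\sim \frac12\,\frac{1}{(w-z)^4} + \frac{2}{(w-z)^2}\left[-\tfrac12(\partial_z\phi)^2\right] + \cdots.
\end{aligned} \tag{2.2.10} $$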

To understand this expression, let us write the general expression for the 
operator product expansion of the energy-momentum tensor 


$$ T(z)T(w) \sim \frac{c/2}{(z-w)^4} + \frac{2}{(z-w)^2}\,T(w) + \frac{1}{z-w}\,\partial_w T(w) + \cdots, \tag{2.2.11} $$


where c is the central charge of the Virasoro algebra. 

Comparing the two expressions, we see that the second term in the expansion 
shows that T (z) has conformal weight 2 (although it is not a primary field 
because of the presence of the central charge), and that the free boson field has 
c = 1. 

This trivial example is important because the string can be viewed as 26 free 
bosons added together 


$$ T(z) = -\tfrac12\,\partial X_\mu\,\partial X^\mu, \tag{2.2.12} $$


so the central term is just equal to 26. 


Example: Free Fermion Field

The next simplest example is a free fermion field, which has $c = \tfrac12$.
Let us begin with the free fermion action

$$ L = \frac{1}{2\pi}\,\psi(z)\,\bar\partial\psi(z), \tag{2.2.13} $$

which yields the following energy-momentum tensor:

$$ T(z) = \tfrac12 :\psi(z)\,\partial\psi(z):. \tag{2.2.14} $$

The vacuum expectation value of two fermion fields is given by

$$ \langle\psi(w)\psi(z)\rangle = -\frac{1}{w-z}. \tag{2.2.15} $$

We can now repeat all the previous steps, replacing the free boson field by
the fermion field. We find that

$$ T(w)\,\psi(z) \sim \frac{1/2}{(w-z)^2}\,\psi(z) + \cdots, \tag{2.2.16} $$

so the fermion field has conformal weight equal to $\tfrac12$. Furthermore, the
operator product expansion of two energy-momentum fields now yields

$$ T(w)T(z) \sim \frac{1/4}{(w-z)^4} + \cdots, \tag{2.2.17} $$

so the central term for the free fermion field is given by $c = \tfrac12$.





In summary, we found two representations of the conformal group given by 
a free boson and a free fermion with 


$$ \begin{aligned}
\text{free boson:}\quad & c = 1, \quad h = 0, \\
\text{free fermion:}\quad & c = \tfrac12, \quad h = \tfrac12.
\end{aligned} \tag{2.2.18} $$

Last, let us analyze the transformation properties of the energy-momentum
tensor. The generators of conformal transformations are the Virasoro
generators, which in turn are the moments of the energy-momentum tensor. It is
straightforward to calculate the product of $T$ with the conformal field
$\phi_h(z)$ with weight $h$:

$$ T(z)\,\phi_h(w) \sim \frac{h}{(z-w)^2}\,\phi_h(w) + \frac{1}{z-w}\,\partial_w\phi_h(w). \tag{2.2.19} $$


The Virasoro generators [Eq. (1.1.23)] emerge when we take the moments 
of the energy-momentum operator 


$$ L_n = \oint \frac{dz}{2\pi i}\,z^{n+1}\,T(z), \qquad T(z) = \sum_{n=-\infty}^{\infty} z^{-n-2}\,L_n. \tag{2.2.20} $$
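As a check on these normalizations, the following sketch recovers the Virasoro commutator from the operator product expansion of two energy-momentum tensors, using $L_n = \oint \frac{dz}{2\pi i} z^{n+1} T(z)$ and the standard contour-deformation argument (the $z$ contour is shrunk to a small circle around $w$). The central term $\frac{c}{12}m(m^2-1)$ is the conventional one and is assumed here to agree with the normalization of Eq. (1.1.23).

```latex
\begin{aligned}
[L_m, L_n]
 &= \oint \frac{dw}{2\pi i}\, w^{n+1} \oint_{w} \frac{dz}{2\pi i}\, z^{m+1}\, T(z)\,T(w) \\
 &= \oint \frac{dw}{2\pi i}\, w^{n+1}
    \left[ \frac{c}{12}\,(m+1)m(m-1)\, w^{m-2}
         + 2(m+1)\, w^{m}\, T(w)
         + w^{m+1}\, \partial_w T(w) \right] \\
 &= \frac{c}{12}\, m(m^2-1)\, \delta_{m+n,0}
    + 2(m+1)\, L_{m+n} - (m+n+2)\, L_{m+n} \\
 &= (m-n)\, L_{m+n} + \frac{c}{12}\, m(m^2-1)\, \delta_{m+n,0}.
\end{aligned}
```

The second line is the residue at $z = w$ of each singular term in the expansion; the last term of the third line follows from integrating $\partial_w T$ by parts.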


Let us rewrite these equations in perhaps a more familiar form, in terms of
commutators. Let us define the generator of conformal transformations as
$T_\epsilon$:

$$ T_\epsilon = \oint \epsilon(z)\,T(z)\,dz. \tag{2.2.21} $$

Then, we can write the variation of the field $\phi_h(z)$ as a commutator

$$ \begin{aligned}
\delta\phi_h(z) &= [T_\epsilon, \phi_h(z)], \\
[L_m, \phi_h(z)] &= z^{m+1}\,\partial\phi_h(z) + h(m+1)\,z^m\,\phi_h(z),
\end{aligned} \tag{2.2.22} $$

while the variation of the energy-momentum tensor becomes

$$ [T_\epsilon, T(z)] = \epsilon(z)\,T'(z) + 2\epsilon'(z)\,T(z) + \tfrac{1}{12}\,c\,\epsilon'''(z). \tag{2.2.23} $$


The infinitesimal transformation of the energy-momentum tensor can be
integrated, giving us the finite transformation under $z \to f(z)$ as follows:

$$ T(z) \to (\partial f)^2\,T[f(z)] + \frac{c}{12}\,S(f,z), \tag{2.2.24} $$

where the last term is called the Schwarzian derivative

$$ S(f,z) = \frac{\partial f\,\partial^3 f - \tfrac32(\partial^2 f)^2}{(\partial f)^2}. \tag{2.2.25} $$


This expression will be useful when we discuss modular invariance in 
Chapter 4. 
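A quick way to get a feel for the anomalous term is to evaluate it on simple maps. The sketch below assumes the form $S(f,z) = [\partial f\,\partial^3 f - \tfrac32(\partial^2 f)^2]/(\partial f)^2$ and checks, in exact rational arithmetic, that it vanishes for a Möbius map $f(z) = (az+b)/(cz+d)$ but not for $f(z) = z^2$. The function names and the sample numbers are illustrative choices, not taken from the text.

```python
from fractions import Fraction

def schwarzian(d1, d2, d3):
    """Schwarzian derivative S(f, z) from the values of f', f'', f''' at z."""
    return (d1 * d3 - Fraction(3, 2) * d2 ** 2) / d1 ** 2

def mobius_derivatives(a, b, c, d, z):
    """First three derivatives of f(z) = (a z + b) / (c z + d) at the point z."""
    det = a * d - b * c
    w = c * z + d
    return det / w ** 2, -2 * c * det / w ** 3, 6 * c ** 2 * det / w ** 4

z = Fraction(3, 7)

# Moebius map: the anomalous term should vanish identically.
s_mobius = schwarzian(
    *mobius_derivatives(Fraction(2), Fraction(1), Fraction(5), Fraction(3), z)
)

# f(z) = z^2 has f' = 2z, f'' = 2, f''' = 0, which gives S = -3 / (2 z^2).
s_square = schwarzian(2 * z, Fraction(2), Fraction(0))

print(s_mobius, s_square)
```

The vanishing for Möbius maps reflects the fact that the central term does not affect the SL(2) subgroup generated by $L_0$, $L_{\pm 1}$.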





Last, using these techniques, we mention that we can construct the full 
operator product expansion of the superconformal algebra [Eq. (1.3.12)] for 
the NS-R model [see Eq. (1.3.10)]: 


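$$ \begin{aligned}
T_B(w)\,T_B(z) &\sim \frac{3c/4}{(w-z)^4} + \frac{2}{(w-z)^2}\,T_B(z) + \frac{1}{w-z}\,\partial_z T_B(z), \\
T_B(w)\,T_F(z) &\sim \frac{3/2}{(w-z)^2}\,T_F(z) + \frac{1}{w-z}\,\partial_z T_F(z), \\
T_F(w)\,T_F(z) &\sim \frac{c/4}{(w-z)^3} + \frac{1/2}{w-z}\,T_B(z).
\end{aligned} \tag{2.2.26} $$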


2.3 Representations of the Conformal Group 

Now, let us try to classify the representations of the conformal group in much
the same way that we classify the representations of an ordinary Lie group. For
the familiar example of SU(2), we know that representations are constructed
by taking the ladder operator $L_+$ and acting on the eigenstate $|l, -l\rangle$,
in which $m$ (the eigenvalue of $L_z$) has its lowest value. (This state is
often called the “highest weight state.”) In general, the series generated by
all products of these ladder operators creates the universal enveloping algebra
of the system, which in turn contains the various representations of the group

$$ (L_+)^n\,|l, -l\rangle. \tag{2.3.1} $$

Representations of higher groups, such as SU(3), are created in the same
way. Here, there are three sets of ladder operators $U_+$, $V_+$, and $T_+$,
which then act on an eigenstate with the lowest values of the quantum numbers.
Within this series we find the octets, decuplets, etc. Thus, although the
number of ladder operators that hit the highest weight state is unlimited, the
dimension of each representation of SU(3) is finite.

In much the same way, we construct representations of the conformal group,
except for several crucial differences. We choose as our ladder operators the set
$L_{-n}$, where $n$ is positive. The highest weight state is specified by two
quantum numbers, $h$ and $c$, such that

$$ \begin{aligned}
L_0\,|h,c\rangle &= h\,|h,c\rangle, \\
L_n\,|h,c\rangle &= 0, \qquad n = 1, 2, \ldots.
\end{aligned} \tag{2.3.2} $$

The eigenvalue of $L_0$ is called the level number, and $c$ is the central
charge of the Virasoro algebra. Then, the universal enveloping algebra is
created by all products of the ladder operators acting on the highest weight
state [see Eqs. (1.1.25)-(1.1.27)]:

$$ L_{-n_1} L_{-n_2} \cdots L_{-n_k}\,|h,c\rangle. \tag{2.3.3} $$





It is easy to see that the enveloping algebra does, in fact, form a representation
of the algebra. For example, if we hit an element of the enveloping algebra
with $L_{-n}$, then it obviously transforms this element into another element of
the enveloping algebra. Also, if we hit this state with $L_n$, for $n$ positive,
then we can use the commutation relations of the original Virasoro algebra to
shove this operator to the right, until it annihilates on the highest weight
state, thereby giving us a new element of the representation.

The dimension of this collection of products is now infinite, in contrast to 
the Lie algebra case. This representation, constructed out of the enveloping 
algebra, is called a Verma module [13]. 
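To make the size of the module concrete, the sketch below lists the states at each level of a generic Verma module. It assumes the usual ordering convention $n_1 \ge n_2 \ge \cdots \ge n_k \ge 1$ for the labels of $L_{-n_1} \cdots L_{-n_k}|h,c\rangle$, so that the states at level $N$ are in one-to-one correspondence with the partitions of $N$. The function names are illustrative.

```python
def partitions(level, largest=None):
    """Yield the partitions of `level` as non-increasing tuples (n_1 >= n_2 >= ...)."""
    if largest is None:
        largest = level
    if level == 0:
        yield ()
        return
    for first in range(min(level, largest), 0, -1):
        for rest in partitions(level - first, first):
            yield (first,) + rest

def verma_basis(level):
    """Labels (n_1, ..., n_k) of the states L_{-n_1} ... L_{-n_k} |h, c> at a level."""
    return list(partitions(level))

for level in range(5):
    print(level, len(verma_basis(level)), verma_basis(level))
```

The number of states grows without bound as the level increases, which is why the module is infinite dimensional even though each level contains only finitely many states.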

Let us now introduce an operator language for this representation. Let
$\phi_h(z)$ represent a conformal field of weight $h$. Then, let us define the
vacuum state

$$ \{L_0, L_1, L_{-1}\}\,|0\rangle = 0. \tag{2.3.4} $$

The three generators of SL(2, R) given by $\{L_1, L_0, L_{-1}\}$ vanish on this
vacuum. Then, we can show

$$ |h,c\rangle = \phi_h(0)\,|0\rangle, \tag{2.3.5} $$

where $c$ is the central term of the Virasoro algebra. Let us now apply the
Virasoro generators on this state. Let $\phi_h(z)$ be a primary field that
satisfies Eq. (2.2.2). Then, its commutators with $L_n$ are given by
Eq. (2.2.22). By fixing the value of $z$, it is easy to show that

$$ \begin{aligned}
L_n\,\phi_h(0)\,|0\rangle &= L_n\,|h,c\rangle = 0, \qquad n > 0, \\
L_0\,\phi_h(0)\,|0\rangle &= L_0\,|h,c\rangle = h\,|h,c\rangle.
\end{aligned} \tag{2.3.6} $$

In other words, given the fact that $\phi_h$ transforms as a primary field of
weight $h$, the state $|h,c\rangle$ is a highest weight state of weight $h$.

Conformal fields that do not transform as primary fields are called secondary
fields. For example, the derivatives of primary fields, which may have
complicated transformation rules under the conformal group, are usually
secondary fields. To see how secondary fields are constructed from a primary
field, let us define

$$ L_{-k}(z) = \oint \frac{dw}{2\pi i}\,\frac{T(w)}{(w-z)^{k-1}}. \tag{2.3.7} $$

Let us also define

$$ \phi_h^{\{k_1,\ldots,k_n\}}(z) = L_{-k_1}(z)\,L_{-k_2}(z)\cdots L_{-k_n}(z)\,\phi_h(z), \tag{2.3.8} $$

which can be rewritten as

$$ |\{k\}\rangle = \phi_h^{\{k_1,\ldots,k_n\}}(0)\,|0\rangle = L_{-k_1}(0)\,L_{-k_2}(0)\cdots L_{-k_n}(0)\,|h,c\rangle. \tag{2.3.9} $$

We see, therefore, that the fields $\phi_h^{\{k\}}(z)$ are secondary fields that
are descendants of the original primary field $\phi_h(z)$. These secondary
fields are constructed from derivatives of the original primary fields, which
in turn can be composed from Virasoro generators acting on the primary field.


2.4 Fusion Rules and Correlation Functions


In general, there may be an infinite number of primary fields, with each primary
field in turn having an infinite series of descendant secondary fields associated
with it. Let the symbol $[\phi_n]$ represent the conformal family containing the
primary field $\phi_n$ and all its secondary fields created by acting on it with
$L_{-k}$, as in Eq. (2.3.3). Our task is to determine, for a fixed value of $c$,
all possible conformal families $[\phi_n]$ and their correlation functions.

At first, the categorization of all representations of a conformal field theory
seems hopeless. Fortunately, however, these conformal operators must satisfy a
large set of identities that often make it possible to completely solve the
theory.

To see this, let us recall that in ordinary quantum field theory two fields
$\phi_1$ and $\phi_2$ have the following Wilson operator product expansion when
the fields are close to each other

$$ \phi_1(x)\,\phi_2(y) \sim \sum_i C_i(x-y)\,O_i(y), \tag{2.4.1} $$

where the $O_i$'s are a complete set of operators and the $C_i$'s are singular
numerical coefficients.

The same can be said for conformal fields, except that we can place
more constraints on the right-hand side. For example, by equating the scaling
dimension on both sides of the operator product expansion, we can calculate
the singularity structure of the $C_i$'s as follows

$$ C_i \sim \frac{1}{(x-y)^{h_1+h_2-h_i}}, \tag{2.4.2} $$

where the $h$'s are the scaling dimensions of the various fields.

Now, let us take the operator product expansion of the two conformal fields
$\phi_n$ and $\phi_m$, with conformal weights given by $\delta_n$ and
$\delta_m$. We claim that the operator product of the two primary fields is
given by

$$ \phi_n(z)\,\phi_m(w) \sim \sum_k C_{nmk}\,(z-w)^{\delta_k-\delta_n-\delta_m}\,O_k(w) \tag{2.4.3} $$

for some constants $C_{nmk}$. (The power of $z-w$ is easily determined by
examining the dimensions of the left- and right-hand sides of the equation.)

Actually, conformal invariance places even more constraints on the operator
product expansion. We note that the right-hand side can, in turn, be represented
by a complete set of primary and secondary fields, denoted $\phi_p^{\{k\}}$,
where $\{k\}$ indexes the various elements of the Verma module.





Now, let us write the full operator product relation, including all dependence
on the complex variables $z$, $\bar z$:

$$ \phi_n(z,\bar z)\,\phi_m(0) = \sum_p \sum_{\{k\},\{\bar k\}} C^p_{nm}\,\beta^{p\{k\}}_{nm}\,\bar\beta^{p\{\bar k\}}_{nm}\; z^{\delta_p-\delta_n-\delta_m+\sum_i k_i}\;\bar z^{\bar\delta_p-\bar\delta_n-\bar\delta_m+\sum_i \bar k_i}\;\phi_p^{\{k\}\{\bar k\}}(0). \tag{2.4.4} $$

The matrix $C^p_{nm}$ expresses the “Clebsch-Gordan” coefficient found in the
tensor product decomposition of two primary fields, labeled by $n$ and $m$, into
another set of fields labeled by $p$. (Strictly speaking, this is not a
Clebsch-Gordan coefficient in the usual sense because the value of $c$ for all
primary fields is the same. For a normal Clebsch-Gordan coefficient, the values
of the $c$'s are additive as we multiply different representations.)

Conformal invariance is so powerful that we can often determine the precise
numerical values of all the coefficients appearing on the right-hand side of the
equation. The numerical calculation is straightforward, but rather lengthy. We
simply multiply both sides of the equation by $|0\rangle$ and act on both sides
of the equation with $L_k$. When $L_k$ acts on a primary field, it transforms,
as in Eq. (2.2.19). However, when $L_k$ acts on a secondary field, it creates
many other secondary fields as $L_k$ commutes past $L_{-j}$, until it
annihilates on the vacuum. By equating the terms on the left with the terms on
the right, we can find an iterative procedure to calculate all values of
$\beta^{p\{k\}}_{nm}$ and $\bar\beta^{p\{\bar k\}}_{nm}$. Thus, conformal
invariance alone is sometimes sufficient to determine the value of all
$\beta^{p\{k\}}_{nm}$ [1].

We wish to express this rather lengthy equation in shorthand. We simply
write [1]:

$$ [\phi_n] \times [\phi_m] = \sum_k C^k_{nm}\,[\phi_k], \tag{2.4.5} $$

where we suppress the presence of all secondary fields by putting everything
in brackets. We implicitly assume that the infinite series of coefficients that
we have suppressed in the above equation can be numerically calculated by
hitting both sides with a conformal transformation and then equating terms,
order by order.

In principle, if we knew the values of the “structure constants” $C^k_{nm}$, then
we would actually know everything about the representation of the conformal
field theory. Most of the work in solving conformal field theory reduces to
determining, for a fixed value of $c$, the number of conformal families
$[\phi_n]$ and the structure constants $C^k_{nm}$ created by taking products
between them. (Once the $C^k_{nm}$ are known, then conformal invariance alone
will determine the $\beta^{p\{k\}}_{nm}$.)

In general, conformal invariance alone is not powerful enough to determine
the fusion rules among the primary fields. Outside input is required. To see
this, let us first construct the correlation functions between primary fields. For
the two-point function, conformal invariance alone is sufficient to determine
the correlation function up to a constant

$$ \langle\phi_{n_1}(z_1)\,\phi_{n_2}(z_2)\rangle = \frac{C_{12}}{z_{12}^{2\delta}}, \qquad \delta_{n_1} = \delta_{n_2} = \delta. \tag{2.4.6} $$

This is proven by taking the conformal transformation of both sides of the
equation. The transformation under $\epsilon(z) = \text{const}$ forces the
right-hand side to be a function of $z_{12} = z_1 - z_2$, and the transformation
under $\epsilon(z) = z^2$ fixes the exponent in terms of the conformal weights
$\delta_i$, which must be equal.
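The argument can be made explicit in a few lines. The sketch below treats only the holomorphic dependence, assumes the ansatz $G = \langle\phi_1(z_1)\phi_2(z_2)\rangle = C\,z_{12}^{-a}$ once translation invariance has been imposed, and uses the infinitesimal variation $\delta\phi_i = \epsilon\,\partial\phi_i + \delta_i\,(\partial\epsilon)\,\phi_i$ of a primary field of weight $\delta_i$. The intermediate scaling step $\epsilon = z$ is included for completeness.

```latex
\begin{aligned}
\epsilon = 1:&\quad (\partial_1 + \partial_2)\, G = 0
   \;\Rightarrow\; G = G(z_{12}), \\
\epsilon = z:&\quad \left(z_1\partial_1 + z_2\partial_2 + \delta_1 + \delta_2\right) C\, z_{12}^{-a} = 0
   \;\Rightarrow\; a = \delta_1 + \delta_2, \\
\epsilon = z^2:&\quad \left(z_1^2\partial_1 + z_2^2\partial_2 + 2\delta_1 z_1 + 2\delta_2 z_2\right) C\, z_{12}^{-a} = 0
   \;\Rightarrow\; (2\delta_1 - a)\, z_1 + (2\delta_2 - a)\, z_2 = 0 .
\end{aligned}
```

Since the last condition must hold for all $z_1$ and $z_2$, a nonvanishing two-point function requires $\delta_1 = \delta_2 = a/2$.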

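Similarly, using only the transformation rules of the conformal group, we
can determine the correlation function of three primary fields, up to a constant,
in terms of $z_{ij} = z_i - z_j$:

$$ \left\langle\prod_{i=1}^{3}\phi_{n_i}(z_i)\right\rangle = C_{n_1 n_2 n_3}\; z_{12}^{\delta_3-\delta_1-\delta_2}\; z_{13}^{\delta_2-\delta_1-\delta_3}\; z_{23}^{\delta_1-\delta_2-\delta_3}. \tag{2.4.7} $$

Conformal invariance does not fix the value of the structure constant itself,
however.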

For products of four or more fields, the situation gets worse. For example,
for the four-point function, we have

$$ \left\langle\prod_{i=1}^{4}\phi_i(z_i)\right\rangle = f(x)\prod_{i<j} z_{ij}^{-\delta_i-\delta_j+\delta}, \tag{2.4.8} $$

where $\delta = \sum_i \delta_i/3$ and

$$ x = \frac{z_{12}\,z_{34}}{z_{13}\,z_{24}}. \tag{2.4.9} $$

Conformal invariance can reduce the correlation function to functions of $x$,
but it is not sufficient to determine the function $f(x)$.
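The reason the residual freedom is a function of $x$ alone is that the cross-ratio is unchanged by the global conformal maps $z \to (az+b)/(cz+d)$, so no further symmetry is available to constrain it. A minimal check in exact arithmetic, assuming the definition $x = z_{12}z_{34}/(z_{13}z_{24})$ with $z_{ij} = z_i - z_j$; the sample points and map coefficients are arbitrary illustrative choices:

```python
from fractions import Fraction

def cross_ratio(z1, z2, z3, z4):
    """x = z12 z34 / (z13 z24), with z_ij = z_i - z_j."""
    return (z1 - z2) * (z3 - z4) / ((z1 - z3) * (z2 - z4))

def mobius(a, b, c, d):
    """Return the map z -> (a z + b) / (c z + d)."""
    return lambda z: (a * z + b) / (c * z + d)

points = [Fraction(1, 3), Fraction(2), Fraction(-5, 4), Fraction(7)]
f = mobius(Fraction(2), Fraction(1), Fraction(1), Fraction(3))

x_before = cross_ratio(*points)
x_after = cross_ratio(*[f(z) for z in points])
print(x_before, x_after)
```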

However, we can exploit one more condition, that the product of four primary 
fields is associative, that is, we can take the pairwise contraction of primary 
fields in two different ways and get the same answer. Thus, by pairing the four 
primary fields in two different fashions, we have yet another constraint on the 
correlation function. In Fig. 2.1, we see how the fusion rules may be used in 
two different ways, by pairing different sets of primary fields within the same 
correlation function. Since the final answer is independent of the way in which 
the pairing takes place, we have a new restriction on the functions appearing 
in the correlation function. 

For simplicity, let us first fix the values of the $z$'s to be $z_1 = \infty$,
$z_2 = 1$, $z_3 = x$, $z_4 = 0$. Then, the full four-point function, as a
function of both $x$ and $\bar x$, becomes

$$ G^{lk}_{nm}(x,\bar x) \sim \langle\phi_k(z_1,\bar z_1)\,\phi_l(z_2,\bar z_2)\,\phi_n(z_3,\bar z_3)\,\phi_m(z_4,\bar z_4)\rangle = \langle k|\,\phi_l(1,1)\,\phi_n(x,\bar x)\,|m\rangle. \tag{2.4.10} $$

FIGURE 2.1.


Let us now perform the contractions by pairing the primary fields and then
using the fusion rules. For example, by pairing the nth and mth fields, we find

$$ G^{lk}_{nm}(x,\bar x) = \sum_p C^p_{nm}\,C_{klp}\,F^{lk}_{nm}(p|x)\,\bar F^{lk}_{nm}(p|\bar x), \tag{2.4.11} $$

where

$$ F^{lk}_{nm}(p|x) = x^{\delta_p-\delta_n-\delta_m}\sum_{\{k\}}\beta^{p\{k\}}_{nm}\,x^{\sum_i k_i}\,\frac{\langle k|\,\phi_l(1,1)\,L_{-k_1}L_{-k_2}\cdots L_{-k_N}\,|p\rangle}{\langle k|\,\phi_l(1,1)\,|p\rangle}. \tag{2.4.12} $$

The function $F^{lk}_{nm}(p|x)$ is called a conformal block [1], because
higher-order correlation functions can always be written in terms of such
blocks. They are the building blocks by which we can write arbitrary
correlation functions.

Now, the key step is to pair the primary fields in different fashions using the
fusion rules and then compare the results of different sets of pairings. Since
the final result must be the same, we have a new set of identities. Taking the
pairwise contraction in two different ways, we find

$$ \sum_p C^p_{nm}\,C_{klp}\,F^{lk}_{nm}(p|x)\,\bar F^{lk}_{nm}(p|\bar x) = \sum_q C^q_{nl}\,C_{kmq}\,F^{mk}_{nl}(q|1-x)\,\bar F^{mk}_{nl}(q|1-\bar x), \tag{2.4.13} $$

which expresses the fact that the operator product expansion is associative.

This still leaves the problem of how to actually solve for the conformal blocks
$F^{lk}_{nm}(p|x)$, using our knowledge of conformal invariance. One way of
tackling this problem is to insert a $T(z)$ operator into the correlation
function. Because the commutation relation between a $T(z)$ operator and a
primary field gives us back a primary field (without the $T$ insertion), we can
write Ward-like identities for the correlation functions. These identities
relate the correlation function where $T$ is inserted to a differential
operator acting on the correlation function where $T$ has disappeared.

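Let us now insert $T_\epsilon$ into the following correlation function:

$$ \langle T_\epsilon\,\phi_1(w_1)\cdots\phi_n(w_n)\rangle = \sum_{j=1}^{n}\langle\phi_1(w_1)\cdots\delta_\epsilon\phi_j(w_j)\cdots\phi_n(w_n)\rangle. \tag{2.4.14} $$

By inserting the value of the variation of a primary field, we find our final result

$$ \langle T(z)\,\phi_1(w_1)\,\phi_2(w_2)\cdots\phi_n(w_n)\rangle = \sum_{i=1}^{n}\left[\frac{\Delta_i}{(z-w_i)^2} + \frac{1}{z-w_i}\frac{\partial}{\partial w_i}\right]\langle\phi_1(w_1)\cdots\phi_n(w_n)\rangle, \tag{2.4.15} $$

where we have used Eq. (2.2.19):

$$ T(z)\,\phi_i(w_i) \sim \frac{\Delta_i}{(z-w_i)^2}\,\phi_i(w_i) + \frac{1}{z-w_i}\,\partial_{w_i}\phi_i(w_i). \tag{2.4.16} $$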


In general, the differential equations for the conformal blocks are too difficult 
to solve, especially if there are an infinite number of primary fields. 

Unfortunately, this is the case for string theory, which has an infinite number
of primary fields given by the real states $|R\rangle$, which satisfy
Eq. (1.1.25) or Eq. (2.3.6). Reinterpreted from the point of view of conformal
field theory, we see that the spurious states $|S\rangle$ in (1.1.27) form the
secondary states of the Verma module labeled by $|R\rangle$. Thus, the Fock
space of string theory can be decomposed in terms of an infinite family of
Verma modules. Each module consists of a real state $|R\rangle$ (which is a
primary field) and the infinite number of spurious states $|S\rangle$ associated
with each primary state, which are the singular secondary states.

Although the correlation functions for the full string model are too difficult
to solve exactly for all Verma modules, we will be interested in a subclass
of conformal field theories that have only a finite number of primary fields,
which we will show are exactly solvable. Although the conformal field theories
with a finite number of primary fields are not very physical, they will give us a
theoretical laboratory in which to test many of our ideas concerning conformal
field theory.

It can be shown that if there are a finite number of primary fields, then the 
values of h and c take on rational values. These are called rational conformal 
field theories and will be studied in this and the next chapter in connection with 
Kac-Moody algebras. The simplest of these rational conformal field theories 
are called minimal models, which we will now study.


2.5 Minimal Models 

One important question to ask of any representation is whether it is reducible
or not. For ordinary Lie groups, we can construct the scalar product between
the various elements $|a_i\rangle$ of a representation and treat it as a matrix
$\langle a_i|a_j\rangle$. Then, the representation is reducible if the
determinant of this matrix is zero.

Similarly, we can determine whether a Verma module is reducible or not by
taking its elements $|\{n\}\rangle$ in Eq. (2.3.3) and forming the scalar product
between them. Then, the representation is reducible if the determinant of this
matrix is zero, that is, if

$$ \det\langle\{m\}|\{n\}\rangle = 0. \tag{2.5.1} $$


This is not a trivial task, because the number of elements within a Verma module
grows rapidly, as the number of partitions of the level. For example, at the
first level, we only have one element, given by $L_{-1}|h\rangle$, so the matrix
has only one element

$$ \langle h,c|L_1 L_{-1}|h,c\rangle = 2\langle h,c|L_0|h,c\rangle = 2h. \tag{2.5.2} $$


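However, at the second level, we have two members of the Verma module:
$L_{-1}^2|h,c\rangle$ and $L_{-2}|h,c\rangle$. To determine whether they are
truly linearly independent, we must form the $2 \times 2$ matrix of their scalar
products, whose determinant is given by

$$ \det M_2 = \det\begin{pmatrix} \langle h|L_2 L_{-2}|h\rangle & \langle h|L_1^2 L_{-2}|h\rangle \\ \langle h|L_2 L_{-1}^2|h\rangle & \langle h|L_1^2 L_{-1}^2|h\rangle \end{pmatrix} = \det\begin{pmatrix} 4h + c/2 & 6h \\ 6h & 4h(1+2h) \end{pmatrix}. \tag{2.5.3} $$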


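The determinant of this matrix, in turn, can be written in terms of $h$, which
factorizes nicely

$$ \begin{aligned}
\det M_2 &= 2\left(16h^3 - 10h^2 + 2h^2 c + hc\right) \\
&= 32\,[h - h_{1,1}(c)]\,[h - h_{1,2}(c)]\,[h - h_{2,1}(c)],
\end{aligned} \tag{2.5.4} $$

where

$$ \begin{aligned}
h_{1,1}(c) &= 0, \\
h_{1,2}(c),\ h_{2,1}(c) &= \tfrac{1}{16}(5-c) \mp \tfrac{1}{16}\sqrt{(1-c)(25-c)},
\end{aligned} \tag{2.5.5} $$

with the upper sign for $h_{1,2}$ and the lower sign for $h_{2,1}$.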
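The entries of the level-2 matrix can be generated mechanically from the Virasoro algebra rather than computed by hand. The sketch below is one way to do this, not the method used in the text: it evaluates $\langle h|L_{a_1}\cdots L_{a_k}|h\rangle$ by repeatedly reordering neighboring modes with $[L_a, L_b] = (a-b)L_{a+b} + \frac{c}{12}a(a^2-1)\delta_{a+b,0}$ until an annihilation operator reaches the highest weight state. The function names are illustrative, and the sample values $h = 1/16$, $c = 1/2$ are chosen because the determinant should vanish there.

```python
from fractions import Fraction

def vev(word, h, c):
    """<h| L_{a1} L_{a2} ... L_{ak} |h> for a highest weight state of weight h."""
    if not word:
        return Fraction(1)
    if word[-1] > 0 or word[0] < 0:
        return Fraction(0)                 # L_n |h> = 0 and <h| L_{-n} = 0 for n > 0
    if word[-1] == 0:
        return h * vev(word[:-1], h, c)    # L_0 |h> = h |h>
    if word[0] == 0:
        return h * vev(word[1:], h, c)     # <h| L_0 = h <h|
    # Here word[0] > 0 > word[-1], so some neighboring pair (a, b) has a > b.
    # Reorder it: L_a L_b = L_b L_a + (a - b) L_{a+b} + (c/12) a (a^2 - 1) delta.
    i = next(k for k in range(len(word) - 1) if word[k] > word[k + 1])
    a, b = word[i], word[i + 1]
    left, right = word[:i], word[i + 2:]
    total = vev(left + (b, a) + right, h, c)
    total += (a - b) * vev(left + (a + b,) + right, h, c)
    if a + b == 0:
        total += c * a * (a * a - 1) / 12 * vev(left + right, h, c)
    return total

def gram_level2(h, c):
    """Matrix of scalar products in the basis L_{-2}|h>, L_{-1}^2|h>."""
    kets = [(-2,), (-1, -1)]
    bras = [tuple(-n for n in reversed(k)) for k in kets]
    return [[vev(b + k, h, c) for k in kets] for b in bras]

h, c = Fraction(1, 16), Fraction(1, 2)
gram = gram_level2(h, c)
det = gram[0][0] * gram[1][1] - gram[0][1] * gram[1][0]
print(gram, det)
```

The recursion terminates because each reordering either removes one inversion from the word or shortens it. The same `vev` function can be applied to higher levels by enlarging the list of basis states.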

We see that the determinant conveniently factorizes into a product of factors,
which vanishes if $h$ equals one of the $h_{p,q}$. Although it seems like a
hopeless task to generalize this equation to all levels, we now use a remarkable
formula due to Kac [12-15], which states that the determinant at the nth level
is given, up to an overall constant independent of $h$, by

$$ \det M_n = \prod_{k=1}^{n}\psi_k(h,c)^{p(n-k)}, \tag{2.5.6} $$

where

$$ \psi_k(h,c) = \prod_{pq=k}\left(h - h_{p,q}(c)\right), \qquad h_{p,q}(c) = \frac{[(m+1)p - mq]^2 - 1}{4m(m+1)}, \tag{2.5.7} $$

where $p(n)$ is the number of partitions of the integer $n$ (i.e., the number of
ways in which $n$ can be written as a sum of positive integers) and where the
parameter $c$ is related to $m$ via

$$ c = 1 - \frac{6}{m(m+1)}, \tag{2.5.8} $$

where $p$ and $q$ are positive integers. This formula is one of the most important
tools that we have at our disposal for understanding the representations of the
conformal group. We will refer to this formula throughout the first part of this
book.
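As a consistency check, the Kac product at level 2 can be compared with the explicit polynomial $2(16h^3 - 10h^2 + 2h^2c + hc)$ found above. The sketch below assumes the parametrization $c = 1 - 6/[m(m+1)]$ and $h_{p,q} = \{[(m+1)p - mq]^2 - 1\}/[4m(m+1)]$, and takes the level-2 normalization constant to be 32; the function names and the sample values of $m$ and $h$ are illustrative.

```python
from fractions import Fraction

def central_charge(m):
    return 1 - Fraction(6, m * (m + 1))

def h_kac(p, q, m):
    return Fraction(((m + 1) * p - m * q) ** 2 - 1, 4 * m * (m + 1))

def num_partitions(n):
    """p(n): number of ways to write n as a sum of positive integers, p(0) = 1."""
    counts = [1] + [0] * n
    for part in range(1, n + 1):
        for total in range(part, n + 1):
            counts[total] += counts[total - part]
    return counts[n]

def kac_product(n, h, m):
    """prod_{k=1}^{n} psi_k(h, c)^{p(n-k)} with psi_k = prod_{pq=k} (h - h_{p,q})."""
    result = Fraction(1)
    for k in range(1, n + 1):
        psi = Fraction(1)
        for p in range(1, k + 1):
            if k % p == 0:
                psi *= h - h_kac(p, k // p, m)
        result *= psi ** num_partitions(n - k)
    return result

m, h = 4, Fraction(2, 7)
c = central_charge(m)
explicit = 2 * (16 * h**3 - 10 * h**2 + 2 * h**2 * c + h * c)
print(32 * kac_product(2, h, m) == explicit)
```

The exponent $p(n-k)$ counts how many descendants at level $n$ can be built on top of a null state first appearing at level $k$, which is why zeros found at low levels persist at all higher levels.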

Ordinarily, if $h \neq h_{p,q}$, then the Kac determinant does not vanish, and
the representation is irreducible. In general, this case is exceedingly
difficult to analyze. Some of the more interesting cases, however, occur when
the Fock space is reducible, that is, when it contains linear relations between
the various secondary states. We will study these models because many of
them can be solved exactly.

If $h = h_{p,q}$, then the Verma module is reducible, but one can (by suitable
truncation of the secondary states) extract a smaller subspace of the reducible
Verma module that is irreducible. The advantage of this truncation is that,
although the number of primary states $\phi_n$ may in general be infinite, for
these reducible models, the number of primary fields actually becomes finite.

For this reason, these models with a finite number of primary fields are 
solvable and hence provide us insight into the more difficult (and physically 
relevant) case of an infinite number of primary fields. 

If $h = h_{p,q}$ for some $p$ and $q$, then the Kac determinant is zero, the
representation is reducible, and there exists a state $|\chi\rangle$ at the Nth
level whose matrix elements with other states in the module at level $N$ vanish.
Moreover, the matrix elements between states with levels $N$ and $M$
($N \neq M$) vanish automatically (since we must have an equal number of
creation and annihilation operators sandwiched between $\langle 0|$ and
$|0\rangle$). Thus, $|\chi\rangle$ is a null state that has vanishing matrix
elements with all members $\langle\phi|$ of the Verma module, that is,

$$ \langle\phi|\chi\rangle = 0, \qquad \langle\chi|\chi\rangle = 0. \tag{2.5.9} $$

The level of $|\chi\rangle$ is equal to $h_{p,q} + pq$. The important point,
however, is that $|\chi\rangle$ can be a primary field. To see this, notice that
each secondary field $\langle\phi|$ appearing in the above equation consists of
products of $L_k$. We see that $L_k$ therefore annihilates $|\chi\rangle$, that
is,

$$ L_n\,|\chi\rangle = 0 \quad (n > 0), \qquad L_0\,|\chi\rangle = (h_{p,q} + pq)\,|\chi\rangle. \tag{2.5.10} $$

The fact that reducible Verma modules contain secondary null states
$|\chi\rangle$ that are by themselves primary states means that there is a
“smaller” Verma module contained within the larger one with $|\chi\rangle$ as
its highest weight state. This new Verma module, contained within the larger
reducible Verma module, is called $[\phi_{p,q}]$.

(This smaller Verma module, however, may also be reducible. There may
be, in turn, null states within this module. However, as we shall see in Chapter
4, we can systematically extract these null states within $[\phi_{p,q}]$ until
the final result is irreducible.)

Example: Null States

It is instructive to construct some of these null states explicitly. For
example, the secondary state at level 1,

$$ |\chi\rangle = L_{-1}\,|h,c\rangle, \tag{2.5.11} $$

is a null state if $h = 0$, for any value of $c$, because its norm is equal to
$2h$, as in Eq. (2.5.2). By applying $L_n$ on this null state, we see that it is
also a primary state.

[There is another, more transparent way of seeing this. We know that $L_{-1}$
can be represented in $z$ space as the operator $\partial$, so the state in
question is $\partial\phi_h$. This state, for arbitrary $h$, is not a primary
state (because the derivative acts on the Jacobian appearing in Eq. (2.2.1)).
However, if $h = 0$, then there is no Jacobian factor in Eq. (2.2.1), and
$\partial\phi_h$ is a primary field. Thus, for a reducible module, a secondary
field has become a null primary field. This is analogous to the general theory
of relativity, where the derivative of a vector $\partial_\mu\phi_\nu$ is not
a covariant tensor, but the derivative of a scalar $\partial_\mu\phi$ is a
genuine vector.]

At the second level, let us try

$$ |\chi\rangle = \left(L_{-2} + a\,L_{-1}^2\right)|h,c\rangle. \tag{2.5.12} $$

Demanding that this state be annihilated by $L_1$ and $L_2$ fixes the following:
$a = -3/[2(2h+1)]$ and $c = 2h(5-8h)/(2h+1)$.
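For reference, the two conditions can be worked out directly from the Virasoro algebra. The sketch below uses $[L_1, L_{-2}] = 3L_{-1}$, $[L_1, L_{-1}^2] = 2L_0L_{-1} + 2L_{-1}L_0$, $[L_2, L_{-2}] = 4L_0 + c/2$, and $L_2 L_{-1}^2|h,c\rangle = 6h\,|h,c\rangle$, assuming the central term is normalized as $\frac{c}{12}n(n^2-1)$.

```latex
\begin{aligned}
L_1\,(L_{-2} + a L_{-1}^2)\,|h,c\rangle
  &= \left[\,3 + 2a(2h+1)\,\right] L_{-1}|h,c\rangle = 0
  &&\Rightarrow\quad a = -\frac{3}{2(2h+1)}, \\
L_2\,(L_{-2} + a L_{-1}^2)\,|h,c\rangle
  &= \left[\,4h + \tfrac{c}{2} + 6ah\,\right] |h,c\rangle = 0
  &&\Rightarrow\quad c = \frac{2h(5-8h)}{2h+1}.
\end{aligned}
```

For instance, $h = \tfrac12$ gives $a = -\tfrac34$ and $h = \tfrac1{16}$ gives $a = -\tfrac43$, and in both cases $c = \tfrac12$.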

Now, let us analyze the unitarity properties of minimal models. If we are
looking for conformal models with a finite number of fields, then BPZ [1]
found that the minimal models are labeled by two numbers, $m$ and $m'$, which
are relatively prime positive integers, such that

$$ \text{minimal series:}\qquad \begin{aligned}
c &= 1 - \frac{6(m-m')^2}{mm'}, \\
h_{p,q} &= (4mm')^{-1}\left[(pm' - qm)^2 - (m-m')^2\right].
\end{aligned} \tag{2.5.13} $$


The areas of most interest for us, however, are those representations that are
unitary, that is, those representations where the Kac matrix has only positive
eigenvalues. Let us now analyze the unitary representations of the Virasoro
algebra (i.e., representations in which all states have positive norm).
Analyzing the Kac formula, we find that there are three regions of interest. In
the region $c > 1$, $h > 0$, the Kac determinant has no zeros at all and hence
the representations are irreducible. Indeed, for the region $1 < c < 25$, $m$ is
not real, and the $h_{p,q}$'s have an imaginary part or, for $p = q$, are
negative. For $c > 25$, we can choose the branch $-1 < m < 0$, and all
$h_{p,q}$'s are negative.

We can also show that the representations are unitary in this region. For very
large $h$, the diagonal elements of the Kac matrix dominate the matrix, and
they are all positive. Thus, the matrix has positive eigenvalues for large $h$.
But, since the determinant never vanishes for $c > 1$, $h > 0$, all of the
eigenvalues must stay positive in this region, and hence the representation is
unitary. For $c = 1$, the determinant vanishes at $h = n^2/4$ and never becomes
negative. So, there is no obstacle to being unitary.

For $0 < c < 1$, $h > 0$, the situation is rather delicate. Naively, we can show
that this region is nonunitary. Let us draw the curves formed by
$h = h_{p,q}(c)$ in the $h,c$-plane. For each set of integers $p, q$, we have a
curve in the $h,c$-plane. Then, one can show, by graphical methods, that any
point in the region $0 < c < 1$, $h > 0$, can be connected to the $c > 1$ region
by a path that crosses a single curve of the Kac determinant. This shows that
the determinant reverses sign passing through the curve, which proves the
existence of negative norm states. Hence, unitary representations are excluded
from this region.

There is, however, a loophole in this demonstration: the argument does not apply
to points lying on the curves themselves, where the determinant vanishes. We
know where the determinant vanishes, and that is given by Eqs. (2.5.6)-(2.5.8).
We find that the representations are unitary for the following discrete set of
values [16]:

$$ \text{unitary series:}\qquad c = 1 - \frac{6}{m(m+1)}, \qquad m = 3, 4, \ldots. \tag{2.5.14} $$

(If we set $m' = m + 1$ in the minimal series in Eq. (2.5.13), we obtain
the unitary series.) Notice that $h_{p,q}$ has certain symmetries that enable us
to establish some of the structure of the representation space. In particular,
notice that it is symmetric under

$$ p \to m - p, \qquad q \to m + 1 - q. \tag{2.5.15} $$

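Thus, if we allow $p$ to range over $1 \le p \le m-1$ and $q$ to range over
$1 \le q \le m$, then there are a total of $m(m-1)$ values of $h_{p,q}$, each
appearing twice. It is sometimes convenient to display the allowed values of
$h_{p,q}$ in a grid. We choose $p$ to label the horizontal axis (increasing from
left to right) and $q$ to label the vertical axis (increasing from bottom to
top). Then, the allowed values of $h_{p,q}$ for $m = 3, 4, 5$ are

$$ \begin{pmatrix} \tfrac12 & 0 \\ \tfrac1{16} & \tfrac1{16} \\ 0 & \tfrac12 \end{pmatrix}, \qquad
\begin{pmatrix} \tfrac32 & \tfrac7{16} & 0 \\ \tfrac35 & \tfrac3{80} & \tfrac1{10} \\ \tfrac1{10} & \tfrac3{80} & \tfrac35 \\ 0 & \tfrac7{16} & \tfrac32 \end{pmatrix}, \qquad
\begin{pmatrix} 3 & \tfrac75 & \tfrac25 & 0 \\ \tfrac{13}8 & \tfrac{21}{40} & \tfrac1{40} & \tfrac18 \\ \tfrac23 & \tfrac1{15} & \tfrac1{15} & \tfrac23 \\ \tfrac18 & \tfrac1{40} & \tfrac{21}{40} & \tfrac{13}8 \\ 0 & \tfrac25 & \tfrac75 & 3 \end{pmatrix}. \tag{2.5.16} $$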
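These grids follow directly from the Kac formula for the unitary series. The sketch below tabulates them, assuming $h_{p,q} = \{[(m+1)p - mq]^2 - 1\}/[4m(m+1)]$ with $1 \le p \le m-1$ and $1 \le q \le m$, and lays out each grid with $q$ increasing from bottom to top. The function names are illustrative.

```python
from fractions import Fraction

def h_kac(p, q, m):
    return Fraction(((m + 1) * p - m * q) ** 2 - 1, 4 * m * (m + 1))

def kac_grid(m):
    """Rows run from q = m (top) down to q = 1; columns from p = 1 to p = m - 1."""
    return [[h_kac(p, q, m) for p in range(1, m)] for q in range(m, 0, -1)]

def distinct_weights(m):
    return sorted({h for row in kac_grid(m) for h in row})

for m in (3, 4, 5):
    for row in kac_grid(m):
        print("  ".join(str(h) for h in row))
    print(distinct_weights(m))
```

Because of the symmetry $p \to m - p$, $q \to m + 1 - q$, each grid is invariant under a rotation by 180 degrees, so only $m(m-1)/2$ of its entries are distinct.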





Example: The Ising Model 

The case m = 3 is most interesting because it will correspond to the critical 
point of the two-dimensional Ising model, which has been extensively studied 
in the field of statistical mechanics. At the critical point of the Ising model, 
the correlation lengths become infinite, and the theory loses all reference to 
any scale, that is, it becomes scale invariant. Thus, we expect to find the Ising 
model at the critical point among the list of various conformal models. 

We recall that we can write down null states at the second level for various
values of $h$ and $c$. In particular, for the Ising model, we have three primary
fields, with conformal weights $0$, $\tfrac1{16}$, and $\tfrac12$.

If we have left-right symmetric fields, then the field
$(\tfrac1{16}, \tfrac1{16})$ is called the “order parameter” $\sigma$, and the
field $(\tfrac12, \tfrac12)$ is called the “energy operator” $\epsilon$.
(This will be discussed further in Chapter 6.)

In conformal field theory language, the energy operator can be written as
two free fermion fields $\psi\bar\psi$. The order parameter, however, cannot be
written in terms of the fermion field $\psi$, since we cannot construct a field
with weight $\tfrac1{16}$ starting with a field with weight $\tfrac12$. (In the
next chapter, we will show that the $\sigma$ field can be written as a spin
field $S$ using the mechanism of bosonization.) In addition to $\sigma$, there
is also the “disorder” parameter $\mu$ in the Ising model at criticality. The
disorder parameter has the same conformal weight as $\sigma$ (but has different
product ordering with $\psi$).

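These fields, in turn, allow us to construct null states for the $m = 3$ minimal
series. From Eqs. (2.5.12) and (2.5.13), we have

$$ \begin{aligned}
&\left(L_{-2} - \tfrac34 L_{-1}^2\right)|h = \tfrac12\rangle, \\
&\left(L_{-2} - \tfrac43 L_{-1}^2\right)|h = \tfrac1{16}\rangle.
\end{aligned} \tag{2.5.17} $$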


If we insert these null states into any matrix element, then it is sure to 
vanish because these states have zero matrix elements with all elements of the 
Verma module. Thus, given the fact that Virasoro operators can be converted 
into partial derivatives, we can derive differential equations that are satisfied 
by the correlation functions. These differential equations, in turn, allow us to 
completely calculate many of the lower-order correlation functions. 

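Let us define $G^{(2M,2N)}$ to be

$$ \langle\sigma(z_1,\bar z_1)\cdots\sigma(z_{2M},\bar z_{2M})\,\mu(z_{2M+1},\bar z_{2M+1})\cdots\mu(z_{2M+2N},\bar z_{2M+2N})\rangle. \tag{2.5.18} $$

Because of the differential equation satisfied by correlation functions with
null states, we find the following differential equation for $G^{(2M,2N)}$:

$$ \left[\frac43\,\frac{\partial^2}{\partial z_i^2} - \sum_{j\neq i}^{2M+2N}\left(\frac{1/16}{(z_i-z_j)^2} + \frac{1}{z_i-z_j}\,\frac{\partial}{\partial z_j}\right)\right]G^{(2M,2N)} = 0. \tag{2.5.19} $$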





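Let us now consider the case when $M = 2$ and $N = 0$. Then, the differential
equation simplifies to the following equation:

$$ \left[x(1-x)\,\frac{\partial^2}{\partial x^2} + \left(\tfrac12 - x\right)\frac{\partial}{\partial x} + \frac1{16}\right]f(x,\bar x) = 0, \tag{2.5.20} $$

where

$$ G^{(4,0)}(z_i) = \left[(z_1-z_3)(z_2-z_4)(\bar z_1-\bar z_3)(\bar z_2-\bar z_4)\right]^{-1/8}\,Y(x,\bar x) \tag{2.5.21} $$

and

$$ x = \frac{(z_1-z_2)(z_3-z_4)}{(z_1-z_3)(z_2-z_4)} \quad\text{and}\quad Y = \left[x\bar x(1-x)(1-\bar x)\right]^{-1/8} f(x,\bar x). \tag{2.5.22} $$

There are two independent solutions to this equation given by

$$ f_\pm(x) = \left(1 \pm \sqrt{1-x}\right)^{1/2}. \tag{2.5.23} $$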


The complete solution for Y thus contains four possibilities $f_\pm(x) f_\pm(\bar x)$.
Therefore, the method of null states is quite powerful, often giving us an
explicit solution to the correlation functions.
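
As a quick numerical cross-check (a sketch, not part of the original derivation), one can verify that the two functions of Eq. (2.5.23) are annihilated by the operator $x(1-x)\partial_x^2 + (\frac{1}{2} - x)\partial_x + \frac{1}{16}$. This form of the operator is the standard BPZ one and is assumed here; the names `f_pm` and `residual` and the finite-difference step are illustrative choices only.

```python
import math

def f_pm(x, sign):
    """f_pm(x) = (1 +/- sqrt(1 - x))**(1/2), for 0 < x < 1 (Eq. 2.5.23)."""
    return math.sqrt(1 + sign * math.sqrt(1 - x))

def residual(x, sign, step=1e-4):
    """Apply the assumed operator of Eq. (2.5.20) to f_pm,
    using central finite differences for the derivatives."""
    f0 = f_pm(x, sign)
    f_right = f_pm(x + step, sign)
    f_left = f_pm(x - step, sign)
    first = (f_right - f_left) / (2 * step)
    second = (f_right - 2 * f0 + f_left) / step**2
    return x * (1 - x) * second + (0.5 - x) * first + f0 / 16

# The residual should vanish up to finite-difference error.
for x in (0.2, 0.5, 0.8):
    for sign in (+1, -1):
        print(x, sign, abs(residual(x, sign)))
```

The residuals are limited only by the finite-difference error, which is consistent with both $f_+$ and $f_-$ solving the equation.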

Notice that most of our discussion in this chapter has been quite general and 
often did not depend in any way on the particular model being studied. Thus, 
any two-dimensional theory, which becomes conformally invariant, must have 
representations given by the above analysis for various values of h and c. 
In particular, the minimal conformal field theories can be shown to describe 
certain integrable (solvable) models found in statistical mechanics, such as 
the Ising model. Because a second-order phase transition in an integrable model
corresponds to an infinite correlation length, the theory loses all references 
to any scale, that is, we have a scale invariant theory at the critical point. 
By comparing the critical exponents of various statistical models, we find the 
following one-to-one correspondence between minimal models and various 
integrable statistical mechanical models at criticality [17, 18]: 

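$$
\begin{aligned}
m = 3 &\;\to\; \text{Ising model},\\
m = 4 &\;\to\; \text{Tricritical Ising model},\\
m = 5 &\;\to\; \text{3-state Potts model},\\
m = 6 &\;\to\; \text{Tricritical 3-state Potts model},\\
m\ \text{arbitrary} &\;\to\; \text{RSOS models}.
\end{aligned}
\tag{2.5.24}
$$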

This relationship between statistical mechanical models at criticality and 
conformal field theory will be explored more fully in Chapter 6. 




2.6 Fusion Rules for Minimal Models 


We now wish to use the various identities that we have established to calculate 
the fusion rules for the minimal models, which determine the operator product 
expansion of all the primary fields. 

Our first goal is to find the fusion rules for the product of a minimal field
$\phi_{1,2}$ and some arbitrary primary field $\phi_\Delta$ with weight $\Delta$:

$$
\phi_{1,2}(z)\,\phi_\Delta(w) = \text{const}\,(z - w)^{\kappa}
\left[\phi_{\Delta'}(w) + (z - w)\,\phi^{(-1)}_{\Delta'}(w) + \cdots\right].
\tag{2.6.1}
$$

Our task is first to determine the possible values for $\Delta'$ on the right-hand
side of the equation and then to generalize both $\phi_{1,2}$ and $\phi_\Delta$ to become
arbitrary minimal primary fields.

In general, it is impossible to determine the values for $\Delta'$ on the right-hand
side of the equation for an arbitrary conformal field theory without more
information. We need extra input, which will be that the fields are primary
fields for the minimal model.

Our strategy is to take the matrix element between a null field $\chi$ and a product
of ordinary fields $\phi_i$. Because the resulting correlation function is equal to zero
and because the null field $\chi$ can be decomposed in terms of Virasoro operators,
we then arrive at a differential equation involving the correlation function.

The key assumption that we will use is that the primary field $\phi_{1,2}$ is a null
field and can be written explicitly, as in Eq. (2.5.12), where the values of the
coefficient a and c are given in Eq. (2.5.13). Because the $L_n$ operators can all
be written in terms of differential operators, we find that this null field, via
(2.5.12), can be written as

$$
\chi_{\delta+2} = \phi_\delta^{(-2)} + \frac{3}{2(2\delta+1)}\frac{\partial^2}{\partial z^2}\phi_\delta.
\tag{2.6.2}
$$

$\chi_{\delta+2}$ has conformal weight $\delta + 2$, where $\delta$ can be solved by inserting
Eq. (2.5.8) into Eq. (2.5.13):

$$
\delta = \tfrac{1}{16}\left[5 - c \pm \sqrt{(c-1)(c-25)}\right].
\tag{2.6.3}
$$


The term $\phi_\delta^{(-2)}$ contains the operator $L_{-2}$, which in turn can be written
as a differential operator. Anytime a secondary or descendant field enters a
correlation function, we can extract the energy-momentum tensor and hence
write the correlation function as a differential equation

$$
\langle \phi_n^{(-k_1,-k_2,\ldots,-k_M)}(z)\,\phi_1(z_1)\cdots\phi_N(z_N)\rangle
= L_{-k_M}(z,z_i)\,L_{-k_{M-1}}(z,z_i)\cdots L_{-k_1}(z,z_i)\,
\langle \phi_n(z)\,\phi_1(z_1)\cdots\phi_N(z_N)\rangle,
\tag{2.6.4}
$$

where

$$
L_{-k}(z,z_i) = \sum_{i=1}^{N}\left[\frac{(1-k)\Delta_i}{(z - z_i)^{k}}
- \frac{1}{(z - z_i)^{k-1}}\frac{\partial}{\partial z_i}\right].
\tag{2.6.5}
$$






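If we take the matrix element of this null state and a product of several
fields $\phi_i(z_i)$, then the result must be zero. However, by writing the null vector
in terms of Virasoro operators and then converting Virasoro operators into
differential operators, we find the following differential equation:

$$
\left[\frac{3}{2(2\delta+1)}\frac{\partial^2}{\partial z^2}
- \sum_{i=1}^{N}\frac{\Delta_i}{(z - z_i)^2}
- \sum_{i=1}^{N}\frac{1}{z - z_i}\frac{\partial}{\partial z_i}\right]
\langle \phi_{1,2}(z)\,\phi_1(z_1)\cdots\phi_N(z_N)\rangle = 0.
\tag{2.6.6}
$$

Take the most singular term as $z \to z_1$. Using Eqs. (2.6.1) and (2.6.6), we
have

$$
\frac{3\kappa(\kappa-1)}{2(2\delta+1)} - \Delta + \kappa = 0, \qquad
\kappa = \Delta' - \Delta - \delta.
\tag{2.6.7}
$$

Solving, we find two solutions

$$
\begin{aligned}
\Delta'_{(1)} &= \Delta_0 + \tfrac{1}{4}(\alpha + \alpha_\pm)^2 = \delta(\alpha + \alpha_\pm),\\
\Delta'_{(2)} &= \Delta_0 + \tfrac{1}{4}(\alpha - \alpha_\pm)^2 = \delta(\alpha - \alpha_\pm),
\end{aligned}
\tag{2.6.8}
$$

where

$$
\Delta_0 = \frac{c-1}{24}, \qquad
\alpha_\pm = \frac{\sqrt{1-c} \pm \sqrt{25-c}}{\sqrt{24}},
\tag{2.6.9}
$$

and $\delta(\alpha) = \Delta_0 + \frac{1}{4}\alpha^2$. Here the weight $\Delta$ of the field
$\phi_\Delta$ has been parametrized as $\Delta = \delta(\alpha)$, so that the field may
equally be labeled $\phi_\alpha$. This is the equation that we want. We now have the
possible values of $\Delta'$ that appear on the right-hand side of Eq. (2.6.1).

Thus, the fusion rules give us

$$
\phi_{1,2}\,\phi_\alpha = [\phi_{\alpha - \alpha_+}] + [\phi_{\alpha + \alpha_+}],
\tag{2.6.10}
$$

where the conformal fields in the Verma modules on the right-hand side have
conformal weight $\delta(\alpha \pm \alpha_+)$.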
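
The statement that the shifted weights solve the indicial equation (2.6.7) can be checked numerically. The sketch below assumes the parametrization $\Delta = \Delta_0 + \frac{1}{4}\alpha^2$ and pairs the upper (lower) sign of Eq. (2.6.3) with $\alpha_+$ ($\alpha_-$); the function names are hypothetical and chosen only for this illustration.

```python
import math

def parameters(c):
    """Delta_0, alpha_+ and alpha_- of Eq. (2.6.9), for c < 1."""
    delta0 = (c - 1) / 24
    root_a, root_b = math.sqrt(1 - c), math.sqrt(25 - c)
    alpha_plus = (root_a + root_b) / math.sqrt(24)
    alpha_minus = (root_a - root_b) / math.sqrt(24)
    return delta0, alpha_plus, alpha_minus

def weight(alpha, delta0):
    """delta(alpha) = Delta_0 + alpha**2 / 4."""
    return delta0 + alpha**2 / 4

def indicial_residual(c, alpha, shift_sign, branch):
    """Left-hand side of Eq. (2.6.7) for Delta' = delta(alpha +/- alpha_branch)."""
    delta0, a_plus, a_minus = parameters(c)
    sign = 1 if branch == "+" else -1
    a_branch = a_plus if branch == "+" else a_minus
    # Weight of the degenerate field, Eq. (2.6.3), with the matching sign.
    small_delta = (5 - c + sign * math.sqrt((c - 1) * (c - 25))) / 16
    old_weight = weight(alpha, delta0)
    new_weight = weight(alpha + shift_sign * a_branch, delta0)
    kappa = new_weight - old_weight - small_delta
    return (3 * kappa * (kappa - 1) / (2 * (2 * small_delta + 1))
            - old_weight + kappa)

# Ising value c = 1/2: the combinations below give the weights 0, 1/16 and 1/2.
d0, ap, am = parameters(0.5)
print(weight(ap + am, d0), weight(ap + 2 * am, d0), weight(2 * ap + am, d0))
print(indicial_residual(0.5, 0.3, +1, "+"))
```

For $c = \frac{1}{2}$ the three printed weights reproduce the Ising values quoted earlier in the chapter, and the residual vanishes to rounding accuracy for any choice of $\alpha$.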

Now, let us gradually generalize the left-hand side of Eq. (2.6.1). If we
replace $\phi_{1,2}$ with $\phi_{n,m}$, then the fusion rules can also be calculated using
the same techniques, that is, replace $\phi_{n,m}$ with a null field, take its matrix
element with a product of fields, and then rewrite the Virasoro generators as
differential operators. Then, the fusion rules give

$$
[\phi_{n,m}] \times [\phi_\alpha] = \sum_{l=1-m}^{-1+m}\ \sum_{k=1-n}^{-1+n}
[\phi_{\alpha + l\alpha_- + k\alpha_+}],
\tag{2.6.11}
$$

where the indices l and k advance in steps of two and the highest weight fields
on the right-hand side have conformal weight $\delta(\alpha + l\alpha_- + k\alpha_+)$.

It is now a straightforward process to generalize $\phi_\alpha$ to a minimal field, which
would then give us the fusion rules for all minimal fields. After a bit of work,
we find that all the fusion rules for the minimal fields are given by [1]:

$$
[\phi_{p_1,q_1}] \times [\phi_{p_2,q_2}] =
\sum_{p_3 = |p_1 - p_2| + 1}^{\min[p_1 + p_2 - 1,\ 2m - 1 - (p_1 + p_2)]}\
\sum_{q_3 = |q_1 - q_2| + 1}^{\min[q_1 + q_2 - 1,\ 2m + 1 - (q_1 + q_2)]}
[\phi_{p_3,q_3}].
\tag{2.6.12}
$$
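
The double sum of Eq. (2.6.12) is easy to enumerate by machine. The following is a minimal sketch, assuming (as in the standard BPZ result) that both sums advance in steps of two; the function name `fusion` and the labeling of the Ising fields as $\sigma = (1,2)$ and $\epsilon = (1,3)$ are choices made for this example.

```python
def fusion(m, field_1, field_2):
    """Labels (p3, q3) appearing in [phi_{p1,q1}] x [phi_{p2,q2}], Eq. (2.6.12).

    Assumption: p3 and q3 advance in steps of two.
    """
    (p1, q1), (p2, q2) = field_1, field_2
    p_max = min(p1 + p2 - 1, 2 * m - 1 - (p1 + p2))
    q_max = min(q1 + q2 - 1, 2 * m + 1 - (q1 + q2))
    return [(p3, q3)
            for p3 in range(abs(p1 - p2) + 1, p_max + 1, 2)
            for q3 in range(abs(q1 - q2) + 1, q_max + 1, 2)]

# Ising model (m = 3): identity = (1, 1), sigma = (1, 2), epsilon = (1, 3).
sigma, epsilon = (1, 2), (1, 3)
print(fusion(3, sigma, sigma))      # [(1, 1), (1, 3)]
print(fusion(3, sigma, epsilon))    # [(1, 2)]
print(fusion(3, epsilon, epsilon))  # [(1, 1)]
```

With these labels the output reads $\sigma \times \sigma = 1 + \epsilon$, $\sigma \times \epsilon = \sigma$, and $\epsilon \times \epsilon = 1$, the familiar Ising fusion rules.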

2.7 Superconformal Minimal Series 

Now that we have examined the conformal properties of the minimal models, 
let us make a few remarks about the superconformal generalization of the 
minimal models. 

The calculation of the superconformal minimal series proceeds in much 
the same way as in the conformal minimal series, so let us quickly review 
how we obtained them. First, we construct the Verma modules, created by the 
action of the generators of the algebra on some vacuum state. Then, we contract 
these Verma modules, creating the Kac determinant in Eqs. (2.5.6)-(2.5.8). For 
certain values of h and c, the Kac determinant does not vanish, and then we 
have an irreducible representation of the algebra. However, there are usually 
an infinite number of primary fields associated with this representation, so we 
are more interested in the values of the Kac determinant that do vanish. 

When the determinant vanishes, there is a zero norm state, which in turn can 
be used as the primary field of its own module. However, this module, in turn, 
contains zero norm states as well. After all extraneous null states and their 
secondaries are extracted, we find an irreducible module $[\phi_{p,q}]$. The fusion
rules close on a finite number of such primary fields. This allows us to extract 
a finite number of primary fields, giving us the minimal series. 

To analyze the superconformal series, we will find it convenient to introduce
the Grassmann variable $\theta$ so we can combine the bosonic energy-momentum
tensor with the fermionic superconformal current into one superfield:

$$
T(z,\theta) = T_F(z) + \theta\,T_B(z)
= \sum_n\left(z^{-n-3/2}\,T_{F,n} + \theta\,z^{-n-2}\,T_{B,n}\right),
\tag{2.7.1}
$$

where $T_B$ is the usual bosonic current, with conformal weight 2, and $T_F$ is the
superconformal current, with conformal weight $\frac{3}{2}$.

Then, the superconformal algebra [Eq. (2.2.26)] can be deduced from just
one operator expansion

$$
T(z_1,\theta_1)\,T(z_2,\theta_2) \sim \tfrac{1}{4}\hat c\,z_{12}^{-3}
+ \left(\tfrac{3}{2}\theta_{12}z_{12}^{-2} + \tfrac{1}{2}z_{12}^{-1}D_2
+ \theta_{12}z_{12}^{-1}\partial_2\right)T(z_2,\theta_2),
\tag{2.7.2}
$$

where:

$$
z_{12} = z_1 - z_2 - \theta_1\theta_2, \qquad
\theta_{12} = \theta_1 - \theta_2, \qquad
D = \partial_\theta + \theta\,\partial_z.
\tag{2.7.3}
$$

The central charge of the Virasoro algebra is now normalized to $c = 3\hat c/2$. A
free scalar superfield now consists of a scalar field with c = 1 and a Majorana
fermion with $c = \frac{1}{2}$, combined together in a superfield with $\hat c = 1$.





A conformal superfield $\Phi(z,\theta)$ transforms as

$$
T(z_1,\theta_1)\,\Phi(z_2,\theta_2) \sim h\,\theta_{12}z_{12}^{-2}\,\Phi
+ \tfrac{1}{2}z_{12}^{-1}D_2\Phi + \theta_{12}z_{12}^{-1}\partial_2\Phi,
\tag{2.7.4}
$$

which generalizes Eq. (2.2.19).

This can be used to generate the commutation relation

$$
[T_v, \Phi(z,\theta)] = v\,\partial\Phi + \tfrac{1}{2}(Dv)\,D\Phi + h(\partial v)\,\Phi.
\tag{2.7.5}
$$

The key to the construction of the minimal series is the Kac determinant
formula. Let us construct Verma modules for the NS sector, whose elements
are given by ladder operators $G_{-n}$ acting on the highest weight vacuum state:

$$
G_{-n_1}G_{-n_2}\cdots G_{-n_N}|h,c\rangle.
\tag{2.7.6}
$$

Notice that we do not have to add the usual Virasoro generators,
since the superconformal generator $G_{-n}$ is the “square root” of the Virasoro
generator, as can be seen from the anticommutation relations.

To construct the vacuum for the NS-R theory, let us define the state $|0\rangle$,
which is annihilated by all five generators: $L_{-1}$, $L_0$, $L_1$, $G_{1/2}$, and $G_{-1/2}$,
that is, it is invariant under the action of the group Osp(2|1). Then, a highest
weight state with conformal weight h can be constructed from a field $\Phi_h(z,\theta)$
as follows:

$$
|h\rangle = \Phi_h(0,0)\,|0\rangle.
\tag{2.7.7}
$$

Notice that the highest weight vacuum $|h\rangle$ is annihilated by all generators
with positive indices.

Let us first analyze the Neveu-Schwarz sector. The determinant of this
matrix, at level n, is given by [13,19-21]:

$$
\det(M_n) = \prod_{p,q}\left[h - h_{p,q}\right]^{P_{NS}(n - pq/2)},
\tag{2.7.8}
$$

where the product is over positive p, q subject to the constraint that $pq/2 \le n$
and p - q is even; $P_{NS}$ is given indirectly by taking the coefficients of the
following power expansion:

$$
\sum_{k=0}^{\infty} t^k P_{NS}(k) = \prod_{k=1}^{\infty}\frac{1 + t^{k-1/2}}{1 - t^k}.
\tag{2.7.9}
$$

To define $h_{p,q}$, let us first introduce m, defined by

$$
\hat c(m) = 1 - \frac{8}{m(m+2)}.
\tag{2.7.10}
$$

Then, we define $h_{p,q}$ as follows:

$$
h_{p,q} = \frac{[(m+2)p - mq]^2 - 4}{8m(m+2)} + \frac{1}{32}\left[1 - (-1)^{p-q}\right].
\tag{2.7.11}
$$





Let us analyze this formula in the same way we analyzed the conformal 
determinant. For c > 1 and h > 0, all representations are unitary. For c < 
1, however, we have the possibility that there is a discrete series of unitary 
representations. 

The minimal unitary representations of the conformal algebras with c < 1 
are given by 

$$
\begin{aligned}
\hat c &= \hat c(m), \qquad m = 2, 3, 4, \ldots,\\
h &= h_{p,q}(m), \qquad 1 \le p < m, \quad 1 \le q < m + 2.
\end{aligned}
\tag{2.7.12}
$$

The determinant formula for the Ramond sector is a bit more delicate. The
vacuum $|0\rangle$, we saw, belonged to the NS sector of the theory. To create a
fermionic vacuum, we need to multiply the bosonic vacuum $|0\rangle$ by a spin field
$S^\pm(z)$, that is,

$$
|h^\pm\rangle = S^\pm(0)\,|0\rangle, \qquad |h^-\rangle = G_0\,|h^+\rangle.
\tag{2.7.13}
$$

We will give an explicit representation of the spin field S(z) in the next
chapter. However, for our purposes, we only need to know its transformation
properties, not how to construct it.

The vacuum state $|h^\pm\rangle$ is actually a fermion. Thus, there is a chirality operator
$\Gamma = (-1)^F$, where F is the fermion number, which splits the vacuum into two
pieces

$$
\Gamma\,|h^\pm\rangle = \pm|h^\pm\rangle,
\tag{2.7.14}
$$

where

$$
\{\Gamma, G_n\} = [\Gamma, L_n] = 0.
\tag{2.7.15}
$$

For the lowest state, we find that the determinants are different for opposite
chirality (due to the central term in the 0-0 anticommutator in the algebra, that
is, $G_0^2 = L_0 - \hat c/16$):

$$
\det(M_0^+) = 1, \qquad \det(M_0^-) = (h - \hat c/16).
\tag{2.7.16}
$$

For higher levels, they are the same

$$
\det(M_n^+) = \det(M_n^-) = \left(h - \frac{\hat c}{16}\right)^{P_R(n)/2}
\prod_{p,q}\left[h - h_{p,q}\right]^{P_R(n - pq/2)},
\tag{2.7.17}
$$

where the product over p, q is over all positive integers p, q subject to the
constraint that $pq/2 \le n$ and p - q is odd. In turn, $P_R(k)$ is given by taking
the coefficients of the power expansion

$$
\sum_{k=0}^{\infty} t^k P_R(k) = \prod_{k=1}^{\infty}\frac{1 + t^k}{1 - t^k}.
\tag{2.7.18}
$$

The superconformal minimal Ramond series is then the same as the one found
for the Neveu-Schwarz space.
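
The exponents $P_{NS}$ and $P_R$ are defined only through their generating functions, so it may help to see the first few coefficients. The sketch below assumes the product forms $\prod_k (1 + t^{k-1/2})/(1 - t^k)$ and $\prod_k (1 + t^k)/(1 - t^k)$ and stores the series in powers of $u = t^{1/2}$, so that entry `[2 * n]` belongs to level n; the function name is illustrative.

```python
def level_counts(sector, max_level):
    """Coefficients of the generating function for P_NS or P_R.

    sector = "NS": prod_{k>=1} (1 + t^(k-1/2)) / (1 - t^k)
    sector = "R" : prod_{k>=1} (1 + t^k)       / (1 - t^k)
    The result is indexed by twice the level (powers of u = t^(1/2)).
    """
    size = 2 * max_level + 1
    coeffs = [0] * size
    coeffs[0] = 1
    for k in range(1, max_level + 1):
        fermion_power = 2 * k - 1 if sector == "NS" else 2 * k
        # Multiply by (1 + u^fermion_power): a fermionic mode is used at most once.
        for i in range(size - 1, fermion_power - 1, -1):
            coeffs[i] += coeffs[i - fermion_power]
        # Multiply by 1 / (1 - u^(2k)): a bosonic mode may be used any number of times.
        for i in range(2 * k, size):
            coeffs[i] += coeffs[i - 2 * k]
    return coeffs

print(level_counts("NS", 3))      # levels 0, 1/2, 1, 3/2, 2, 5/2, 3
print(level_counts("R", 4)[::2])  # levels 0, 1, 2, 3, 4
```

The NS list counts the independent products of $L_{-n}$ and $G_{-r}$ at each half-integer level; for example, the two states at level $\frac{3}{2}$ are $G_{-3/2}|h\rangle$ and $L_{-1}G_{-1/2}|h\rangle$.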





One interesting fact is that the first member of the superconformal minimal
series is given by

$$
m = 3, \qquad \hat c = \tfrac{7}{15}, \qquad c = \tfrac{7}{10},
\tag{2.7.19}
$$

which is precisely the same value found for the second member of the conformal
minimal series in Eq. (2.5.14). In fact, this is the only value for which the
conformal and superconformal minimal series coincide.

The tricritical Ising model, therefore, actually has a superconformal
representation as well as a conformal one. (This means that, experimentally
speaking, it is possible to find a superconformal representation in nature. The
adsorbing of helium-4 on krypton-plated graphite provides the first known
example of a realization of a superconformal theory.)

The tricritical model has a $Z_2$ symmetry, which flips the order operators, the
Ising spins, and the disorder operators. The even sector of the tricritical model
then corresponds to the NS sector of the N = 1 superconformal theory. The
odd sector, then, corresponds to the Ramond sector.

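The allowed values of h for the two sectors are given by

$$
\begin{aligned}
\text{NS}: &\qquad h_{1,1} = 0, \qquad h_{2,2} = \tfrac{1}{10},\\
\text{R}: &\qquad h_{1,2} = \tfrac{3}{80}, \qquad h_{2,1} = \tfrac{7}{16}.
\end{aligned}
\tag{2.7.20}
$$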
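
These values can be reproduced with exact rational arithmetic. The sketch below assumes the form $h_{p,q} = \{[(m+2)p - mq]^2 - 4\}/[8m(m+2)] + \frac{1}{32}[1 - (-1)^{p-q}]$ for Eq. (2.7.11), in which the second term contributes $\frac{1}{16}$ only in the Ramond sector (p - q odd); the function names are hypothetical.

```python
from fractions import Fraction

def c_hat(m):
    """Eq. (2.7.10): c_hat(m) = 1 - 8 / (m (m + 2))."""
    return 1 - Fraction(8, m * (m + 2))

def h_super(m, p, q):
    """Assumed form of Eq. (2.7.11); the 1/16 shift appears only for odd p - q."""
    base = Fraction(((m + 2) * p - m * q) ** 2 - 4, 8 * m * (m + 2))
    ramond_shift = Fraction(1, 16) if (p - q) % 2 else Fraction(0)
    return base + ramond_shift

m = 3
print(c_hat(m), Fraction(3, 2) * c_hat(m))       # c_hat and c = 3 c_hat / 2
print(h_super(m, 1, 1), h_super(m, 2, 2))        # NS sector
print(h_super(m, 1, 2), h_super(m, 2, 1))        # Ramond sector
```

For m = 3 this gives $\hat c = \frac{7}{15}$, $c = \frac{7}{10}$, the NS weights 0 and $\frac{1}{10}$, and the Ramond weights $\frac{3}{80}$ and $\frac{7}{16}$.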


Since the states of the tricritical Ising model can be represented either in
terms of the standard Virasoro theory or with the superconformal theory (with
different values of c), we can then see the correspondence between states
defined in one representation expressed as sums of states in the other. For
example, the following NS states can be expressed as sums over bosonic Virasoro
states

$$
\begin{aligned}
|h = 0\rangle_{NS} &= |h = 0\rangle_{Vir} \oplus |h = \tfrac{3}{2}\rangle_{Vir},\\
|h = \tfrac{1}{10}\rangle_{NS} &= |h = \tfrac{1}{10}\rangle_{Vir} \oplus |h = \tfrac{6}{10}\rangle_{Vir}.
\end{aligned}
\tag{2.7.21}
$$

We can use the same techniques used to calculate the correlation functions of
the minimal series to calculate the correlation functions for the superconformal
one. Specifically, we note that a null state is given by

$$
\left[G_{-3/2} - \tfrac{5}{3}\,L_{-1}G_{-1/2}\right]\left|h = \tfrac{1}{10}\right\rangle.
\tag{2.7.22}
$$
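
The coefficient in this null state can be recovered in two lines. The derivation below is a sketch that assumes the standard normalization of the N = 1 algebra stated in its first comment (which may differ from Eq. (2.2.26) by rescalings); it requires the candidate state to be annihilated by $G_{1/2}$ and $G_{3/2}$.

```latex
% Assumed N = 1 (Neveu-Schwarz) conventions, with c = 3\hat{c}/2:
%   [L_m, G_r] = (m/2 - r)\, G_{m+r},
%   \{G_r, G_s\} = 2 L_{r+s} + (\hat{c}/2)(r^2 - 1/4)\,\delta_{r+s,0}.
\begin{align*}
|\chi\rangle &= \bigl(G_{-3/2} + a\,L_{-1}G_{-1/2}\bigr)|h\rangle, \\
G_{1/2}|\chi\rangle &= \bigl[\,2 + a(2h+1)\bigr]\,L_{-1}|h\rangle = 0
  \;\Longrightarrow\; a = -\frac{2}{2h+1}, \\
G_{3/2}|\chi\rangle &= \bigl[\,2h + \hat{c} + 4ha\bigr]\,|h\rangle = 0
  \;\Longrightarrow\; \hat{c} = \frac{2h(3-2h)}{2h+1}.
\end{align*}
% For h = 1/10 these give a = -5/3 and \hat{c} = 7/15.
```

With $h = \frac{1}{10}$ the first condition fixes the coefficient $-\frac{5}{3}$, and the second is satisfied precisely at $\hat c = \frac{7}{15}$, the value in Eq. (2.7.19).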

As expected, coupling this null state to a product of superfields gives zero,
which in turn gives us differential equations for the correlation functions. For
example, the correlation function of four superfields can be calculated. Let $\Phi$
be a superfield

$$
\Phi(z,\bar z,\theta,\bar\theta) = \epsilon + \theta\psi + \bar\theta\bar\psi + \theta\bar\theta\,t
\tag{2.7.23}
$$

with conformal weight given by $(h,\bar h) = (\frac{1}{10},\frac{1}{10})$, while the conformal
weights of the various fields are given by $(\frac{1}{10},\frac{1}{10})$ for the $\epsilon$ field,
$(\frac{6}{10},\frac{6}{10})$ for the t field, and $(\frac{6}{10},\frac{1}{10})$ for the $\psi$ field.
Then, its correlation function is given by [21]:

$$
\langle \Phi(z_1,\theta_1)\,\Phi(z_2,\theta_2)\,\Phi(z_3,\theta_3)\,\Phi(z_4,\theta_4)\rangle
= |z_{12}z_{23}z_{34}z_{41}|^{-1/5}\left(|f|^2 + A|g|^2\right),
\tag{2.7.24}
$$

where 

/= 1 + ( ? %) {[^(i --|, bn)} 

+ ^[7d-7)] 9/1 V 1 (f^,f,,) 

8 = 1 + (^%) {[^ 1 - 7 ?)]' 2 ' F i(l’5’f’'7)} (2.7.25) 

+ ^[7(l-7)r 1/ V 1 (-f,-|,-f^) 

, (pr(f)r(j) 3 
r(i)r(f) 3 ’ 

where 

$$
\eta = \frac{z_{12}z_{34}}{z_{13}z_{24}}, \qquad
\zeta = 1 - \eta - \frac{z_{14}z_{23}}{z_{13}z_{24}}.
\tag{2.7.26}
$$

Lastly, we note some more equivalences between the superconformal minimal
series and known statistical models. We noted before that the m = 3
superconformal theory is identical to the minimal bosonic m = 4 theory.
We also note that the m = 4 theory is equivalent to a special case of the
Ashkin-Teller model, which we shall study in Chapter 6. Also, the m = 6
superconformal theory is equivalent to a critical point of the $Z_6$ Ising model.

In summary, the power of our formalism is that we can solve for all Green’s 
functions of the minimal models using the differential equations that they 
satisfy. Although the minimal models are unrealistic, with only a finite number 
of primary fields, they give us an invaluable laboratory in which to test many of 
our ideas concerning the full string theory, such as supersymmetry and modular 
invariance. 


2.8 Summary 

All perturbative vacuums of string theory possess conformal symmetry. Thus, 
it is important to search for a classification scheme for conformal field theories 
to determine the physics behind string theory. A two-dimensional conformal 
field theory, however, is special in that it has an infinite number of conserved 
currents. Thus, it is often possible to solve them exactly. 

We say that a conformal field has weight $h_1 + h_2$ and conformal spin $h_1 - h_2$
if it transforms as

$$
\phi(z,\bar z) = \left(\frac{\partial z'}{\partial z}\right)^{h_1}
\left(\frac{\partial \bar z'}{\partial \bar z}\right)^{h_2}\phi(z',\bar z')
\tag{2.8.1}
$$

under a conformal transformation. (We will often drop the $\bar z$ dependence in
this book because we have two exact copies of the same algebra. It can always
be restored later.)

If we power expand this infinitesimally, we find

$$
\delta\phi(z) = \epsilon(z)\,\partial_z\phi(z) + h\,\partial_z\epsilon(z)\,\phi(z).
\tag{2.8.2}
$$


A field that transforms in such a manner is called a primary field.
One example of a weight zero field is a free boson, with

$$
L = \frac{1}{2\pi}\,\partial\phi\,\bar\partial\phi, \qquad
T(z) = -\tfrac{1}{2}\,\partial_z\phi(z)\,\partial_z\phi(z).
\tag{2.8.3}
$$

It obeys the operator product expansion

$$
T(z)\,T(w) \sim \frac{c/2}{(z-w)^4} + \frac{2}{(z-w)^2}\,T(w)
+ \frac{1}{z-w}\,\partial_w T(w) + \cdots,
\tag{2.8.4}
$$

where c is the central charge of the Virasoro algebra; c = 1 for the free boson
and $\frac{1}{2}$ for a free fermion.

Representations of the conformal group are constructed out of a highest
weight state, labeled by two numbers h, c:

$$
\begin{aligned}
L_0|h,c\rangle &= h\,|h,c\rangle,\\
L_n|h,c\rangle &= 0, \qquad n = 1, 2, \ldots.
\end{aligned}
\tag{2.8.5}
$$

Then, the universal enveloping algebra is created by all products of the
ladder operators acting on the highest weight state

$$
|\{n\}\rangle = L_{-n_1}L_{-n_2}\cdots L_{-n_N}|h,c\rangle.
\tag{2.8.6}
$$

The elements of this set are called Verma modules.

One of the most important tools in conformal field theory is the Kac
determinant equation. When it is not zero, the Verma module is irreducible.
The determinant is 


$$
\det M_n = \prod_{k=1}^{n}\psi_k(h,c)^{P(n-k)},
\tag{2.8.7}
$$

where

$$
\psi_k(h,c) = \prod_{pq=k}\left[h - h_{p,q}(c)\right]
\tag{2.8.8}
$$

and

$$
h_{p,q}(c) = \frac{[(m+1)p - mq]^2 - 1}{4m(m+1)},
\tag{2.8.9}
$$

where the parameter c is related to m via

$$
c = 1 - \frac{6}{m(m+1)}, \qquad
m = -\frac{1}{2} \pm \frac{1}{2}\sqrt{\frac{25-c}{1-c}},
\tag{2.8.10}
$$

where p and q are positive integers.
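
As an illustration of Eqs. (2.8.9) and (2.8.10), the following sketch tabulates the weights of the first two unitary minimal models. It assumes the usual range $1 \le p \le m-1$, $1 \le q \le m$ for the labels, which is not stated in this summary; all function names are illustrative.

```python
import math
from fractions import Fraction

def central_charge(m):
    """c = 1 - 6 / (m (m + 1)), Eq. (2.8.10)."""
    return 1 - Fraction(6, m * (m + 1))

def kac_weight(m, p, q):
    """h_{p,q} = ([(m + 1) p - m q]^2 - 1) / (4 m (m + 1)), Eq. (2.8.9)."""
    return Fraction(((m + 1) * p - m * q) ** 2 - 1, 4 * m * (m + 1))

def kac_table(m):
    """Distinct weights for the assumed range 1 <= p <= m - 1, 1 <= q <= m."""
    return sorted({kac_weight(m, p, q)
                   for p in range(1, m) for q in range(1, m + 1)})

def m_from_c(c):
    """Positive root of Eq. (2.8.10), valid for c < 1."""
    return -0.5 + 0.5 * math.sqrt((25 - c) / (1 - c))

for m in (3, 4):
    print(m, central_charge(m), [str(h) for h in kac_table(m)])
print(m_from_c(0.5))
```

For m = 3 this lists the Ising weights 0, $\frac{1}{16}$, $\frac{1}{2}$ at $c = \frac{1}{2}$, and for m = 4 the six tricritical Ising weights at $c = \frac{7}{10}$.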

Usually, the representations in which we are interested have an infinite 
number of primary fields. However, when the Kac determinant is zero, the 
representation is reducible, and one can truncate the Verma modules until one 
obtains a finite number of primary fields. 

Unitary representations with a finite number of primary fields occur for 
c < 1. For minimal models, the operator product expansions, correlation 
functions, partition functions, etc., can be solved exactly. 

The operator product expansion of two arbitrary primary fields with weights 
n and m yields the following series: 


4>n(z, z)4>m( 0) ~ EE ^nm finm firm, 
P 


;{*} 

nm 


x i 


.&p-&n-8 n +Zikj& P -8 n -8m+Ei k^ikUk) 


0™(O), 


[<Pn] X [(t> m ] = 


( 2 . 8 . 11 ) 


This is called the fusion rule. By acting on both sides of the equation with a 
conformal transformation, one obtains an infinite series of equations. Solving 
them gives an explicit solution for the coefficients appearing in the equation. 

Correlation functions can be solved exactly. Correlation functions obey the
following Ward-like identity

$$
\langle T(z)\,\phi_1(w_1)\,\phi_2(w_2)\cdots\phi_n(w_n)\rangle
= \sum_i\left[\frac{h_i}{(z - w_i)^2} + \frac{1}{z - w_i}\frac{\partial}{\partial w_i}\right]
\langle \phi_1(w_1)\cdots\phi_n(w_n)\rangle,
\tag{2.8.12}
$$

which is derived by inserting T into the correlation function and then
commuting T past the $\phi$’s. This leads to a differential equation that can be
solved for minimal models.

For example, the correlation function 

G* m (x) = Y, c L c up F !l{p\x)> 

p 


(2.8.13) 





where 


f!L(p\ x ) = x Sp Sn Sm 

{k} 

(*#/(f l)L- ki L- k2 ...L- kfl \p) 
(k\Mh 1)1 p) 


(2.8.14) 


contains the function $F_{nm}^{lk}(p|x)$, which is called a conformal block, because
higher-order correlation functions can always be written in terms of such 
blocks. They are the building blocks by which we can write arbitrary correlation 
functions. They can be solved explicitly in the minimal models. 

Last, we note that superconformal theories also have unitary minimal 
models. Their central charge is given by 


$$
\hat c(m) = 1 - \frac{8}{m(m+2)}.
\tag{2.8.15}
$$


Thus, minimal models, because they only have a finite number of primary 
fields, have correlation functions that can be solved exactly, and therefore, they 
give us valuable insights into the structure of string theory, which necessarily 
has an infinite number of primary fields. 


References 


1. A. A. Belavin, A. M. Polyakov, and A. B. Zamolodchikov, Nucl. Phys. B241, 333
(1984).

2. D. Friedan, E. Martinec, and S. Shenker, Nucl. Phys. B271, 93 (1986).

For reviews, see Refs. 3 to 10.

3. L. Alvarez-Gaume, C. Gomez, and G. Sierra, “Topics in Conformal Field Theory,”
in Physics and Mathematics of Strings, L. Brink, D. Friedan, and A. M. Polyakov,
eds., World Scientific, Singapore (1990); G. Moore and N. Seiberg, Lectures on
RCFT, 1989 Trieste Summer School.

4. M. Peskin, 1986 Santa Cruz TASI lectures, SLAC-PUB-4251.

5. T. Banks, 1987 Santa Cruz TASI lectures, SCIPP 87/111.

6. D. Friedan, in Unified String Theories, M. Green and D. Gross, eds., World
Scientific, Singapore (1986).

7. T. Eguchi, Inst. Phys. Lectures, Taipei (1986).

8. J. Cardy, in Phase Transitions 11, Academic Press, San Diego (1987).

9. P. Ginsparg and J. L. Cardy, in Fields, Strings, and Critical Phenomena, 1988 Les
Houches School, E. Brezin and J. Zinn-Justin, eds., Elsevier Science, Amsterdam
(1989).

10. Vl. S. Dotsenko, Lectures on Conformal Field Theory, Advances in Studies in
Pure Mathematics, vol. 16 (1988).

11. M. A. Virasoro, Phys. Rev. D1, 2933 (1970).

12. V. Kac, Infinite-dimensional Lie Algebras, Birkhäuser, Basel (1983).

13. V. Kac, in Lecture Notes in Physics vol. 94, Springer-Verlag, Berlin (1979).

14. B. L. Feigin and D. B. Fuchs, Functional Anal. Appl. 16 (1982).

15. C. B. Thorn, Nucl. Phys. B248, 551 (1984).

16. D. Friedan, Z. Qiu, and S. H. Shenker, in Vertex Operators in Mathematics and
Physics, Springer-Verlag, Berlin (1985).

17. D. A. Huse, Phys. Rev. B30, 3908 (1984).

18. G. E. Andrews, R. J. Baxter, and P. J. Forrester, J. Statist. Phys. 35, 193 (1984).

19. A. Meurman and A. Rocha-Caridi, MSRI preprint.

20. C. Thorn, Nucl. Phys. B248, 551 (1984).

21. D. Friedan, Z. Qiu, and S. Shenker, Phys. Lett. 151B, 37 (1984).



CHAPTER 3 


WZW Model, 

Cosets, and Rational 
Conformal Field Theory 


3.1 Compactification and the WZW Model 

In the previous chapter, we emphasized the importance of conformal invariance 
as a stringent requirement that allowed us to calculate many of the simpler 
Green’s functions from first principles. Because the conformal group has an 
infinite number of generators, a surprisingly large number of mathematical 
results flow from the requirement of conformal invariance alone. 

Unfortunately, conformal invariance alone cannot determine the correlation
functions that we desire, nor can it lead to a realistic string theory. In
reality, conformal invariance alone cannot explain the rich diversity of particles
found in nature, which includes particles transforming under a gauge group
and perhaps supersymmetry.

In the next two chapters, therefore, we explore additional constraints that 
will define the model and give us a more realistic phenomenology. In this 
chapter, we introduce the concept of compactification, that is, the curling up 
of some of the unwanted dimensions into a compact manifold, leaving us with 
a physical, four-dimensional theory. 

Because there may be symmetries associated with the compactified space, 
we will introduce new symmetries into the theory, which are described by 
Kac-Moody algebras [1]. In this regard, string theory has revived an old trick 
due to Kaluza, introduced in 1919 [2]. 

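Kaluza’s idea was to embed both Maxwell’s equations and Einstein’s theory
of gravity into a single field, the metric tensor $g_{AB}$ in five dimensions. Let us
decompose the five-dimensional metric tensor as follows:

$$
\begin{pmatrix} g_{\mu\nu} & g_{\mu 5}\\ g_{5\nu} & g_{55}\end{pmatrix}
= \begin{pmatrix} g_{\mu\nu} + \kappa^2 A_\mu A_\nu & \kappa A_\mu\\ \kappa A_\nu & \phi\end{pmatrix}.
\tag{3.1.1}
$$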




Let us assume that the unseen fifth dimension has compactified into a circle,
that is,

$$
x_5 = x_5 + R,
\tag{3.1.2}
$$

so the fifth dimension is periodic. If we take R small enough, derivatives with
respect to the fifth dimension will be small and can be neglected.

With this reduction, the variation of the $g_{\mu 5}$ field yields

$$
\delta A_\mu = \partial_\mu \epsilon
\tag{3.1.3}
$$

and the final action is the sum of the Einstein action and the Maxwell action 

L = ~R - \ y /gg‘" , g af, F lia F vP + ■■■. (3.1.4) 

To explain why the fifth dimension was never seen, Kaluza speculated that
the fifth dimension had curled up into a small circle, too small to be
experimentally observed. Thus, although Kaluza’s idea gave great elegance and
beauty to a unification of gravity with light, he had no idea why the fifth
dimension had curled up or what size it was.

This process can be duplicated for higher dimensions. If we take Einstein’s 
theory in higher dimensions beyond the fifth, we can compactify the unwanted 
dimensions on a manifold that, like Kaluza’s circle, has certain symmetries 
or isometries associated with it. This symmetry can be represented by a Lie 
group. Not surprisingly, Einstein’s theory in N dimensions then reduces to 
Yang-Mills theory coupled to four-dimensional gravity. 

String theory must necessarily incorporate Kaluza’s compactification
scheme if it is to become a realistic theory. This is both its strength and
weakness. With relatively few assumptions, one can show that a compactification
of string theory yields remarkably realistic phenomenological models. This is
the advantage of compactification.

The weakness of this compactification, of course, is that we still cannot 
answer the questions raised by Kaluza 70 years ago, for example, why did the 
universe compactify in this manner? 

Now we wish to generalize this discussion to a bosonic string propagating
in curved or compactified space-time. Specifically, we wish the first quantized
action to contain the term

$$
L = \frac{1}{\pi}\,G_{\mu\nu}(X)\,\partial_a X^\mu\,\partial_b X^\nu\,g^{ab} + \cdots,
\tag{3.1.5}
$$

where we have now explicitly added the curvature of space-time through the
metric tensor $G_{\mu\nu}$. In general, this action is much too difficult to solve
exactly, so we will make simplifications. Depending on which assumption we
make about the background metric, we will arrive at different conformal field
theories.

Assume, for the moment, that the string is propagating on a manifold
specified by a Lie group, that is, a group manifold. Let G be a semisimple Lie
group and let g be an element of this group. We will exploit the similarity
between string theory and the sigma model, so our first guess for a string
propagating on this group manifold may be

$$
L = \frac{1}{\pi}\,\mathrm{tr}\left(\partial_a g^{-1}\,\partial^a g\right),
\tag{3.1.6}
$$

where g is a function of the string field $X^\mu$. In this form, we can calculate
$G_{\mu\nu}$ in terms of the g field. By differentiating, we find that
$\partial_a g = \partial_a X^\mu f_{a\mu}(X)$ for some function $f_{a\mu}$. Then, the metric
$G_{\mu\nu}$ can be expressed in terms of $f_{a\mu}$.

It turns out, however, that the naive choice of our action is incorrect. Treating
the model as a $\sigma$ model, it can be shown that it is not conformally invariant.
In order to have a fully conformally invariant model, let us modify our naive
choice and add a new term to the previous action

$$
S = \frac{1}{4\lambda^2}\int \mathrm{tr}\left(\partial_a g^{-1}\,\partial^a g\right)d^2\xi + k\,\Gamma(g),
\tag{3.1.7}
$$

where the new term is called a Wess-Zumino term

$$
\Gamma(g) = \frac{1}{24\pi}\int d^3X\,\epsilon^{\alpha\beta\gamma}\,
\mathrm{tr}\left[(g^{-1}\partial_\alpha g)(g^{-1}\partial_\beta g)(g^{-1}\partial_\gamma g)\right].
\tag{3.1.8}
$$

The Wess-Zumino term is integrated over a three-dimensional disk whose
boundary is two-dimensional space-time.

For k = 0, this theory reduces to the familiar $\sigma$ model, which is known
to be asymptotically free and massive. Thus, conformal symmetry is violated,
and the model is not suitable for our purposes. However, for k = 1, 2, ...,
the theory becomes effectively massless and possesses an infrared-stable fixed
point at

$$
\lambda^2 = 4\pi/k.
\tag{3.1.9}
$$

Therefore, at these special values of k, we have a conformally invariant $\sigma$
model where the theory is defined on a group manifold. We will call the action
at this value the Wess-Zumino-Witten (WZW) model [3, 4]. In addition to
conformal invariance, the remarkable feature of this model is that it is also
invariant under the following transformation:

$$
\begin{aligned}
g(\xi) &\to \Omega(z)\,g(\xi)\,\bar\Omega^{-1}(\bar z),\\
z &= \xi^1 + i\xi^2,\\
\bar z &= \xi^1 - i\xi^2.
\end{aligned}
\tag{3.1.10}
$$

One can show that the action is invariant under this symmetry using the identity: 
S(gh-') = S(g) + S(h)+^ j (g-'d- z gh-'d z h)d 2 i;. 


(3.1.11) 





To analyze this new symmetry further, let us now extract the generators of
this symmetry, which will turn out to represent an infinite set of currents [3, 5]:

$$
\begin{aligned}
J &= -\tfrac{1}{2}k\,\partial_z g\,g^{-1} = J^a t^a,\\
\bar J &= -\tfrac{1}{2}k\,g^{-1}\partial_{\bar z} g = \bar J^a t^a,
\end{aligned}
\tag{3.1.12}
$$

where

$$
\partial_{\bar z} J = 0, \qquad \partial_z \bar J = 0,
\tag{3.1.13}
$$

where $t^a$ are the generators of a Lie algebra.

Let us decompose the generator J in terms of its moments

$$
J(z) = \sum_{n=-\infty}^{\infty} J_n\,z^{-n-1}.
\tag{3.1.14}
$$

The J’s generate an algebra given by

$$
[J^a_n, J^b_m] = f^{abc}J^c_{n+m} + \tfrac{1}{2}kn\,\delta^{ab}\delta_{n+m,0}.
\tag{3.1.15}
$$

This is a special case of what is called a Kac-Moody algebra [1, 6, 7], and
it effectively smears the generators of an ordinary Lie algebra around a circle
or string. Notice that for n = m = 0, we retrieve a classical Lie algebra.

Since the conformal anomaly vanishes at the fixed point of the WZW theory,
the theory is conformally invariant, and it should be possible to give the explicit
form of the energy-momentum tensor T(z) in terms of these currents. In fact,
we find the Sugawara form of the energy-momentum tensor

$$
T(z) = \frac{1}{\kappa}:J^a(z)J^a(z): \;=\; \frac{1}{\kappa}\sum_{n,m}:J^a_{n-m}J^a_m:\,z^{-n-2},
\tag{3.1.16}
$$

where

$$
\kappa = -\tfrac{1}{2}(c_v + k), \qquad f^{abc}f^{bcd} = c_v\,\delta^{ad},
\tag{3.1.17}
$$

where $c_v$ is called the second Casimir of the adjoint representation of the Lie
algebra.

Written in component form, we have the Virasoro generators written as

$$
L_n = -\frac{2}{c_v + k}\sum_{m=-\infty}^{\infty}:J^a_m J^a_{n-m}:.
\tag{3.1.18}
$$

If we commute two generators of the Virasoro algebra written in Sugawara
form, we find [see (1.1.24)]:

$$
c = kD/(c_v + k),
\tag{3.1.19}
$$

where D is the dimension of the group.

Last, we find that the two algebras can be spliced together by taking the
semidirect sum of their generators

$$
[L_n, J^a_m] = -m\,J^a_{n+m}.
\tag{3.1.20}
$$

Written in terms of conformal operators, we have

$$
\begin{aligned}
T(z)\,J^a(z') &\sim \frac{1}{(z - z')^2}\,J^a(z') + \frac{1}{z - z'}\,\partial J^a(z'),\\
J^a(z)\,J^b(z') &\sim \frac{\frac{1}{2}k\,\delta^{ab}}{(z - z')^2} + \frac{f^{abc}}{z - z'}\,J^c(z').
\end{aligned}
\tag{3.1.21}
$$


Example: O(D) and SU(N) 

Now consider, for the moment, the following tensor field composed out of
fermion fields

$$
J^{\mu\nu}(z) = \psi^\mu(z)\,\psi^\nu(z),
\tag{3.1.22}
$$

where we use

$$
\psi^\mu(z)\,\psi^\nu(w) \sim \frac{\delta^{\mu\nu}}{z - w}.
\tag{3.1.23}
$$

Notice that $J^{\mu\nu}$ satisfies the commutation relations of the Kac-Moody
algebra O(D). Thus, fermion fields give us a simple realization of both a conformal
field theory as well as a Kac-Moody algebra.

We could also have taken

$$
J^a(z) = \bar\psi(z)\,t^a\,\psi(z),
\tag{3.1.24}
$$

where $\psi(z)$ transforms in the vector representation of SO(D). Then, we find
that the Kac-Moody algebra of SO(D) with k = 1 is satisfied, with

$$
c_{SO(D),\,k=1} = \frac{\frac{1}{2}D(D-1)}{1 + (D-2)} = \frac{D}{2}.
\tag{3.1.25}
$$

The central term is thus consistent with D free fermion fields.

Let us say that we now have complex fermions, transforming under SU(N),
such that

$$
J^a(z) = \psi^\dagger(z)\,t^a\,\psi(z).
\tag{3.1.26}
$$

Then, it is easy to check that the affine $SU(N) \otimes U(1)$ is realized, so that the
central term is

$$
c_{U(1)} + c_{SU(N)} = 1 + \frac{N^2 - 1}{1 + N} = N,
\tag{3.1.27}
$$

which is consistent with N free complex fermion fields.
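
Both central-charge counts follow from Eq. (3.1.19) by simple arithmetic, which the sketch below confirms for a range of D and N. The group data used, dim SO(D) = D(D-1)/2 with $c_v = D - 2$ and dim SU(N) = $N^2 - 1$ with $c_v = N$, are read off from Eqs. (3.1.25) and (3.1.27); the function names are illustrative.

```python
from fractions import Fraction

def sugawara_c(level, dim, casimir):
    """c = k D / (c_v + k), Eq. (3.1.19)."""
    return Fraction(level * dim, casimir + level)

def c_so(D):
    # Assumed group data: dim SO(D) = D(D-1)/2 and c_v = D - 2, as in Eq. (3.1.25).
    return sugawara_c(1, D * (D - 1) // 2, D - 2)

def c_su_times_u1(N):
    # Assumed group data: dim SU(N) = N^2 - 1 and c_v = N; the U(1) factor adds 1.
    return 1 + sugawara_c(1, N * N - 1, N)

for D in (3, 10, 16):
    print("SO(%d), k = 1: c = %s" % (D, c_so(D)))
for N in (2, 3, 5):
    print("SU(%d) x U(1), k = 1: c = %s" % (N, c_su_times_u1(N)))
```

In each case the result is D/2 or N, the central charge of D real or N complex free fermions.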

Example: Bosonization 

Let us now introduce two new techniques, bosonization and external charges,
which will prove invaluable in constructing explicit representations of
Kac-Moody algebras and conformal field theories. We will use these two techniques
repeatedly throughout this book.





In two dimensions, because Lorentz transformations have only one generator,
the distinction between a boson and a fermion is not that great. In fact,
the main distinction lies in their statistics. Thus, it is possible to exponentiate
a boson and (after normal ordering) obtain a fermion field, similar to the way
that we exponentiate the string variable in order to obtain a vertex function for
the Veneziano model.

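Let us now calculate the conformal weight of the vertex operator
$: e^{q\phi(z)} :$. A straightforward calculation yields

$$
\begin{aligned}
T(w) : e^{q\phi(z)} : &\sim -\tfrac{1}{2} [\langle \partial_w \phi(w)\, q\phi(z) \rangle]^2 : e^{q\phi(z)} : \\
&\quad - \tfrac{1}{2}\, 2\, \partial_w \phi(w) \langle \partial_w \phi(w)\, q\phi(z) \rangle : e^{q\phi(z)} : \\
&\sim \frac{-q^2/2}{(w - z)^2} : e^{q\phi(z)} : + \frac{q\, \partial_w \phi}{w - z} : e^{q\phi(z)} : ,
\end{aligned} \tag{3.1.28}
$$

so that the vertex operator has conformal weight equal to $-q^2/2$. (We will
often use the fact that a vertex $e^{iq\phi}$ has conformal weight $q^2/2$.)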

We also find that

$$\left\langle e^{i\alpha_1 \phi(z_1)} e^{i\alpha_2 \phi(z_2)} \cdots e^{i\alpha_N \phi(z_N)} \right\rangle = \prod_{i < j} (z_i - z_j)^{\alpha_i \alpha_j}, \tag{3.1.29}$$

where the $\alpha_i$ sum to zero. (It is easy to check that this formula has the
correct conformal weight. The left-hand side has conformal weight
$\sum_i \alpha_i^2/2$, while the right-hand side has weight
$-\sum_{i < j} \alpha_i \alpha_j$. These two expressions, however, are equal,
which can easily be seen by squaring the sum of the $\alpha_i$, which is
zero.)
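
The weight-matching argument in the parenthesis can be checked numerically. The
following Python sketch is an illustration with hypothetical helper names and an
arbitrarily chosen set of charges; it verifies that the two expressions for the
total weight agree once the charges sum to zero.

```python
from fractions import Fraction
from itertools import combinations


def lhs_weight(alphas):
    # Sum of the individual vertex-operator weights alpha_i^2 / 2.
    return sum(a * a for a in alphas) / 2


def rhs_weight(alphas):
    # Minus the sum over pairs i < j of alpha_i * alpha_j.
    return -sum(a * b for a, b in combinations(alphas, 2))


# Example charges (an arbitrary choice) that sum to zero.
charges = [Fraction(1), Fraction(-1, 2), Fraction(3, 4), Fraction(-5, 4)]
assert sum(charges) == 0
assert lhs_weight(charges) == rhs_weight(charges)
```

The equality follows from expanding the square of the vanishing total charge,
which is exactly the step described in the text.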

One of the most important uses of this formalism is to create fermion operators
out of boson operators (which is only possible in two dimensions). Notice
that the vertex operator has anticommutation relations with itself, so that we
can consider it to be a fermion. For example, for $\alpha = 1$, we have a fermion
with a conformal weight equal to $\frac{1}{2}$, as expected.


Example: External Charges 

Let us examine the case of the free boson with the energy-momentum tensor
given by

$$T(z) = \frac{\epsilon}{2} [\partial_z \phi(z)]^2 - \frac{Q}{2} \partial_z^2 \phi(z). \tag{3.1.30}$$

Notice that we have made several changes in the usual form for the
energy-momentum tensor. First, we have put in a factor of $\epsilon = \pm 1$ in T.
In order to satisfy the usual operator product expansion of T in the presence of
$\epsilon$, we must alter Eq. (2.2.8) to read

$$\phi(w) \phi(z) \sim \epsilon \ln(w - z). \tag{3.1.31}$$

Note the presence of $\epsilon$ in front of the logarithm.





Second, we have added a term proportional to $\partial^2 \phi$. Because of the
presence of this last term, we have altered the conformal properties of the free
boson. Let us now calculate the contribution of this last term to the central
term. We find

$$
\begin{aligned}
T(w) T(z) &\sim 2 \left( \frac{\epsilon}{2} \right)^2 \langle \partial_w \phi\, \partial_z \phi \rangle^2 + \left( \frac{Q}{2} \right)^2 \langle \partial_w^2 \phi\, \partial_z^2 \phi \rangle + \cdots \\
&\sim \frac{1}{2 (w - z)^4} - \frac{3 \epsilon Q^2}{2 (w - z)^4} + \cdots,
\end{aligned} \tag{3.1.32}
$$

so that the central term is equal to

$$c = 1 - 3 \epsilon Q^2. \tag{3.1.33}$$

For $Q = 0$, we arrive back at the free boson. We will, however, use the
case $Q \neq 0$ extensively. For example, the conformal weight of $e^{q\phi}$ can
also be calculated, and it is now given by

$$\tfrac{1}{2} \epsilon q (q + Q). \tag{3.1.34}$$

We will use this expression for the conformal weight of the vertex operator
throughout this book.
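
The two formulas of this example are easy to tabulate. The Python sketch below is
an illustration with hypothetical function names; it reads Eq. (3.1.34) as
$\frac{1}{2}\epsilon q(q + Q)$, which is an assumption chosen so that it
reduces to the weight $-q^2/2$ of Eq. (3.1.28) for $\epsilon = -1$, $Q = 0$.

```python
from fractions import Fraction


def central_charge(eps, Q):
    """c = 1 - 3*eps*Q^2, as in Eq. (3.1.33)."""
    return 1 - 3 * eps * Fraction(Q) ** 2


def vertex_weight(eps, q, Q):
    """Weight of :exp(q*phi):, read here as (1/2)*eps*q*(q + Q)."""
    q, Q = Fraction(q), Fraction(Q)
    return Fraction(1, 2) * eps * q * (q + Q)


# Q = 0 gives back the free boson with c = 1 for either sign of eps.
assert central_charge(+1, 0) == 1
assert central_charge(-1, 0) == 1

# eps = -1, Q = 0 reproduces the weight -q^2/2 found in Eq. (3.1.28).
q = Fraction(3, 7)
assert vertex_weight(-1, q, 0) == -q * q / 2

# A nonzero background charge shifts both c and the weight.
assert central_charge(-1, 2) == 13
```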


3.2 Frenkel-Kac Construction 

Let us now use the methods developed in the last section, such as bosonization,
to write explicit representations of Kac-Moody algebras. This is the
Frenkel-Kac construction [8-10], which in turn is based on the Cartan-Weyl
representation of an ordinary Lie algebra. The Frenkel-Kac construction of
the Kac-Moody algebra is perhaps the most commonly used representation.

A Lie algebra is usually written as

$$[\tau_a, \tau_b] = f_{ab}^{\;\;c} \tau_c. \tag{3.2.1}$$

Although concise and elegant, this representation tells us very little about
the structure of the Lie algebra, which is hidden within the structure constants.
Thus, we will sometimes find more convenient the Cartan-Weyl representation,
which displays the structure of the algebra in a more transparent way. The
Cartan-Weyl construction is based on the fact that, within an ordinary Lie
algebra, we have two types of generators: the generators $H_i$, which mutually
commute among themselves (forming the Cartan subalgebra), and the set $E_\alpha$
of all other generators.

In general, the number of generators within the Cartan subalgebra is called
the rank r of the algebra. Thus, SU(2) has rank 1 because $L_z$ is usually singled
out as the generator of the Cartan subalgebra. For SU(3), the rank is 2, because
$T_3$ and Y are usually singled out as the mutually commuting operators.





The other elements of the algebra $E_\alpha$ are labeled by the vectors
$\alpha$, called the root vectors, which live in an r-dimensional space. The
number of elements within the Cartan subalgebra plus the number of root vectors
$\alpha$ equals the number of parameters of the group, called the dimension.

The complete commutation relations of the Lie algebra can now be rewritten
in terms of the Cartan subalgebra and the root vectors as

$$
\begin{aligned}
[H_i, H_j] &= 0, \\
[H_i, E_\alpha] &= \alpha_i E_\alpha, \\
[E_\alpha, E_\beta] &= N_{\alpha,\beta} E_{\alpha+\beta}, \\
[E_\alpha, E_{-\alpha}] &= \alpha_i H_i,
\end{aligned} \tag{3.2.2}
$$

where the $N_{\alpha,\beta}$ are the structure constants of the algebra.

We would now like to find a representation of the Cartan-Weyl basis,
generalized to the case of the Kac-Moody algebra, based on free boson fields. Let
us begin by writing the generators of O(2N). We begin with N free boson fields
$\phi_i$ with weight zero. The simplest representation of the Cartan subalgebra,
with conformal weight 1, is, therefore,

$$H_i(z) = \partial_z \phi_i(z). \tag{3.2.3}$$

Because the $\phi_i$ are free, the generators obviously commute with each other.

We now need a representation of the other elements $E_\alpha$. Our first guess
might be something like $: \exp(\phi_i + \phi_j) :$, which has conformal spin 1.
This naive choice almost works, but it has the wrong commutation relations.

To remedy this, let us define $e_i$ to be a unit vector in the ith direction. (We
suppress the index labeling the isospin space.) Then, $\phi$ is also a vector in
this space. Now, define

$$
\begin{aligned}
f_{\pm e_i}(z) &= \exp(\pm e_i \cdot \phi)\, c_{\pm e_i}, \\
E_{\pm i \pm j} &= f_{\pm e_i} f_{\pm e_j} = -E_{\pm j \pm i}, \qquad i \neq j.
\end{aligned} \tag{3.2.4}
$$

The numbers $c_{\pm e_i}$ are constants, called the cocycles, which must be added
to the definition in order to get the correct statistics for the generators of the
algebra. A convenient choice of these cocycles is

$$c_{e_i} = (-1)^{N_1 + N_2 + \cdots + N_{i-1}}, \tag{3.2.5}$$

where $N_i$ is the fermion number for the ith fermion. (Actually, there exists a
wide variety of choices for the cocycle.)

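With this definition, it is now straightforward to check [8, 11]:

$$
\begin{aligned}
H_j(z) H_k(w) &\sim \frac{\delta_{jk}}{(z - w)^2}, \\
H_j(z) E_\alpha(w) &\sim \frac{\alpha_j E_\alpha(w)}{z - w}, \\
E_{j \pm k}(z) E_{-j \mp k}(w) &\sim \frac{1}{(z - w)^2} + \frac{H_j(w) \pm H_k(w)}{z - w}, \\
E_{\pm i \pm j}(z) E_{\mp j \pm k}(w) &\sim \frac{E_{\pm i \pm k}(w)}{z - w}, \qquad i \neq k, \\
E_{\pm i \pm j}(z) E_{\pm k \pm l}(w) &\sim \text{finite}.
\end{aligned} \tag{3.2.6}
$$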

Thus, we have now successfully represented the generators of the Lie algebra
for affine O(2N) in terms of N free boson fields, $\phi_i$.
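
The counting behind this construction can be made concrete. The Python sketch
below is an illustration with hypothetical function names; it enumerates the
vectors $\pm e_i \pm e_j$ that label the operators $E_{\pm i \pm j}$, checks that
together with the N Cartan generators they account for the N(2N - 1) generators of
O(2N), and checks that each has length squared 2, so that the associated vertex
operator has weight $\alpha^2/2 = 1$.

```python
from itertools import combinations, product


def so2n_roots(N):
    """Vectors +-e_i +-e_j (i < j), returned as integer tuples of length N."""
    roots = []
    for i, j in combinations(range(N), 2):
        for si, sj in product((1, -1), repeat=2):
            v = [0] * N
            v[i], v[j] = si, sj
            roots.append(tuple(v))
    return roots


def root_weight(root):
    # Weight alpha^2 / 2 of a vertex operator built on the vector alpha.
    return sum(x * x for x in root) / 2


for N in range(2, 8):
    roots = so2n_roots(N)
    assert len(roots) == 2 * N * (N - 1)
    assert N + len(roots) == N * (2 * N - 1)   # rank + roots = dimension of O(2N)
    assert all(root_weight(r) == 1 for r in roots)
```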

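As we mentioned earlier, we can also represent the O(2N) Kac-Moody
algebra in terms of fermion fields $\psi_M$.

Let

$$f_{\pm e_j} = \frac{1}{\sqrt{2}} (\psi_{2j-1} \mp i \psi_{2j}). \tag{3.2.7}$$

Then, the generators of O(2N) can be written in terms of these fermion
fields as

$$J_{MN} = \psi_M \psi_N. \tag{3.2.8}$$

Then, the following algebra is satisfied:

$$
\begin{aligned}
J_{MN}(z) J_{PQ}(w) &\sim -\frac{k (\delta_{MP} \delta_{NQ} - \delta_{MQ} \delta_{NP})}{(z - w)^2} \\
&\quad - \frac{(\delta_{MP} J_{NQ} - \delta_{MQ} J_{NP} - \delta_{NP} J_{MQ} + \delta_{NQ} J_{MP})(w)}{z - w}.
\end{aligned} \tag{3.2.9}
$$

We also have the relation

$$J_{MN}(z) \psi_P(w) \sim -\frac{\delta_{MP} \psi_N(w) - \delta_{NP} \psi_M(w)}{z - w} + \text{finite}. \tag{3.2.10}$$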


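In calculating this operator product expansion, we have used the convenient
formula (which is easily derived from the Baker-Hausdorff formula
$e^A e^B = e^{A + B + (1/2)[A, B] + \cdots}$) [8, 11]:

$$
\begin{aligned}
O^\lambda(z) O^{\lambda'}(w) &\sim (z - w)^{\lambda \cdot \lambda'} \exp[i\pi (\lambda \cdot M \lambda')] \\
&\quad \times \exp[\lambda \cdot \phi(z) + \lambda' \cdot \phi(w)]\, c_{\lambda + \lambda'} \\
&\sim (z - w)^{\lambda \cdot \lambda'} \exp[i\pi (\lambda \cdot M \lambda')]\, O^{\lambda + \lambda'} \\
&\quad \times \{1 + (z - w) \lambda \cdot \partial \phi + \tfrac{1}{2} (z - w)^2 [\lambda \cdot \partial^2 \phi + (\lambda \cdot \partial \phi)^2] + \cdots\}(w),
\end{aligned} \tag{3.2.11}
$$

where

$$O^\lambda(z) = e^{\lambda \cdot \phi} c_\lambda. \tag{3.2.12}$$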


The advantage of this bosonized formalism is that we can now write an
explicit representation of the space-time spinors $S^\alpha$ in Eq. (2.7.13)
occurring in the Ramond sector of the superstring in terms of free bosons. In this
way, we can now represent both the NS and R fields in terms of a more basic set of
conformal fields.




In 2N dimensions, we want to construct a $2^N$-component spinor. To do this,
let us first write the following row matrix (with N entries):

$$\lambda = (\pm, \pm, \ldots, \pm, \pm)/2, \tag{3.2.13}$$

which can take on $2^N$ possible values. Our first guess for a spinor might be

$$e^{\lambda \cdot \phi}, \tag{3.2.14}$$

but this has the wrong conformal weight.

We recall from Eqs. (3.1.30)-(3.1.33) that $: e^{\alpha\phi} :$ has conformal
weight $\frac{1}{2}\alpha^2$ for $\epsilon = +1$ and $Q = 0$, so that a spinor
composed in this fashion would have weight N/8. In 10 dimensions, this spinor
would have weight $\frac{5}{8}$, so we need a new field with weight $\frac{3}{8}$
to give us the desired weight of 1.

We thus introduce a new field, $\phi_6$ (with $\epsilon = -1$ and $Q = 2$) so that

$$T(z) : e^{\lambda \cdot \phi + q\phi_6} : \sim [\tfrac{1}{2} \lambda \cdot \lambda - \tfrac{1}{2} q(q + 2)] (z - w)^{-2} : e^{\lambda \cdot \phi + q\phi_6} : , \tag{3.2.15}$$

that is, the $\phi_6$ field contributes $-\frac{1}{2} q(q + 2)$ to the conformal
weight (i.e., it is defined with a background charge). If we choose
$q = -\frac{1}{2}$, then the spinor has unit conformal weight

$$\frac{5}{8} + \frac{3}{8} = 1, \tag{3.2.16}$$

as desired.
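
The weight bookkeeping for the spinor can be spelled out in a few lines. The
Python sketch below is an illustration with hypothetical function names; it uses
N = 5 boson components for 10 dimensions and the contribution
$-\frac{1}{2} q(q + 2)$ of the extra field quoted above.

```python
from fractions import Fraction


def spinor_weight(N):
    # lambda = (+-1/2, ..., +-1/2) has lambda.lambda = N/4, so the weight is N/8.
    lam = [Fraction(1, 2)] * N
    return sum(x * x for x in lam) / 2


def extra_field_weight(q, Q=2):
    # Contribution -(1/2) q (q + Q) of the field with eps = -1 and Q = 2.
    q = Fraction(q)
    return -Fraction(1, 2) * q * (q + Q)


assert spinor_weight(5) == Fraction(5, 8)
assert extra_field_weight(Fraction(-1, 2)) == Fraction(3, 8)
assert spinor_weight(5) + extra_field_weight(Fraction(-1, 2)) == 1
```

The choice of signs in $\lambda$ does not matter here, since only
$\lambda \cdot \lambda$ enters the weight.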

The operator product expansions of the spin field $S^A$ with the Lorentz
generators and the anticommuting vector $\psi^M$ are

$$
\begin{aligned}
J_{MN}(z) S^A(w) &\sim -\frac{1}{4} \frac{([\Gamma_M, \Gamma_N])^A{}_B S^B(w)}{z - w}, \\
\psi^M(z) S^A(w) &\sim \frac{1}{\sqrt{2}} \frac{(\Gamma^M)^A{}_B S^B(w)}{\sqrt{z - w}}.
\end{aligned} \tag{3.2.17}
$$

The constant factors appearing in the operator product expansion, called
$\Gamma$ matrices, can be shown to satisfy the properties of the usual Dirac
matrices

$$\{\Gamma^M, \Gamma^N\} = 2\delta^{MN}, \tag{3.2.18}$$

so we will simply define them to be the Dirac matrices for O(2N).

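The operator product expansion therefore gives us an explicit representation
of these Dirac matrices. They can be given as follows. Let us define the usual
Pauli spin matrices

$$\sigma^1 = \begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix}, \qquad \sigma^2 = \begin{pmatrix} 0 & -i \\ i & 0 \end{pmatrix}, \qquad \sigma^3 = \begin{pmatrix} 1 & 0 \\ 0 & -1 \end{pmatrix}, \tag{3.2.19}$$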





and $\sigma^0$ is the unit matrix. Then, the Dirac matrices can be given as

$$
\begin{aligned}
\Gamma^{2j-1} &= (-1)^{(j-1)/2} (\sigma^3 \otimes)^{j-1} \sigma^1 (\otimes \sigma^0)^{N-j}, && j \text{ odd}, \\
\Gamma^{2j-1} &= -(-1)^{j/2} (\sigma^3 \otimes)^{j-1} \sigma^2 (\otimes \sigma^0)^{N-j}, && j \text{ even}, \\
\Gamma^{2j} &= (-1)^{(j-1)/2} (\sigma^3 \otimes)^{j-1} \sigma^2 (\otimes \sigma^0)^{N-j}, && j \text{ odd}, \\
\Gamma^{2j} &= (-1)^{j/2} (\sigma^3 \otimes)^{j-1} \sigma^1 (\otimes \sigma^0)^{N-j}, && j \text{ even}.
\end{aligned} \tag{3.2.20}
$$

Notice that we can define the vector $(\lambda, q)$ to span a six-dimensional
space. Thus, in this completely bosonized formalism, each element of the algebra
can be uniquely specified by fixing the value of $(\lambda, q)$. This gives us a
lattice representation of all fields occurring in the affine O(2N) construction.

In the lattice construction, all conformal operators appearing in the theory
can be expressed as a point in this lattice space via bosonization.
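
A tensor-product construction of this kind can be tested against the Clifford
algebra of Eq. (3.2.18). The Python sketch below is an illustration only: it
builds the matrices as a string of $\sigma^3$ factors, one $\sigma^1$ or
$\sigma^2$, and a string of unit matrices, and it omits the overall sign factors
as a simplifying assumption, since signs of $\pm 1$ do not change the
anticommutators. All function names are hypothetical.

```python
from functools import reduce

S0 = [[1, 0], [0, 1]]
S1 = [[0, 1], [1, 0]]
S2 = [[0, -1j], [1j, 0]]
S3 = [[1, 0], [0, -1]]


def kron(a, b):
    """Kronecker product of two square matrices given as nested lists."""
    nb = len(b)
    n = len(a) * nb
    return [[a[r // nb][c // nb] * b[r % nb][c % nb] for c in range(n)]
            for r in range(n)]


def matmul(a, b):
    n = len(a)
    return [[sum(a[r][k] * b[k][c] for k in range(n)) for c in range(n)]
            for r in range(n)]


def gamma_matrices(N):
    """2N matrices of size 2^N, with overall signs set to +1 (assumption)."""
    gammas = []
    for j in range(1, N + 1):
        for s in (S1, S2):
            factors = [S3] * (j - 1) + [s] + [S0] * (N - j)
            gammas.append(reduce(kron, factors))
    return gammas


def is_clifford(gammas):
    """Check {Gamma^M, Gamma^N} = 2 delta^{MN} entry by entry."""
    n = len(gammas[0])
    for m, gm in enumerate(gammas):
        for p, gp in enumerate(gammas):
            ab, ba = matmul(gm, gp), matmul(gp, gm)
            for r in range(n):
                for c in range(n):
                    expected = 2 if (m == p and r == c) else 0
                    if abs(ab[r][c] + ba[r][c] - expected) > 1e-12:
                        return False
    return True


assert is_clifford(gamma_matrices(3))
```

Pure-Python nested lists are used here to keep the sketch free of external
dependencies; the matrices are small enough that this is not a concern.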


3.3 GKO Coset Construction 


In our search for representations of the conformal group, we were aided by the
fact that the Kac-Moody algebra gave us, via the Sugawara construction [Eq.
(3.1.16)], an explicit representation of the Virasoro algebra for certain values
of c. The value of c obtained by the Sugawara construction is always greater
than or equal to 1.

For $SU(2)_k$, for example, from Eq. (3.1.19), we find

$$c_{SU(2)} = \frac{3k}{k + 2}. \tag{3.3.1}$$

For an arbitrary Lie group G, we find

$$\text{rank}\, G \leq c_G \leq \dim G. \tag{3.3.2}$$

However, we can use a trick, called the Goddard, Kent, and Olive (GKO) “coset 
construction,” which allows us an explicit representation of all minimal models 
(as well as possibly all rational conformal field theories, which have rational 
values of h, c) [12]. 

Let us say that the group G contains a subgroup H. Now, let us construct
the generators of conformal transformations in terms of the current $J_G^a$
transforming under G as well as the current $J_H^a$ transforming under H. Our goal
is to construct the conformal generator associated with G/H.

Using the Sugawara construction for $T_G$ in terms of $J_G$ in Eq. (3.1.16), we
can calculate [see Eq. (3.1.21)]:

$$T_G(z) J_H^a(w) \sim \frac{J_H^a(w)}{(z - w)^2} + \frac{\partial J_H^a(w)}{z - w}. \tag{3.3.3}$$

But, we also know

$$T_H(z) J_H^a(w) \sim \frac{J_H^a(w)}{(z - w)^2} + \frac{\partial J_H^a(w)}{z - w}. \tag{3.3.4}$$



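Notice that the right-hand side of both equations is the same. Thus, if we
subtract the two equations, the right-hand side will equal zero; that is,
$T_G - T_H$ has no singular operator product with $J_H^a$, and hence none with
$T_H$, which is built out of the $J_H^a$.

If we write

$$T_G = (T_G - T_H) + T_H = T_{G/H} + T_H, \tag{3.3.5}$$

then we also have

$$[T_{G/H}, T_H] = 0. \tag{3.3.6}$$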

This last equation means that $T_G$ can be split into two mutually commuting
pieces, $T_{G/H}$ and $T_H$, both of which generate representations of the
conformal algebra (but with different values of c). If we now calculate the
operator product expansion for $T_G$, we find

$$T_G(z) T_G(w) \sim \frac{c_{G/H} + c_H}{2 (z - w)^4} + \cdots. \tag{3.3.7}$$

In other words, we now have the final expression [see Eq. (3.1.19)]:

$$c_{G/H} = c_G - c_H = \frac{k_G |G|}{k_G + h_G} - \frac{k_H |H|}{k_H + h_H}, \tag{3.3.8}$$

where h is the second Casimir of the adjoint representation of the group, and
the vertical bars represent the dimension of the group.

This is the desired result. Because $T_G$ has been decomposed into two mutually
commuting pieces, the central term of the coset algebra generated by
$T_{G/H}$ is given by the difference of the central terms for $T_G$ and $T_H$.
Obviously, $c_{G/H}$ can be less than one.

A simple example is given by the following:

$$G/H = SU(2)_k \otimes SU(2)_1 / SU(2)_{k+1}. \tag{3.3.9}$$

The value of the central term is therefore given by

$$c_{G/H} = \frac{3k}{k + 2} + 1 - \frac{3(k + 1)}{(k + 1) + 2} = 1 - \frac{6}{(k + 2)(k + 3)}, \tag{3.3.10}$$

which is precisely the discrete sequence of the minimal unitary models for
$m = k + 2 = 3, 4, 5, \ldots$ as in Eq. (2.5.8). Thus, we have the correspondence

$$\text{unitary series} \leftrightarrow SU(2)_k \otimes SU(2)_1 / SU(2)_{k+1}. \tag{3.3.11}$$

Yet another sequence is given by

$$G/H = SU(2)_k \otimes SU(2)_2 / SU(2)_{k+2}, \tag{3.3.12}$$


for which we have

$$c_{G/H} = \frac{3k}{k + 2} + \frac{3}{2} - \frac{3(k + 2)}{(k + 2) + 2} = \frac{3}{2} \left[ 1 - \frac{8}{(k + 2)(k + 4)} \right]. \tag{3.3.13}$$

We immediately recognize this as generating the superconformal N = 1
discrete series for $m = k + 2$ in Eq. (2.7.10). Thus, we have the correspondence

$$N = 1 \text{ unitary series} \leftrightarrow SU(2)_k \otimes SU(2)_2 / SU(2)_{k+2}. \tag{3.3.14}$$

The GKO coset construction, of course, gives us the power to generate much 
larger representations of the conformal group. It is believed that all rational 
conformal field theories, not just the minimal ones, can be constructed in this 
fashion. In fact, the GKO construction gives us one of the most powerful 
methods of unifying conformal field theories. 
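
The two coset central charges quoted above follow from Eq. (3.3.8) by simple
arithmetic, which the following Python sketch reproduces. It is an illustration
only; the function names are hypothetical, and the only input is the SU(2)
Sugawara value 3k/(k + 2) of Eq. (3.3.1).

```python
from fractions import Fraction


def c_su2(k):
    """Sugawara central charge of SU(2) at level k, Eq. (3.3.1)."""
    return Fraction(3 * k, k + 2)


def c_coset(k, l):
    """Central charge of SU(2)_k x SU(2)_l / SU(2)_{k+l} from Eq. (3.3.8)."""
    return c_su2(k) + c_su2(l) - c_su2(k + l)


for k in range(1, 50):
    m = k + 2
    # Minimal unitary series, Eq. (3.3.10).
    assert c_coset(k, 1) == 1 - Fraction(6, m * (m + 1))
    # N = 1 superconformal series, Eq. (3.3.13).
    assert c_coset(k, 2) == Fraction(3, 2) * (1 - Fraction(8, m * (m + 2)))
    # The Sugawara value itself stays between rank 1 and dimension 3.
    assert 1 <= c_su2(k) < 3
```

For k = 1 the first family gives c = 1/2 and the second gives c = 7/10, both of
which are less than one even though each Sugawara factor has c of at least 1.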

Although the coset construction has great power in unifying conformal field
theories, it has a fundamental weakness. To understand how to construct the
specific tensor representations involved in G/H, we must know the corresponding
representations in G and H, which are in general not known. Thus,
although the coset construction is one of the most general procedures yet found
to unify conformal field theories, in actual practice, it is sometimes not very
useful for specific calculations.


3.4 Conformal and Current Blocks 

As in the case of the minimal models, where we could solve for the correlation
functions by solving certain differential equations involving the conformal
blocks, we find a similar situation with regard to the WZW model. Again, we
will find that the lower-order correlation functions can be determined in terms
of hypergeometric functions by exploiting their transformation properties
alone.

Consider the following operator product expansions between a field $\phi_i$
which is primary under both the generators of the Kac-Moody and Virasoro algebras
[see Eq. (2.2.19)]:

$$
\begin{aligned}
T(w) \phi_i(z, \bar{z}) &\sim \frac{\Delta_i}{(w - z)^2} \phi_i(z, \bar{z}) + \frac{1}{w - z} \partial_z \phi_i(z, \bar{z}) + \cdots, \\
J^a(w) \phi_i(z, \bar{z}) &\sim \frac{t_i^a}{w - z} \phi_i(z, \bar{z}) + \cdots,
\end{aligned} \tag{3.4.1}
$$

where the first relation states that the field has conformal weight $\Delta_i$,
and the second states that the current $J^a$ acts as a generator of isospin
transformations on the field, which transforms under some representation of the
affine group. (We omit the parallel transformation properties involving
$\bar{z}$.)

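As before, we can use these relations to construct differential equations for
correlation functions. Let us insert T and $J^a$ in an N-point correlation
function. Then, we arrive at

$$
\begin{aligned}
\langle T(z) \phi_1(z_1, \bar{z}_1) \cdots \rangle &= \sum_{j=1}^{N} \left[ \frac{\Delta_j}{(z - z_j)^2} + \frac{1}{z - z_j} \frac{\partial}{\partial z_j} \right] \langle \phi_1(z_1, \bar{z}_1) \cdots \rangle, \\
\langle J^a(z) \phi_1(z_1, \bar{z}_1) \cdots \rangle &= \sum_{j=1}^{N} \frac{t_j^a}{z - z_j} \langle \phi_1(z_1, \bar{z}_1) \cdots \rangle.
\end{aligned} \tag{3.4.2}
$$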

Let us now observe that the generators T and $J^a$ are regular at infinity, which
means that as $z \to \infty$ these operators behave as

$$T(z) \sim z^{-4}, \qquad J^a(z) \sim z^{-2}. \tag{3.4.3}$$

If we now impose these asymptotic limits on the previous correlation
function relations, then

$$\sum_{j=1}^{N} \left[ z_j^{n+1} \frac{\partial}{\partial z_j} + (n + 1) \Delta_j z_j^n \right] \langle \phi_1(z_1, \bar{z}_1) \cdots \phi_N(z_N, \bar{z}_N) \rangle = 0 \tag{3.4.4}$$

for $n = -1, 0, +1$, and

$$\sum_{j=1}^{N} t_j^a \langle \phi_1(z_1, \bar{z}_1) \cdots \phi_N(z_N, \bar{z}_N) \rangle = 0. \tag{3.4.5}$$
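
The three conditions of Eq. (3.4.4) can be tested on the simplest case. The
Python sketch below is an illustration under stated assumptions: it takes N = 2,
keeps only the holomorphic dependence, assumes the familiar two-point form
$(z_1 - z_2)^{-2\Delta}$ for two fields of equal weight, and approximates the
derivatives by central finite differences. The function names are hypothetical.

```python
def two_point(z1, z2, delta):
    # Assumed two-point function of two fields with equal weight delta.
    return (z1 - z2) ** (-2 * delta)


def ward_residual(n, z1, z2, delta, h=1e-6):
    """Left-hand side of Eq. (3.4.4) for N = 2, via central differences."""
    d1 = (two_point(z1 + h, z2, delta) - two_point(z1 - h, z2, delta)) / (2 * h)
    d2 = (two_point(z1, z2 + h, delta) - two_point(z1, z2 - h, delta)) / (2 * h)
    g = two_point(z1, z2, delta)
    return (z1 ** (n + 1) * d1 + (n + 1) * delta * z1 ** n * g
            + z2 ** (n + 1) * d2 + (n + 1) * delta * z2 ** n * g)


z1, z2 = 1.3 + 0.4j, -0.7 + 0.2j
for n in (-1, 0, 1):
    # The residual vanishes up to finite-difference and rounding error.
    assert abs(ward_residual(n, z1, z2, 0.75)) < 1e-6
```

The three values of n correspond to translations, dilatations, and special
conformal transformations, respectively.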


Now, we would like to generalize this discussion and calculate Green's
functions for the WZW model. We now replace the $\phi$ appearing in Green's
function with $g(z, \bar{z})$, and use the operator product expansion

$$J^a(w) t^a g(z, \bar{z}) = \frac{c_g}{w - z} g(z, \bar{z}) + \sum_{n=1}^{\infty} (w - z)^{n-1} t^a J^a_{-n} g(z, \bar{z}), \tag{3.4.6}$$

where $t^a t^a = c_g I$. The term proportional to $w - z$ raised to the zeroth
power coincides with the operator $\partial_z$ multiplied by some constant, which
we call $\kappa$.

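Repeating many of the steps we used for the Green's function of a conformal
field in Eq. (3.4.2), we find the differential equation satisfied for the Green's
function of the WZW model

$$t_i^a \langle J^a(z) g(z_1, \bar{z}_1) \cdots g(z_N, \bar{z}_N) \rangle = \left( \frac{c_g}{z - z_i} + \sum_{j \neq i} \frac{t_i^a t_j^a}{z - z_j} \right) \langle g(z_1, \bar{z}_1) \cdots g(z_N, \bar{z}_N) \rangle. \tag{3.4.7}$$

Now, insert Eq. (3.4.6) into Eq. (3.4.7) and take the limit as $z \to z_i$. We
arrive at the desired relationship

$$\left( \kappa \frac{\partial}{\partial z_i} - \sum_{j \neq i} \frac{t_i^a t_j^a}{z_i - z_j} \right) \langle g(z_1, \bar{z}_1) \cdots g(z_N, \bar{z}_N) \rangle = 0. \tag{3.4.8}$$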

This is the Knizhnik-Zamolodchikov (KZ) relation [13], which is useful in
actually calculating explicit expressions for the correlation functions. (Another
way of deriving this important relation is to insert a certain null state into the
matrix element. We notice from Eq. (3.1.18) that
$L_{-1} - (1/\kappa) : J^a_m J^a_{-1-m} :$ must be zero. When this expression
acts on a primary state, the only terms that survive are
$L_{-1} - (2/\kappa) : J^a_{-1} J^a_0 :$. Inserting null states constructed out
of this operator into a correlation function, we find immediately the KZ
relationship.)


Example: Four-Point Correlation Function 


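As an example to illustrate these techniques, consider calculating
the four-point correlation function

$$G(z_i, \bar{z}_i) = \langle g(z_1, \bar{z}_1) g^{-1}(z_2, \bar{z}_2) g^{-1}(z_3, \bar{z}_3) g(z_4, \bar{z}_4) \rangle. \tag{3.4.9}$$

By the usual conformal arguments, we know that this expression can be reduced
to a function of the anharmonic ratios

$$x = \frac{(z_1 - z_2)(z_3 - z_4)}{(z_1 - z_4)(z_3 - z_2)}, \qquad \bar{x} = \frac{(\bar{z}_1 - \bar{z}_2)(\bar{z}_3 - \bar{z}_4)}{(\bar{z}_1 - \bar{z}_4)(\bar{z}_3 - \bar{z}_2)}. \tag{3.4.10}$$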
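
The reason the four-point function can depend only on this ratio is that the
ratio is unchanged by global conformal (Mobius) maps. The Python sketch below
checks this numerically; it is an illustration only, with hypothetical function
names and arbitrarily chosen sample points and map coefficients.

```python
def cross_ratio(z1, z2, z3, z4):
    """Anharmonic ratio x of Eq. (3.4.10)."""
    return (z1 - z2) * (z3 - z4) / ((z1 - z4) * (z3 - z2))


def mobius(z, a, b, c, d):
    # Global conformal map z -> (a z + b) / (c z + d), with a d - b c != 0.
    return (a * z + b) / (c * z + d)


# Sample points and map coefficients (arbitrary choices for the check).
points = [0.3 + 0.1j, -1.2 + 0.8j, 2.0 - 0.5j, 0.7 + 1.9j]
a, b, c, d = 1.0 + 2.0j, 0.5, -0.3j, 1.5

mapped = [mobius(z, a, b, c, d) for z in points]
assert abs(cross_ratio(*points) - cross_ratio(*mapped)) < 1e-10
```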


Thus, we find that we can reexpress this correlation function of the g's with
weight $\Delta$, in terms of a function of $x$ and $\bar{x}$:

$$G(z_i, \bar{z}_i) = [(z_1 - z_4)(z_2 - z_3)(\bar{z}_1 - \bar{z}_4)(\bar{z}_2 - \bar{z}_3)]^{-2\Delta} G(x, \bar{x}). \tag{3.4.11}$$

This is all that conformal invariance can tell us, because we have not yet
specified the representation to which the fields belong. To be specific, let the
group G = SU(N) and let g transform in the fundamental representation of
$SU(N) \otimes SU(N)$. Then, we can insert the explicit form of the indices for
the group elements into g:

$$g(z_i, \bar{z}_i) \to g^{\alpha_i}_{\beta_i}(z_i, \bar{z}_i). \tag{3.4.12}$$

Then, the $G(x, \bar{x})$ matrix can be written as

$$G(x, \bar{x}) = \sum_{A,B=1,2} (I_A)(\bar{I}_B)\, G_{AB}(x, \bar{x}), \tag{3.4.13}$$

where

$$I_1 = \delta_{\alpha_1 \alpha_2} \delta_{\alpha_3 \alpha_4}, \qquad I_2 = \delta_{\alpha_1 \alpha_3} \delta_{\alpha_2 \alpha_4}. \tag{3.4.14}$$

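Using the KZ relation, let us now contract over the indices and reduce the
equations to a set of two $2 \times 2$ matrix equations

$$\frac{\partial G}{\partial x} = \left[ \frac{1}{x} P + \frac{1}{x - 1} Q \right] G, \qquad \frac{\partial G}{\partial \bar{x}} = G \left[ \frac{1}{\bar{x}} P^T + \frac{1}{\bar{x} - 1} Q^T \right], \tag{3.4.15}$$

where T represents taking the transpose of the matrices, which are defined
by [13]:

$$P = \frac{1}{2N\kappa} \begin{pmatrix} N^2 - 1 & N \\ 0 & -1 \end{pmatrix}, \qquad Q = \frac{1}{2N\kappa} \begin{pmatrix} -1 & 0 \\ N & N^2 - 1 \end{pmatrix}, \tag{3.4.16}$$

where $\kappa = -\frac{1}{2}(N + k)$.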

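Fortunately, this system of equations is completely solvable. We can find a
solution in terms of hypergeometric functions. Let us decompose $G(x, \bar{x})$ as
follows:

$$G_{AB}(x, \bar{x}) = \sum_{p,q=0,1} U_{pq} F_A^{(p)}(x) F_B^{(q)}(\bar{x}) \tag{3.4.17}$$

for some constant matrix U. Then, the complete solution is given by [13]:

$$
\begin{aligned}
F_1^{(0)}(x) &= x^{-2\Delta} (1 - x)^{\Delta_1 - 2\Delta} F\left( \frac{1}{2\kappa}, -\frac{1}{2\kappa}; 1 + \frac{N}{2\kappa}; x \right), \\
F_2^{(0)}(x) &= -(2\kappa + N)^{-1} x^{1 - 2\Delta} (1 - x)^{\Delta_1 - 2\Delta} F\left( 1 - \frac{1}{2\kappa}, 1 + \frac{1}{2\kappa}; 2 + \frac{N}{2\kappa}; x \right), \\
F_1^{(1)}(x) &= x^{\Delta_1 - 2\Delta} (1 - x)^{\Delta_1 - 2\Delta} F\left( -\frac{N - 1}{2\kappa}, -\frac{N + 1}{2\kappa}; 1 - \frac{N}{2\kappa}; x \right), \\
F_2^{(1)}(x) &= -N x^{\Delta_1 - 2\Delta} (1 - x)^{\Delta_1 - 2\Delta} F\left( -\frac{N - 1}{2\kappa}, -\frac{N + 1}{2\kappa}; -\frac{N}{2\kappa}; x \right),
\end{aligned} \tag{3.4.18}
$$

where

$$\Delta = \frac{N^2 - 1}{2N(N + k)}, \qquad \Delta_1 = \frac{N}{N + k}. \tag{3.4.19}$$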

In summary, we find that large classes of correlation functions for the WZW 
model could be completely solved using the KZ relation. 
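
One consistency check on this solution is that the leading powers of x are the
eigenvalues of the matrix P. The Python sketch below is an illustration under
stated assumptions: it takes the diagonal entries of P to be
$(N^2 - 1)/(2N\kappa)$ and $-1/(2N\kappa)$ with P triangular (the second entry is
an assumption where the source equation is hard to read), and it uses
$\kappa = -\frac{1}{2}(N + k)$ together with the weights of Eq. (3.4.19). The
function name is hypothetical.

```python
from fractions import Fraction


def kz_data(N, k):
    kappa = -Fraction(N + k, 2)                       # kappa = -(N + k)/2
    delta = Fraction(N * N - 1, 2 * N * (N + k))      # weight Delta of g
    delta1 = Fraction(N, N + k)                       # weight Delta_1
    return kappa, delta, delta1


for N in range(2, 8):
    for k in range(1, 8):
        kappa, delta, delta1 = kz_data(N, k)
        # Leading exponent of the p = 0 solutions near x = 0.
        assert Fraction(N * N - 1) / (2 * N * kappa) == -2 * delta
        # Leading exponent of the p = 1 solutions near x = 0.
        assert Fraction(-1) / (2 * N * kappa) == delta1 - 2 * delta
```

The two exponents $-2\Delta$ and $\Delta_1 - 2\Delta$ are exactly the powers of
x multiplying the hypergeometric functions in the solution.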


3.5 Racah Coefficients for Rational 
Conformal Field Theory 

In Chapter 2, we saw that, in general, there were an infinite number of primary
fields for certain values of h and c. However, we also saw that the
representations of the conformal group simplified enormously if we had a finite
number of primary fields. In this case, the unitary minimal models were unique, and
we could also calculate exactly their correlation functions.

When we enlarge our discussion to include fields primary under the Kac-Moody
algebra, the minimal models are no longer the only models possessing
a finite number of primaries. In general, for an affine Lie algebra to have a
finite number of primaries, the values of c and h must be rational numbers.
Hence, the goal of this chapter is to study what are called rational conformal
field theories [14].

Just a few of the examples of rational conformal field theories include the 
following: 

(1) The WZW models have central charges given by Eq. (3.1.19):

$$c = \frac{k \dim G}{c_v + k}, \tag{3.5.1}$$

where $c_v$ is the dual Coxeter number, or the Casimir of the adjoint
representation.

(2) The unitary discrete series has a central charge given by Eq. (2.5.8):

$$c = 1 - \frac{6}{m(m + 1)}, \qquad m = 3, 4, \ldots. \tag{3.5.2}$$

(3) The parafermion models (to be discussed in Chapter 5) have central
charge

$$c = \frac{2(N - 1)}{N + 2}. \tag{3.5.3}$$

(4) The rational toroidal compactification has central charge

$$c = d, \tag{3.5.4}$$

where d is the dimension of the torus.

(5) The various coset constructions (some of the most phenomenologically
interesting ones are constructions based on "free fermions" and "free
bosons") [15-18] can be represented by lattices when we bosonize all
the fields.

(6) The allowed orbifolds of the above exist when there is a discrete symmetry
[19]. (Orbifolds are not manifolds; they have singularities, which arise
when we divide a manifold by the action of some discrete symmetry.)

(7) The tensor products of the above are another example, in which the central
charge is the sum of the individual central charges:

$$c_{\text{total}} = \sum_i c_i. \tag{3.5.5}$$
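
The central charges in this list are simple rational functions, and a few
coincidences among them can be seen by direct evaluation. The Python sketch below
is an illustration with hypothetical function names; the equalities it checks are
statements about central charges only, not about the full operator content of the
theories.

```python
from fractions import Fraction


def c_wzw(k, dim_g, c_v):
    return Fraction(k * dim_g, c_v + k)      # Eq. (3.5.1)


def c_minimal(m):
    return 1 - Fraction(6, m * (m + 1))      # Eq. (3.5.2), m = 3, 4, ...


def c_parafermion(N):
    return Fraction(2 * (N - 1), N + 2)      # Eq. (3.5.3)


def c_tensor(*parts):
    return sum(parts, Fraction(0))           # Eq. (3.5.5)


assert c_minimal(3) == Fraction(1, 2)                 # first unitary minimal model
assert c_parafermion(2) == c_minimal(3)               # same central charge, 1/2
assert c_parafermion(3) == c_minimal(5)               # same central charge, 4/5
assert c_tensor(c_minimal(3), c_minimal(3)) == 1      # two copies add up to c = 1
assert c_wzw(1, 3, 2) == 1                            # SU(2) at level 1
```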


The advantage of studying the rational conformal field theories, like the 
simpler minimal models, is that their modular invariant partition functions can 
usually be computed exactly. Thus, they serve as a laboratory to study the more 
complicated compactifications found in string theory. (The proof that rational 
conformal field theories have a finite number of primary fields is given in 
Chapter 4.) 

Because of the close resemblance between a conformal field theory and an
ordinary Lie algebra, it is possible to take tensor products of representations,
which in turn yield other representations. In this fashion, we can construct
the analog of Clebsch-Gordan coefficients and Racah 6j symbols by taking
repeated tensor products.

However, there is a slight, but important, complication. In general, we wish
to take the tensor product of two Verma modules $[\phi_i]$ and $[\phi_j]$ that
have the same central charge c. If we carelessly define the product of these two
representations, we will find that the resulting representation has a central
charge 2c, which is the sum of the two individual central charges. However,
we wish to find product representations in which the central charge is still c.





As in ordinary Lie algebra theory, the goal is to construct a series of
self-consistency conditions by taking tensor products of representations in
different orders, decomposing them, and setting them equal at the end. In this
way, we will find a series of polynomial consistency equations in terms of the
conformal blocks.

There are two operations that we wish to perform on these conformal blocks,
called B (corresponding to braiding or interchanging two points) and F
(corresponding to pinching the graph, that is, making an s-channel graph into a
t-channel graph).

On the space of conformal blocks, we may now write a representation of
the B-twist operation on a four-point function, representing the interaction of
conformal fields labeled by i, j, k, l.

Let us begin with the standard four-point function

$$\langle 0 | \phi_i(\infty) \phi_j(z_1) \phi_k(z_2) \phi_l(0) | 0 \rangle = \langle i | \phi_j(z_1) \phi_k(z_2) | l \rangle, \tag{3.5.6}$$

where we are taking the correlation function of four primary fields.

In Eq. (2.4.13), we obtained identities by contracting the fields in different 
orders and then setting them equal. For example, we may first contract the i, j 
fields and the k, l fields. The second way is to contract the i, k and the j, l fields. 
Since the final answer must be the same, we have established a relationship 
between the conformal blocks. 

If we select the pth element in the sum, we find that the two ways of
contracting the matrix element via the braiding operator B yield the following
relationship:

$$\Phi^j_{ip}(z_1) \Phi^k_{pl}(z_2) = \sum_q B_{pq} \begin{bmatrix} j & k \\ i & l \end{bmatrix} \Phi^k_{iq}(z_2) \Phi^j_{ql}(z_1). \tag{3.5.7}$$

Viewing Fig. 3.1(a), it is easy to see that the effect of the B operation is
to interchange $z_1$ and $z_2$. Notice that the sum over q corresponds to summing
over all intermediate states.

If we write the B matrix as a mapping from the space of one set of three-point
couplings to another, it can be symbolically represented as

$$B_{pq} \begin{bmatrix} j & k \\ i & l \end{bmatrix} : V^i_{jp} \otimes V^p_{kl} \to V^i_{kq} \otimes V^q_{jl}. \tag{3.5.8}$$


Similarly, we can introduce the operation F in Fig. 3.1(b), which pinches a
four-point graph [see Eq. (2.4.13)]. On the space of conformal blocks, it has
the specific representation

$$\Phi^i_{lp}(z_1) \Phi^j_{pr}(z_2) = \sum_q F_{pq} \begin{bmatrix} i & j \\ l & r \end{bmatrix} \Phi^q_{lr}(z_2) \Phi^i_{qj}(z_1 - z_2) \tag{3.5.9}$$





FIGURE 3.1.


(where, for convenience, we have suppressed the summation over descendant
fields). Symbolically, this can be represented as

$$V^i_{jp} \otimes V^p_{kl} \to V^i_{ql} \otimes V^q_{jk}. \tag{3.5.10}$$

These identities are quite interesting because, by iterating a series of
manipulations with the B and F operators, we can deduce a large number of
nontrivial relations. These identities, in turn, are analogous to the identities
found among 3j Clebsch-Gordan and 6j Racah coefficients in the usual SU(2) theory
of addition of angular momentum.

We can also introduce a new operator $\Omega$, which simply twists two external
legs around an internal leg. Actually, $\Omega$ is not a new operator, but can be
represented by setting one of the legs of the B matrix to the identity, that is,

$$B \begin{bmatrix} j & k \\ i & 0 \end{bmatrix} \tag{3.5.11}$$

defines the map

$$\Omega^i_{jk} : V^i_{jk} \to V^i_{kj}, \tag{3.5.12}$$

that is, $\Omega$ twists the j, k lines.


An explicit representation for $\Omega$ can be found because it satisfies a
number of constraints. We find


pq 


j k 

i / 


$$\Omega^i_{jk} = \xi\, e^{i\pi(\Delta_j + \Delta_k - \Delta_i)}, \qquad (\Omega^i_{jk})^2 = e^{2\pi i(\Delta_j + \Delta_k - \Delta_i)}, \tag{3.5.13}$$

where $\xi = \pm 1$ depending on which conformal field theory we are analyzing.

We can now deduce nontrivial identities among the F, B, $\Omega$ matrices. In
Fig. 3.2, we see how a four-point block can be deformed in two ways, by
the operation of BF or F$\Omega$, leading to the same result. Equating the two





FIGURE 3.2.


operations, we obtain a representation of this figure by 





P' 


l W', 


k j 
i l 


= F } 


pq 


7 * k 

i l 


e -in€(A k +&j-& q ) 


(3.5.14) 


where $\epsilon$ represents the sense of the braiding.

In order to obtain more identities, we also note that the B and F operations 
can be pictorially represented as in Fig. 3.3. Since both B and F transform 
four-point functions into other four-point functions, in Fig. 3.3 we have simply 
sandwiched these two four-point functions back to back. For each B or F, there 
is a unique representation in terms of the diagrams in Fig. 3.3. 

With this new identification for B and F, we can write an identity 
corresponding to Fig. 3.4: 


j: 

’<7i 

Ji 

7*4 

7*5 _ 

(0 



u 

js _ 

m,* j 

i 4 ] <e) M£ i]' (3si5) 


(We can read this identity by tracing the indices in Fig. 3.4. In the first diagram 
in Fig. 3.4, the F operation appears in the lower left, and the B operation 
appears in the upper right. In the second diagram in Fig. 3.4, the F operation 
appears on the upper right, and the two B operations appear on the left and 
lower right.) 

Now set $j_5 = 0$. Because the B matrix reduces to a twist $\Omega$, we can
compare the previous two equations and find an identity between the B and F
matrices


pq 




-in€(Ai + A k -A p -A q ) ^ 


pq 


j i 

i k 


(«). 


(3.5.16) 






FIGURE 3.4. 


Now let us analyze Fig. 3.5, where we have two sequences of operations. 
Notice that the beginning and final result is the same. By explicitly representing 
each of the five manipulations shown in the illustration, we will be able to 
represent the “pentagon relations” [14]. 

By carefully labeling all the legs and decomposing each of the manipulations 
in terms of the F and B matrices, it is easy to show an identity between BBF 
and FB , given explicitly by 



(3.5.17) 






FIGURE 3.5. 


Similarly, we can also construct the "hexagon relations" by tracing the
manipulations shown in Fig. 3.6. We equate the action of F$\Omega$F and
$\Omega$F$\Omega$. Explicitly, the hexagon identity becomes


&Tk( € )F mn 


j 

i 




l 

k 


& l kM) F rn 


k j 
i l 

(3.5.18) 


At the genus 0 level, one can show that one pentagon identity and two 
hexagon identities are enough to completely specify all possible higher-order 
identities. This means that octagon (and higher) identities for tree graphs can 
be reduced to combinations of pentagon and hexagon identities. (For the higher 
loop graphs, one must introduce two more operators, which we define in the 
next chapter, to completely specify all possible polynomial equations.) 

One interesting observation is that we have now reduced rational conformal 
field theory to a set of finite polynomial equations. This, in itself, is remarkable 
because conformal field theory is usually defined over an infinite-dimensional 
space. By reducing everything to the finite-dimensional space of primary fields, 
we have a remarkable way in which to specify any rational conformal field 
theory strictly through their pentagon and hexagon graphs. 

In some sense, this may even serve as an alternative, finite-dimensional 
definition of rational conformal field theory. 


Example: Ising Model 

One of the simplest examples of this construction is the Ising model, which
is equal to the first minimal model at criticality. We recall that for
$c = \frac{1}{2}$, the allowed values of h for the primary fields were
$\frac{1}{2}$ and $\frac{1}{16}$.






FIGURE 3.6. 


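Let us represent the Verma modules associated with each primary field as
$[\psi]$ and $[\sigma]$. Then, the fusion rules can be represented as

$$
\begin{aligned}
[\psi] \times [\psi] &= [1], \\
[\psi] \times [\sigma] &= [\sigma], \\
[\sigma] \times [\sigma] &= [1] + [\psi].
\end{aligned} \tag{3.5.19}
$$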


Let us now represent the B and F matrices in terms of these primary fields. 
There is a certain freedom in choosing the values of these matrices, so we can 
always choose a gauge by setting 


xjr 


= F 


xfr 


a 

a 


= F 


a 


xfr 

a 


= F 


a 


a 


= 1. (3.5.20) 


By solving the polynomial equations, we arrive at [14]: 

1 f 1 


[f f\~ ’ 

; t\- 


1, 


B 


a 

a 


a 

a 


\ -.)• 

\/2 G l)' 


V2 \ 


(3.5.21) 


3.6 Summary 

Systems with a finite number of primary fields give us a theoretical laboratory 
for testing our ideas about string vacua. In particular, the minimal models 
exhaust all possible unitary representations of the conformal group with a finite 
number of primary fields, and they have correlation functions and fusion rules 
that can be calculated exactly. 

To go beyond minimal models, we study Kac-Moody algebras. It is possible 
to find new representations with a finite number of fields that are primary with 
respect to both the Virasoro and Kac-Moody algebras. 

One way to find representations of these algebras is to take the string and 
let it move on some curved manifold 

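$$ L = \frac{1}{\pi}\, G_{\mu\nu}(X)\, \partial_a X^\mu\, \partial_b X^\nu\, g^{ab} + \cdots. \tag{3.6.1} $$

Specifically, we will take the following σ model action: 

$$ S = \frac{1}{4\lambda^2}\int \mathrm{Tr}\left(\partial_a g^{-1}\, \partial^a g\right) d^2\xi + k\,\Gamma(g), \tag{3.6.2} $$

where the last term is a Wess-Zumino term 

$$ \Gamma(g) = \frac{1}{24\pi}\int d^3X\, \epsilon^{\alpha\beta\gamma}\, \mathrm{Tr}\left[(g^{-1}\partial_\alpha g)(g^{-1}\partial_\beta g)(g^{-1}\partial_\gamma g)\right], \tag{3.6.3} $$

which is integrated over a three-dimensional disk whose boundary 
is two-dimensional space-time. 

For k = 1, 2, ..., the theory becomes effectively massless and possesses an 
infrared-stable fixed point at 

$$ \lambda^2 = 4\pi/k. \tag{3.6.4} $$

The currents for the WZW model are 

$$ J(z) = -\tfrac{1}{2} k\, \partial_z g\, g^{-1} = J^a t^a, \qquad \bar J(\bar z) = -\tfrac{1}{2} k\, g^{-1} \partial_{\bar z} g = \bar J^a t^a. \tag{3.6.5} $$

The J's generate an algebra given by 

$$ [J^a_n, J^b_m] = f^{abc} J^c_{n+m} + \tfrac{1}{2} k n\, \delta^{ab}\, \delta_{n+m,0}. \tag{3.6.6} $$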

One of the most important representations of the Virasoro generators is 
obtained by 

$$ T(z) = \frac{1}{2\kappa} :J^a(z) J^a(z): \; = \frac{1}{2\kappa} \sum_{n,m} :J^a_m J^a_{n-m}:\, z^{-n-2}, \tag{3.6.7} $$

where 

$$ \kappa = -\tfrac{1}{2}(c_V + k), \qquad f^{acd} f^{bcd} = c_V\, \delta^{ab}, \tag{3.6.8} $$

and $c_V$ is called the second Casimir of the adjoint representation of the Lie 
algebra. 

In component form, the Virasoro generators are 

$$ L_n = -\frac{1}{c_V + k} \sum_{m=-\infty}^{\infty} :J^a_m J^a_{n-m}:. \tag{3.6.9} $$

If we commute two generators of the Virasoro algebra written in Sugawara 
form, we find 

$$ c = \frac{kD}{c_V + k}, \tag{3.6.10} $$

where D is the dimension of the group. 

One of the most powerful methods of generating new conformal field 
theories is the GKO coset method, which uses the representations of a group 
G and a subgroup H. 

Using the Sugawara construction for $T_G$ in terms of $J_G$, we can calculate 

$$ T_G(z)\, J^a_H(w) \sim \frac{J^a_H(w)}{(z-w)^2} + \frac{\partial J^a_H(w)}{z-w}. \tag{3.6.11} $$

But, we also know 

$$ T_H(z)\, J^a_H(w) \sim \frac{J^a_H(w)}{(z-w)^2} + \frac{\partial J^a_H(w)}{z-w}. \tag{3.6.12} $$

Notice that the right-hand sides of both equations are the same. Thus, if we 
subtract the two equations, the right-hand side will equal zero. 

If we write 

$$ T_G = (T_G - T_H) + T_H = T_{G/H} + T_H, \tag{3.6.13} $$

then we also have 

$$ [T_{G/H}, T_H] = 0. \tag{3.6.14} $$

This last equation means that $T_G$ can be split into two mutually commuting 
pieces, $T_{G/H}$ and $T_H$, both of which generate representations of the conformal 
algebra (but with different values of c). If we now calculate the operator product 
expansion for $T_G$, we find 

$$ T_G(z)\, T_G(w) \sim \frac{c_{G/H} + c_H}{2(z-w)^4} + \cdots. \tag{3.6.15} $$

In other words, we now have the final expression 

$$ c_{G/H} = c_G - c_H = \frac{k_G |G|}{k_G + h_G} - \frac{k_H |H|}{k_H + h_H}. \tag{3.6.16} $$
A simple example of how the coset method can generate conformal field 
theories is given by the following: 

$$ G/H = SU(2)_k \otimes SU(2)_1 / SU(2)_{k+1}. \tag{3.6.17} $$

The value of the central term is therefore given by 

$$ c_{G/H} = \frac{3k}{k+2} + 1 - \frac{3(k+1)}{(k+1)+2} = 1 - \frac{6}{(k+2)(k+3)}, \tag{3.6.18} $$

which is precisely the discrete sequence of the minimal unitary models for 
m = k + 2 = 3, 4, 5, .... 
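
The arithmetic behind Eq. (3.6.18) can be confirmed with exact rational numbers. The sketch below assumes only the SU(2) central charge 3k/(k+2) used above; the function names are illustrative.

```python
from fractions import Fraction


def c_su2(k):
    """Central charge of the SU(2) level-k model, 3k/(k+2)."""
    return Fraction(3 * k, k + 2)


def c_coset(k, level):
    """Central charge of SU(2)_k x SU(2)_level / SU(2)_{k+level}, i.e. c_G - c_H."""
    return c_su2(k) + c_su2(level) - c_su2(k + level)


def c_minimal(m):
    """Central charge of the m-th unitary minimal model, 1 - 6/[m(m+1)]."""
    return 1 - Fraction(6, m * (m + 1))


for k in range(1, 6):
    print(k, c_coset(k, 1), c_minimal(k + 2))
```

For k = 1 both columns give c = 1/2, the Ising value.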



Thus, we have the correspondence 

$$ \text{unitary series} \;\leftrightarrow\; SU(2)_k \otimes SU(2)_1 / SU(2)_{k+1}. \tag{3.6.19} $$

Yet another sequence is given by 

$$ G/H = SU(2)_k \otimes SU(2)_2 / SU(2)_{k+2}, \tag{3.6.20} $$

for which we have 

$$ c_{G/H} = \frac{3k}{k+2} + \frac{3}{2} - \frac{3(k+2)}{(k+2)+2} = \frac{3}{2}\left(1 - \frac{8}{(k+2)(k+4)}\right). \tag{3.6.21} $$

We immediately recognize this as generating the superconformal N = 1 
discrete series for m = k + 2. Thus, we have the correspondence 

$$ N = 1 \text{ unitary series} \;\leftrightarrow\; SU(2)_k \otimes SU(2)_2 / SU(2)_{k+2}. \tag{3.6.22} $$
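
The same exact-arithmetic check applies to the N = 1 sequence. This is a sketch under the assumption that the subgroup sits at level k + 2 (the sum of the two levels in the numerator), which is what the central charge in Eq. (3.6.21) requires; the function names are illustrative.

```python
from fractions import Fraction


def c_su2(k):
    """Central charge of the SU(2) level-k model, 3k/(k+2)."""
    return Fraction(3 * k, k + 2)


def c_n1_coset(k):
    """Central charge of SU(2)_k x SU(2)_2 / SU(2)_{k+2}."""
    return c_su2(k) + c_su2(2) - c_su2(k + 2)


def c_n1_series(m):
    """N = 1 superconformal discrete series, (3/2)(1 - 8/[m(m+2)])."""
    return Fraction(3, 2) * (1 - Fraction(8, m * (m + 2)))


for k in range(1, 6):
    print(k, c_n1_coset(k), c_n1_series(k + 2))
```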

For rational conformal field theories, like the minimal models, one can 
construct Ward-like identities on the correlation functions which allow us to solve 
them exactly. If we insert J into a correlation function and then commute it 
past the fields, we find 

$$ \left(\kappa \frac{\partial}{\partial z_i} - \sum_{j \neq i} \frac{t^a_i t^a_j}{z_i - z_j}\right) \langle g(z_1, \bar z_1) \cdots g(z_N, \bar z_N) \rangle = 0. \tag{3.6.23} $$

This is the Knizhnik-Zamolodchikov relation, which is useful in actually 
calculating explicit expressions for the correlation functions. 

Finally, it can be shown that if a conformal field theory has a finite number 
of primary fields then h, c are rational. These are called rational field theories 
and include the WZW and minimal models as subsets. It is believed all known 
rational conformal field theories can be generated by the coset method. 

Rational conformal field theories can be treated much like ordinary Lie 
algebras, where we calculate Clebsch-Gordan coefficients and Racah coefficients. 
On the space of conformal blocks arising from four-point correlators, we can 
define two operators, B and F, which have well-defined geometric 
interpretations. Starting with an initial conformal block, there is more than one way 
in which to use the B and F operators to deform the original block into a 
new one. By equating the various ways in which this is done, we therefore 
obtain a series of identities, such as the hexagon identity, which have a direct 
counterpart in classical Lie algebras. 

The advantage of this technique for rational conformal field theories is that 
we can find a unique prescription for calculating all such blocks and hence 
calculate all correlation functions. Like the minimal models for conformal 
theory, the rational conformal field theories for the Kac-Moody theory can be 
completely solved by these techniques. 





References 


1. For a review of Kac-Moody algebras, see P. Goddard and D. Olive, Int. J. Mod. 
Phys. A1, 303 (1986). 

2. Th. Kaluza, Sitzungsber. Preuss. Akad. Wiss. K1, 966 (1921). 

3. E. Witten, Comm. Math. Phys. 92, 455 (1984). 

4. S. P. Novikov, Uspekhi Mat. Nauk 37, 3 (1982). 

5. D. Gepner and E. Witten, Nucl. Phys. B278, 493 (1986). 

6. V. Kac, Infinite-Dimensional Lie Algebras, Birkhäuser, Basel (1983). 

7. R. V. Moody, Bull. Amer. Math. Soc. 73, 217 (1967). 

8. I. Frenkel and V. G. Kac, Invent. Math. 62, 23 (1980). 

9. M. Halpern, Phys. Rev. D12, 1684 (1975). 

10. G. Segal, Comm. Math. Phys. 80, 301 (1981). 

11. V. A. Kostelecky, O. Lechtenfeld, W. Lerche, S. Samuel, and S. Watamura, Nucl. 
Phys. B288, 173 (1987). 

12. P. Goddard, A. Kent, and D. Olive, Comm. Math. Phys. 103, 105 (1986). 

13. V. Knizhnik and A. B. Zamolodchikov, Nucl. Phys. B247, 83 (1984). 

14. G. Moore and N. Seiberg, Lectures on RCFT, 1989 Trieste Summer School; 
Phys. Lett. 212B, 451 (1988); Nucl. Phys. B313, 16 (1989); Comm. Math. Phys. 
123, 77 (1989); Phys. Lett. 220B, 422 (1989). 

15. K. S. Narain, Phys. Lett. 169B, 41 (1986). 

16. W. Lerche, D. Lüst, and A. N. Schellekens, Nucl. Phys. B287, 477 (1987). 

17. H. Kawai, D. C. Lewellen, and S.-H. Tye, Phys. Rev. Lett. 57, 1832 (1986); Phys. 
Rev. D34, 3794 (1986); Nucl. Phys. B288, 1 (1987). 

18. I. Antoniadis, C. P. Bachas, and C. Kounnas, Nucl. Phys. B289, 87 (1987). 

19. L. Dixon, J. Harvey, C. Vafa, and E. Witten, Nucl. Phys. B261, 651 (1985); B274, 
285 (1986). 



CHAPTER 4 


Modular Invariance and 
the A-D-E Classification 


4.1 Dehn Twists 

Up to now, we have seen that conformal invariance by itself is not sufficiently 
restrictive to give us the fusion rules and conformal blocks of any conformal 
field theory. Thus, we need to add an extra constraint to fix the theory. The 
next constraint that we will impose is modular invariance [1], a powerful tool 
which will allow us to fix many of the features of a conformal field theory and 
which yields many new surprises. 

To understand why we want strings to be modular invariant, recall that the 
string amplitude is given as a path integral sum [Eq. (1.2.1)] over all possible 
conformally distinct Riemann surfaces [2, 3], so that we do not introduce 
overcounting into the path integral. However, we should distinguish between 
two types of transformations. 

The familiar class of conformal transformations are those that can smoothly 
be deformed back to the identity. By smoothly changing the parameters that 
typify a conformal transformation, we can slowly change it back to the identity. 

However, there are also other kinds of conformal transformations that we 
must subtract, which are those that cannot be smoothly deformed back to the 
identity, the global transformations. To visualize these global transformations, 
notice that there are only two ways to draw circles on a torus, called the a 
cycle and b cycle, such that the circles are not contractible to a point. If we 
slice the torus around the a cycle and b cycle and unravel the torus, we find 
that we have a parallelogram, such that points on opposite sides are identified 
with each other. Thus, a torus is topologically equivalent to a parallelogram 
with properly identified sides. 

The simplest global transformation for a torus is given by a Dehn twist, 
which involves slicing a torus along an a cycle, rotating one of the open ends 
by 360°, and then resealing the cut. Equivalently, we could have sliced the 
torus along a b cycle, rotated one end of the slice by 360°, and then resealed 
the cut. 

FIGURE 4.1. 

Notice that if we trace the motion of the various points on the torus we can see 
that these two transformations cannot be smoothly mapped back to the identity. 
Moreover, these two transformations can be viewed from the perspective of 
the parallelogram. 

Let us place the parallelogram onto the complex z plane as in Fig. 4.1. 
Notice that we have arbitrarily fixed the bottom leg to have a length equal to 1. 
The parallelogram can then be uniquely fixed by placing the upper left corner 
at the point τ, a complex number. Given a fixed value of τ, we have uniquely 
specified a torus. 

However, under the action of Dehn twists, the torus is mapped back into 
itself, so we must demand that all our amplitudes be modular invariant, that 
is, invariant under the action of Dehn twists. Specifically, if we make the 
transformation called T, 

$$ T: \tau \to \tau + 1, \tag{4.1.1} $$

we see that the upper left-hand corner moves to the right by one unit, but this 
just corresponds to a Dehn twist. 

A bit more delicate is the other Dehn twist, because we have arbitrarily 
fixed the length of the bottom leg to be 1. The other Dehn twist corresponds 
to interchanging the a cycle and b cycle, or flipping the parallelogram onto 
its side. However, if we place the parallelogram on its side, then we have to 
rescale the entire parallelogram so that its bottom side has length 1. Then, if 
we now calculate the position of its upper left corner, we find that it has made 
the following transformation: 

$$ S: \tau \to -1/\tau. \tag{4.1.2} $$

Thus, we demand that all loop amplitudes of the string be invariant under the 
T and S transformations (or else we will be counting the same torus an infinite 
number of times). 

The group generated by repeatedly taking S and T transformations is called 
the modular group SL(2, Z) [4-6], which is specified by the following: 

$$ \tau \to \frac{a\tau + b}{c\tau + d}, \qquad (a, b, c, d \in \mathbb{Z};\ ad - bc = 1). \tag{4.1.3} $$

(The transformation remains the same if we simultaneously reverse the sign 
of a, b, c, d. We can thus take the smaller group PSL(2, Z) = SL(2, Z)/Z_2 
as the modular group.) 
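
The statements above can be made concrete with 2 x 2 integer matrices. The sketch below assumes the usual matrix representatives of T and S (they are not written out in the text) and checks the relations S² = (ST)³ = -1, where -1 acts trivially on τ.

```python
def matmul(A, B):
    """Multiply two 2x2 integer matrices given as nested tuples."""
    return tuple(
        tuple(sum(A[i][k] * B[k][j] for k in range(2)) for j in range(2))
        for i in range(2)
    )


def act(M, tau):
    """Apply tau -> (a*tau + b)/(c*tau + d) for M = ((a, b), (c, d))."""
    (a, b), (c, d) = M
    return (a * tau + b) / (c * tau + d)


T = ((1, 1), (0, 1))            # tau -> tau + 1
S = ((0, -1), (1, 0))           # tau -> -1/tau
MINUS_ONE = ((-1, 0), (0, -1))  # acts as the identity on tau

S2 = matmul(S, S)
ST = matmul(S, T)
ST3 = matmul(ST, matmul(ST, ST))

tau = 0.3 + 1.2j
print(S2 == MINUS_ONE, ST3 == MINUS_ONE, act(S, tau), act(T, tau))
```

Because both relations hold only up to the sign matrix -1, they become S² = (ST)³ = 1 in PSL(2, Z).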

The parameters of the group are integers, not real numbers, which accounts 
for the difficulty in writing the representations of this group. 

In ordinary Lie group theory, we wish to write the character, a function 
that immediately tells us how large a representation of an algebra is. For a 
Verma module [7, 8], we wish to write a function of x such that its nth Taylor 
coefficient tells us how many elements the representation has at the nth level. 
Recall that the Verma module, for example, at the third level, is given by 

$$ L_{-1}^3 |0\rangle, \quad L_{-1} L_{-2} |0\rangle, \quad L_{-3} |0\rangle, \tag{4.1.4} $$

that is, the number of elements of an irreducible Verma module at the nth level 
(with no null states) is given by the number of ways in which n can be broken 
down into positive integers. This number is called p(n), or the partition of 
the integer n. 

Thus, we can define a function of x such that the coefficient of $x^n$ tells us 
the number of elements in the Verma module $V_n$ at the nth level 

$$ \mathrm{Tr}\, x^{L_0} = \sum_{n=0}^{\infty} x^n \dim V_n = \sum_{n=0}^{\infty} x^n p(n). \tag{4.1.5} $$

However, let us use the formula 

$$ \prod_{n=1}^{\infty} (1 - x^n)^{-1} = \sum_{n=0}^{\infty} x^n p(n). \tag{4.1.6} $$
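
The product formula (4.1.6) doubles as an algorithm for p(n): multiplying by each factor 1/(1 - x^k) is a running sum over the coefficient list. A minimal sketch, with an illustrative function name:

```python
def partitions_up_to(n_max):
    """Return [p(0), ..., p(n_max)] by expanding prod_{k>=1} (1 - x^k)^(-1)."""
    p = [1] + [0] * n_max
    for part in range(1, n_max + 1):
        # Multiplying by 1/(1 - x^part) = 1 + x^part + x^(2*part) + ...
        for total in range(part, n_max + 1):
            p[total] += p[total - part]
    return p


print(partitions_up_to(10))
```

The entry p(3) = 3 reproduces the three states listed in Eq. (4.1.4).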

Let us also define the Dedekind η function (as a function of $q = e^{2\pi i\tau}$ 
or of τ) as 

$$ \eta(q) = q^{1/24} \prod_{n=1}^{\infty} (1 - q^n) = q^{1/24} \left(\mathrm{Tr}\, q^{L_0}\right)^{-1}, \tag{4.1.7} $$

which transforms as follows under the modular group 

$$
\begin{aligned}
\eta(-1/\tau) &= (-i\tau)^{1/2}\, \eta(\tau), \\
\eta(\tau + 1) &= e^{\pi i/12}\, \eta(\tau),
\end{aligned} \tag{4.1.8}
$$

where we have rewritten the Dedekind function as a function of τ. 
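
Both transformation laws in Eq. (4.1.8) are easy to spot-check numerically from the product form. The sketch below truncates the product at a fixed number of factors (an assumption that is harmless when Im τ is of order one) and uses the principal branch of the square root; the sample value of τ is arbitrary.

```python
import cmath


def eta(tau, n_terms=200):
    """Dedekind eta from its product form, with q = exp(2*pi*i*tau)."""
    q = cmath.exp(2j * cmath.pi * tau)
    prod = 1 + 0j
    qn = 1 + 0j
    for _ in range(n_terms):
        qn *= q            # qn runs over q, q^2, q^3, ...
        prod *= 1 - qn
    return cmath.exp(2j * cmath.pi * tau / 24) * prod   # prefactor q^(1/24)


tau = 0.31 + 0.87j
lhs_S, rhs_S = eta(-1 / tau), cmath.sqrt(-1j * tau) * eta(tau)
lhs_T, rhs_T = eta(tau + 1), cmath.exp(1j * cmath.pi / 12) * eta(tau)
print(abs(lhs_S - rhs_S), abs(lhs_T - rhs_T))
```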

We now write the character of an irreducible Verma module (over complex 
q) as 

$$ Z(q, \bar q) = q^{-c/24}\, \bar q^{-c/24}\, \mathrm{Tr}\, q^{L_0}\, \bar q^{\bar L_0}. \tag{4.1.9} $$

(The origin of the c/24 term is a bit tricky. In the trace, notice that we 
have implicitly made a conformal transformation from the complex plane to 
a cylinder, given by $w \to z = e^w$. However, as we saw in Chapter 2, the 
energy-momentum tensor does not transform homogeneously under a 
conformal transformation, but picks up a quantity proportional to the Schwarzian 
[Eq. (2.2.24)], that is, 

$$ T_{\text{cylin}}(w) = \left(\frac{\partial z}{\partial w}\right)^2 T(z) + \frac{c}{12}\, S(z, w), \tag{4.1.10} $$

where the Schwarzian can be computed to equal $S(e^w, w) = -\tfrac{1}{2}$. This means 
that the $L_0$ defined on the cylinder is not the $L_0$ that we defined on the complex 
plane: 

$$ (L_0)_{\text{cylin}} = L_0 - (c/24). \tag{4.1.11} $$

Normally, this term can be thrown away, since we are only interested in 
counting the states at each level. However, when we compute the effect of modular 
transformations, this factor becomes crucial in proving modular invariance. 
Traditionally, this factor comes from zeta-function regularization, but this 
obscures the conformal nature of its origin.) 
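
The value of the Schwarzian and the resulting shift of the zero mode can be written out in a few lines. This sketch assumes the standard definition of the Schwarzian derivative (primes denote derivatives with respect to w), the homogeneous factor $(\partial z/\partial w)^2$ in the transformation law, and the mode expansion $T(z) = \sum_n L_n z^{-n-2}$; none of these is restated in this section.

```latex
\begin{align*}
S(z, w) &= \frac{z'''}{z'} - \frac{3}{2}\left(\frac{z''}{z'}\right)^2,
  \qquad z = e^w \;\Rightarrow\; z' = z'' = z''' = e^w, \\
S(e^w, w) &= 1 - \frac{3}{2} = -\frac{1}{2}, \\
T_{\text{cylin}}(w) &= z^2\, T(z) - \frac{c}{24}
  = \sum_n L_n z^{-n} - \frac{c}{24}
  = \sum_n L_n e^{-nw} - \frac{c}{24}.
\end{align*}
```

The term independent of w is $L_0 - c/24$, which is the shift in Eq. (4.1.11).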

The next step is to calculate the transformation properties of these characters 
under modular transformations in order to calculate functions that are invariant 
under the modular group [9-11]. This will, in turn, place nontrivial restrictions 
on the conformal field theories that we have been studying, including the N = 1 
superconformal theories [12, 13]. The input of modular invariance, therefore, 
will prove to be a powerful tool by which to put restrictions on the vast number 
of conformal field theories and their representations that we have found. 


4.2 Free Fermion and Boson Characters 

Let us now try to calculate the modular functions corresponding to more 
complicated models: a free fermion with c = 1/2 and a free boson on a torus 
with c = 1. 





Example: Free Fermion 

There is a trick we will use when calculating fermionic partition functions. 
If $\psi_{-n}$ is a Fourier mode of $\psi(z)$, corresponding to a creation operator, then 
we know that the Fock space spanned by this oscillator is trivial, that is, only 
one $\psi_{-n}$ can act on the vacuum at any given time, since it is a Grassmann-odd 
variable, that is, $\psi_{-n}^2 |0\rangle = 0$. 

The trace over the nth fermionic oscillator mode is therefore easy to perform, 
since the trace consists of only two elements, the vacuum $|0\rangle$ and $\psi_{-n}|0\rangle$. 

If F equals the fermion number, then we have 

$$
\begin{aligned}
\mathrm{Tr}\, q^{n \psi_{-n} \psi_n} &= 1 + q^n, \\
\mathrm{Tr}\, (-1)^F q^{n \psi_{-n} \psi_n} &= 1 - q^n.
\end{aligned} \tag{4.2.1}
$$

[The insertion of the factor $(-1)^F$ into the trace converts the $1 + q^n$ into a 
$1 - q^n$ term.] 

The entire Fock space, however, consists of monomials one can create out 
of various products of $\psi_{-n}$ for different values of n, so we can convert the 
trace, which is a sum over states, into a product over different Fock spaces 

$$ \mathrm{Tr}\, q^{L_0} = \mathrm{Tr}\, q^{\sum_n n \psi_{-n} \psi_n} = \mathrm{Tr} \prod_n q^{n \psi_{-n} \psi_n} = \prod_n (1 + q^n), \tag{4.2.2} $$

where n runs over positive half-integers for the NS fermions and over positive 
integers for the R fermions. 
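
The step from a trace over states to a product over modes can be checked by listing the fermionic Fock states directly: each mode is either empty or occupied once, so a state is a set of distinct mode labels. The sketch below uses integer mode labels 1, ..., 10 (an illustrative choice; half-integer modes can be handled the same way after doubling the labels) and compares the explicit count with the coefficients of the product.

```python
from itertools import combinations


def product_coeffs(modes, n_max):
    """Coefficients of prod_n (1 + x^n) over the given integer mode labels."""
    coeffs = [1] + [0] * n_max
    for n in modes:
        # Descend so that each mode contributes at most once (Grassmann-odd).
        for total in range(n_max, n - 1, -1):
            coeffs[total] += coeffs[total - n]
    return coeffs


def fock_count(modes, n_max):
    """Count Fock states level by level by listing every set of occupied modes."""
    counts = [0] * (n_max + 1)
    for size in range(len(modes) + 1):
        for occupied in combinations(modes, size):
            level = sum(occupied)
            if level <= n_max:
                counts[level] += 1
    return counts


modes = list(range(1, 11))
print(product_coeffs(modes, 10))
print(fock_count(modes, 10))
```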

However, there is a complication in the sum due to the boundary conditions 
on the fermions, that is, we can take different periodic and antiperiodic 
boundary conditions on the torus. If we take the trace over $q^{L_0}$, we must first specify 
whether we are tracing (in the τ direction) over Ramond fermions (which are 
periodic) or Neveu-Schwarz fermions (which are antiperiodic). We will 
denote this by $\mathrm{Tr}_P$ and $\mathrm{Tr}_A$. But, we must also specify the boundary conditions 
in the σ direction as well as in the τ direction. Since the boundary conditions 
can be either periodic or antiperiodic in the τ or σ directions, we have a total 
of four possible boundary conditions and hence four possible traces. 

The four different boundary conditions on the torus or parallelogram define 
what is called the spin structure on that surface. 

We will use the symbol χ(A, P), for example, to denote the trace over 
antiperiodic (periodic) boundary conditions in the σ (τ) direction. In the case 
of χ(A, P), we will simply trace over Ramond fermions: $q^{-1/48}\, \mathrm{Tr}_P\, q^{L_0}$. 
Notice that the trace over Ramond fermions automatically specifies antiperiodic 
boundary conditions in the σ direction. 

If we wish, however, to calculate χ(P, P), which has periodic conditions in 
the σ direction, then we must insert the operator $(-1)^F$, where F is the fermion 
number, in order to reverse the periodicity. 

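Given these rules, it is now a simple matter to write out in detail the four 
possible traces corresponding to the four possible spin structures for the torus. 
Because all trace operations can now be converted into products over different 
Fock spaces labeled by n, we now have 

$$
\begin{aligned}
\chi(A, A) &= q^{-1/48}\, \mathrm{Tr}_A\, q^{L_0} = q^{-1/48} \prod_{n=0}^{\infty} (1 + q^{n+1/2}) = \sqrt{\frac{\vartheta_3}{\eta}}, \\
\chi(P, A) &= q^{-1/48}\, \mathrm{Tr}_A\, (-1)^F q^{L_0} = q^{-1/48} \prod_{n=0}^{\infty} (1 - q^{n+1/2}) = \sqrt{\frac{\vartheta_4}{\eta}}, \\
\chi(A, P) &= \frac{1}{\sqrt{2}}\, q^{-1/48}\, \mathrm{Tr}_P\, q^{L_0} = \frac{1}{\sqrt{2}}\, q^{1/24} \prod_{n=0}^{\infty} (1 + q^n) = \sqrt{\frac{\vartheta_2}{\eta}}, \\
\chi(P, P) &= \frac{1}{\sqrt{2}}\, q^{-1/48}\, \mathrm{Tr}_P\, (-1)^F q^{L_0} = \frac{1}{\sqrt{2}}\, q^{1/24} \prod_{n=0}^{\infty} (1 - q^n) = \sqrt{\frac{\vartheta_1}{\eta}} = 0,
\end{aligned} \tag{4.2.3}
$$

where the $\vartheta_i \equiv \vartheta_i(0, q)$ are the usual Jacobi theta functions (taken with 
one variable set to zero). These functions can be defined through the above 
equations as infinite products, or as infinite sums via 

$$
\begin{aligned}
\vartheta_1(v, q) &= i \sum_{n=-\infty}^{\infty} (-1)^n q^{(1/2)[n-(1/2)]^2} e^{i\pi(2n-1)v}, \\
\vartheta_2(v, q) &= \sum_{n=-\infty}^{\infty} q^{(1/2)[n-(1/2)]^2} e^{i\pi(2n-1)v}, \\
\vartheta_3(v, q) &= \sum_{n=-\infty}^{\infty} q^{n^2/2} e^{2\pi i n v}, \\
\vartheta_4(v, q) &= \sum_{n=-\infty}^{\infty} (-1)^n q^{n^2/2} e^{2\pi i n v}.
\end{aligned} \tag{4.2.4}
$$

We should also mention that the ϑ functions can also be written as infinite 
products 

$$
\begin{aligned}
\vartheta_1(v, q) &= 2 q_0\, q^{1/8} \sin \pi v \prod_{n=1}^{\infty} \left(1 - 2 q^n \cos 2\pi v + q^{2n}\right), \\
\vartheta_2(v, q) &= 2 q_0\, q^{1/8} \cos \pi v \prod_{n=1}^{\infty} \left(1 + 2 q^n \cos 2\pi v + q^{2n}\right), \\
\vartheta_3(v, q) &= q_0 \prod_{n=1}^{\infty} \left[1 + 2 q^{n-(1/2)} \cos 2\pi v + q^{2n-1}\right], \\
\vartheta_4(v, q) &= q_0 \prod_{n=1}^{\infty} \left[1 - 2 q^{n-(1/2)} \cos 2\pi v + q^{2n-1}\right],
\end{aligned} \tag{4.2.5}
$$

where $q_0 = \prod_{n=1}^{\infty} (1 - q^n)$. (Our value of q is the square root of the value 
quoted in Ref. 14.) 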
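
The agreement between the sum and product representations can be spot-checked numerically. The sketch below takes the sums in the form of Eq. (4.2.4) and the products with cos 2πv in every factor and with the common prefactor q_0 = prod (1 - q^n); the truncation order N and the sample point (v, q) are arbitrary choices.

```python
import cmath
import math

N = 20  # truncation order; ample for q well below 1


def theta_sums(v, q):
    """Return (theta1, theta2, theta3, theta4) from the infinite sums."""
    t1 = t2 = t3 = t4 = 0j
    for n in range(-N, N + 1):
        half = q ** (0.5 * (n - 0.5) ** 2) * cmath.exp(1j * math.pi * (2 * n - 1) * v)
        full = q ** (0.5 * n * n) * cmath.exp(2j * math.pi * n * v)
        sign = -1 if n % 2 else 1
        t1 += 1j * sign * half
        t2 += half
        t3 += full
        t4 += sign * full
    return t1, t2, t3, t4


def theta_products(v, q):
    """Return the same four functions from the infinite products."""
    q0 = p1 = p2 = p3 = p4 = 1.0
    c = math.cos(2 * math.pi * v)
    for n in range(1, N + 1):
        q0 *= 1 - q ** n
        p1 *= 1 - 2 * q ** n * c + q ** (2 * n)
        p2 *= 1 + 2 * q ** n * c + q ** (2 * n)
        p3 *= 1 + 2 * q ** (n - 0.5) * c + q ** (2 * n - 1)
        p4 *= 1 - 2 * q ** (n - 0.5) * c + q ** (2 * n - 1)
    pre = 2 * q0 * q ** 0.125
    return (pre * math.sin(math.pi * v) * p1, pre * math.cos(math.pi * v) * p2,
            q0 * p3, q0 * p4)


sums = theta_sums(0.37, 0.2)
prods = theta_products(0.37, 0.2)
print(max(abs(a - b) for a, b in zip(sums, prods)))
```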





We now would like to reanalyze the partition functions for the c = 1/2 fermion, 
rearranging the characters according to the irreducible representations of the 
conformal group, that is, according to h, c. Comparing the characters found 
above with the characters found in the usual Ising model, where we also have 
c = 1/2, we find from Eq. (2.5.16) that there are three primary fields with weights 
given by 

$$ \{h_{1,1}, h_{2,1}, h_{2,2}\} = \left\{0, \tfrac{1}{2}, \tfrac{1}{16}\right\}. \tag{4.2.6} $$

Our task is now to rearrange the partition functions found earlier so that we 
have only the characters of irreducible representations. To do this, we note that 
the Verma modules are built up by multiplying the vacuum by $L_{-n}$, which does 
not change the overall fermion number of the state. Thus, the Verma module 
with h = 0 is built of states with even fermion numbers and integer eigenvalues 
of $L_0$, while the module with h = 1/2 is built of states with odd fermion numbers 
and half-integral eigenvalues of $L_0$. 

Thus, instead of arranging the character depending on whether the boundary 
conditions are periodic or antiperiodic, we will now rearrange the module 
according to whether the module has an even or odd fermion number. The way 
to do this is to insert $\tfrac{1}{2}[1 \pm (-1)^F]$ into the trace, which selects the states with 
definite fermion numbers. 

Given this decomposition according to the fermion number, the characters 
in this representation can be defined via 

$$
\begin{aligned}
\chi_0 &= q^{-1/48}\, \mathrm{Tr}_{h=0}\, q^{L_0} = q^{-1/48}\, \mathrm{Tr}_A\, \tfrac{1}{2}[1 + (-1)^F]\, q^{L_0}, \\
\chi_{1/2} &= q^{-1/48}\, \mathrm{Tr}_{h=1/2}\, q^{L_0} = q^{-1/48}\, \mathrm{Tr}_A\, \tfrac{1}{2}[1 - (-1)^F]\, q^{L_0}, \\
\chi_{1/16} &= q^{-1/48}\, \mathrm{Tr}_{h=1/16}\, q^{L_0} = q^{-1/48}\, \mathrm{Tr}_P\, \tfrac{1}{2}[1 \pm (-1)^F]\, q^{L_0}.
\end{aligned} \tag{4.2.7}
$$

(In the last expression for h = 1/16, we used the fact that the trace over $(-1)^F q^{L_0}$ 
equals zero due to a cancellation between equal numbers of states with different 
fermion numbers.) However, since we know how to evaluate all the traces in 
the above expression in terms of the four periodic and antiperiodic characters 
we wrote before, we now can write the complete expression for each of the 
three Verma modules [4]: 

$$
\begin{aligned}
\chi_0 &= \tfrac{1}{2}[\chi(A, A) + \chi(P, A)] = \tfrac{1}{2}\left(\sqrt{\frac{\vartheta_3}{\eta}} + \sqrt{\frac{\vartheta_4}{\eta}}\right), \\
\chi_{1/2} &= \tfrac{1}{2}[\chi(A, A) - \chi(P, A)] = \tfrac{1}{2}\left(\sqrt{\frac{\vartheta_3}{\eta}} - \sqrt{\frac{\vartheta_4}{\eta}}\right), \\
\chi_{1/16} &= \frac{1}{\sqrt{2}}[\chi(A, P) \pm \chi(P, P)] = \frac{1}{\sqrt{2}} \sqrt{\frac{\vartheta_2}{\eta}}.
\end{aligned} \tag{4.2.8}
$$

The point of this discussion, of course, is to calculate modular invariant 
combinations of the χ. Let us now calculate, therefore, how each of these χ functions 
changes under a modular transformation. 

Under the operation T, we see that the top of the parallelogram is shifted 
one unit to the right, as in Fig. 4.1. We can now simply determine how the spin 
structures change under this operation. For example, we can start with a torus 
with the (A, A) spin structure and apply the operation T. When we move from 
the origin to the point τ, we pick up a factor of (−1). When we now move from 
the origin to the point τ + 1, we pick up a factor of (−1)(−1) = +1, which 
is periodic. By moving in the σ direction on the new torus, we now pick up a 
factor of +1, so it is now periodic in this direction. So the spin structure, under 
the T operation, has now changed to (P, A). 

Likewise, we can, by simply moving from the origin to the point τ + 1, 
determine how all four spin structures change. By explicit calculation with the 
known values of χ, we find 

$$
T: \quad
\begin{aligned}
\chi(A, A) &\to e^{-i\pi/24}\, \chi(P, A), \\
\chi(P, A) &\to e^{-i\pi/24}\, \chi(A, A), \\
\chi(A, P) &\to e^{i\pi/12}\, \chi(A, P).
\end{aligned} \tag{4.2.9}
$$

We can also calculate how the spin structures change under the operation S 
(which interchanges the τ and σ directions). Using the simple-minded rule given 
above, we find the following transformations of the traces under S: 

$$
S: \quad
\begin{aligned}
\chi(A, A) &\to \chi(A, A), \\
\chi(A, P) &\to \chi(P, A), \\
\chi(P, A) &\to \chi(A, P).
\end{aligned} \tag{4.2.10}
$$


Notice that none of the χ are modular invariant by themselves. To obtain 
modular invariant functions, we must also include complex conjugates of the 
χ. We can satisfy T and S invariance by multiplying the various characters 
with their complex conjugates (which eliminates the phase factor introduced 
via T transformations) and then choosing the correct combination of absolute 
values of characters to get complete modular invariance. 

The goal of this discussion is to formulate a modular invariant partition 
function for the c = 1/2 fermions, which we see is now given by 

$$
\begin{aligned}
Z_{\text{Ising}} &= \tfrac{1}{2}\left(\left|\frac{\vartheta_3}{\eta}\right| + \left|\frac{\vartheta_4}{\eta}\right| + \left|\frac{\vartheta_2}{\eta}\right| \pm \left|\frac{\vartheta_1}{\eta}\right|\right) \\
&= \chi_0 \bar\chi_0 + \chi_{1/2} \bar\chi_{1/2} + \chi_{1/16} \bar\chi_{1/16}.
\end{aligned} \tag{4.2.11}
$$
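
The transformation rules (4.2.9) and (4.2.10), and the invariance of the Ising partition function, can be spot-checked numerically from the product forms of the spin-structure traces. This is a sketch only: the products are truncated at a fixed number of factors, the sample value of τ is arbitrary, and the function names are illustrative.

```python
import cmath


def spin_traces(tau, n_terms=80):
    """Return (chi_AA, chi_PA, chi_AP) of Eq. (4.2.3) as truncated products."""
    q = cmath.exp(2j * cmath.pi * tau)
    half = cmath.exp(1j * cmath.pi * tau)          # q^(1/2)
    aa = pa = ap = 1 + 0j
    qn = 1 + 0j                                    # q^n, starting at n = 0
    for _ in range(n_terms):
        aa *= 1 + qn * half
        pa *= 1 - qn * half
        ap *= 1 + qn
        qn *= q
    ns = cmath.exp(-2j * cmath.pi * tau / 48)                 # q^(-1/48)
    r = cmath.exp(2j * cmath.pi * tau / 24) / cmath.sqrt(2)   # q^(1/24)/sqrt(2)
    return ns * aa, ns * pa, r * ap


def ising_characters(tau):
    """Characters chi_0, chi_1/2, chi_1/16 built as in Eq. (4.2.8)."""
    aa, pa, ap = spin_traces(tau)
    return (aa + pa) / 2, (aa - pa) / 2, ap / cmath.sqrt(2)


def z_ising(tau):
    """Diagonal bilinear sum of the three characters."""
    return sum(abs(ch) ** 2 for ch in ising_characters(tau))


tau = 0.23 + 0.9j
print(z_ising(tau), z_ising(tau + 1), z_ising(-1 / tau))
```

The three printed values agree to rounding error, while the individual traces are only permuted (and multiplied by phases) among themselves.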

In general, when we discuss increasingly more complicated characters for the 
Virasoro and Kac-Moody representations, we will take the ansatz that the 
final modular invariant partition function will be a bilinear sum over both 
holomorphic and antiholomorphic representations, each corresponding to the 
various primary fields, that is, 

$$ Z(q, \bar q) = \sum_{h, \bar h} N_{h \bar h}\, \chi_h(q)\, \bar\chi_{\bar h}(\bar q). \tag{4.2.12} $$

We will make modular transformations on this bilinear combination in order to 
obtain constraint equations on the coefficients $N_{h \bar h}$, thus yielding the invariant 
solution. 





These methods developed for the characters of free fermions can be carried 
directly over to the partition function over the free boson defined on the torus, 
such that X = X + 2πr. 

Example: Free Boson on a Torus 

The trace we are interested in is 

$$ \chi = (q \bar q)^{-c/24}\, \mathrm{Tr}\, q^{L_0}\, \bar q^{\bar L_0}. \tag{4.2.13} $$

Special care, however, has to be given to the zero mode sector in the trace. 
Because of periodic boundary conditions, momentum is quantized on the torus, 
and because the string can wind around the torus, we have to introduce the 
winding number. Thus, two integers are required to describe compactification 
of a closed string. We thus have 

$$ \alpha_0 = p_L = \frac{m}{2r} + nr, \qquad \bar\alpha_0 = p_R = \frac{m}{2r} - nr, \tag{4.2.14} $$

for integers n and m. This means that we have to sum over states $|m, n\rangle$, indexed 
by two integers, and then multiply by arbitrary numbers of creation operators. 
The eigenvalues of $L_0$ and $\bar L_0$ on these states are easily calculated. Let 

$$ |\{n_i\}, \{m_j\}, m, n\rangle = \Big(\prod_i \alpha_{-n_i}\Big) \Big(\prod_j \bar\alpha_{-m_j}\Big) |m, n\rangle \tag{4.2.15} $$

for some collection of integers $n_i$ and $m_j$. 

Then, we have 

$$ L_0\, |\{n_i\}, \{m_j\}, m, n\rangle = \left[\sum_i n_i + \frac{1}{2}\left(\frac{m}{2r} + nr\right)^2\right] |\{n_i\}, \{m_j\}, m, n\rangle. \tag{4.2.16} $$

(The equation for $\bar L_0$ is the same, except that the term +nr is changed to −nr 
on the right-hand side, and we replace $n_i$ with $m_j$.) 

The partition function therefore splits into two pieces. The first piece simply 
records the number of ways in which we can create states multiplied by 
monomials in the creation oscillators $\alpha_{-n}$. This part is easy to compute and gives us 
$\prod_n (1 - q^n)^{-1}$. This contributes a factor of $1/(\eta \bar\eta)$. 

The second part is the summation over the zero modes, which in turn are 
indexed by two numbers. Putting everything together, we now have 

$$ Z = \frac{1}{\eta \bar\eta} \sum_{m,n=-\infty}^{\infty} q^{\frac{1}{2}(m/2r + nr)^2}\, \bar q^{\frac{1}{2}(m/2r - nr)^2}. \tag{4.2.17} $$

This expression is modular invariant by itself. (If we make the transformation 
τ → τ + 1, the $\eta \bar\eta$ is invariant because the two factors only change by opposite 
phases. The zero mode part picks up a phase $\exp[\pi i (p_L^2 - p_R^2)]$, which equals 
unity when we plug in the values for the momenta. If we make the transformation 
τ → −1/τ, the calculation is a bit more difficult, but can also be performed by 
reversing the boundary conditions on the torus.) 
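
Modular invariance of the compactified boson can also be spot-checked numerically. The sketch below evaluates the double sum of Eq. (4.2.17) with a finite cutoff on m and n and a truncated product for η; the cutoff, the radius r = 0.8, and the sample τ are arbitrary choices, and the function names are illustrative.

```python
import cmath
import math


def eta(tau, n_terms=80):
    """Dedekind eta from its product form."""
    q = cmath.exp(2j * math.pi * tau)
    prod, qn = 1 + 0j, 1 + 0j
    for _ in range(n_terms):
        qn *= q
        prod *= 1 - qn
    return cmath.exp(2j * math.pi * tau / 24) * prod


def z_boson(tau, r, cutoff=12):
    """Partition function of a boson on a circle of radius r (truncated sum)."""
    total = 0j
    for m in range(-cutoff, cutoff + 1):
        for n in range(-cutoff, cutoff + 1):
            p_left = m / (2 * r) + n * r
            p_right = m / (2 * r) - n * r
            # q^(p_L^2/2) * qbar^(p_R^2/2) with q = exp(2*pi*i*tau)
            total += cmath.exp(1j * math.pi * (tau * p_left ** 2
                                               - tau.conjugate() * p_right ** 2))
    return total.real / abs(eta(tau)) ** 2


tau = 0.2 + 0.95j
print(z_boson(tau, 0.8), z_boson(tau + 1, 0.8), z_boson(-1 / tau, 0.8))
```

Under τ → τ + 1 each term changes by exp(2πi mn) = 1, which is the phase argument given in the text; the check under τ → −1/τ is the nontrivial one.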





4.3 GSO and Supersymmetry 

Let us now apply some of these techniques to the D = 10 superstring, 
investigating the surprising link between supersymmetry and modular invariance. 
We recall that the NS-R model by itself is not space-time supersymmetric. 
However, we can recover space-time supersymmetry by imposing the GSO 
projection [15]. 

Let us define the G-parity operator by 

$$ G = (-1)^{\sum_{r=1/2}^{\infty} b_r^\dagger b_r - 1}. \tag{4.3.1} $$

We will now take the even G-parity sector of the theory. This truncation of the 
NS-R sector has several important implications. 

First, it eliminates the troublesome tachyon that appears in the bosonic 
theory. Second, it restores space-time supersymmetry. To see this, it is most 
convenient to work with the light cone quantized NS-R string, where we can 
prove that the number of states of the fermionic sector equals the number of 
states in the bosonic sector. 

In the NS sector, the even G-parity sector is equal to the trace over the 
following: 

$$
\begin{aligned}
P_{NS} &= q^{-1/2}\, \mathrm{Tr}\left[\tfrac{1}{2}(1 + G)\, q^R\right], \\
R &= \sum_{n=1}^{\infty} n\, a_n^\dagger a_n + \sum_{r=1/2}^{\infty} r\, b_r^\dagger b_r.
\end{aligned} \tag{4.3.2}
$$

Using the previous techniques, we can show that this partition function equals: 

$$ P_{NS} = \tfrac{1}{2}\, q^{-1/2} \prod_{n=1}^{\infty} (1 - q^n)^{-8} \left[\prod_{n=1}^{\infty} (1 + q^{n-1/2})^8 - \prod_{n=1}^{\infty} (1 - q^{n-1/2})^8\right]. \tag{4.3.3} $$

Next, we can set up the Ramond sector partition function as 

$$
\begin{aligned}
P_R &= 8\, \mathrm{Tr}(q^R), \\
R &= \sum_{n=1}^{\infty} n \left(a_n^\dagger a_n + d_n^\dagger d_n\right)
\end{aligned} \tag{4.3.4}
$$

(where the 8 comes from the fact that, in the light cone gauge, only eight 
components of the spinor survive). This can be shown to equal: 

$$ P_R = 8 \prod_{n=1}^{\infty} (1 - q^n)^{-8} (1 + q^n)^8. \tag{4.3.5} $$

It was recognized in 1829 by Jacobi that these two expressions are equal: 

$$ P_{NS} = P_R. \tag{4.3.6} $$
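
Jacobi's identity can be verified order by order as a power series. The sketch below works in the variable x = q^(1/2) so that all exponents are integers, and compares the coefficients of the two expressions (4.3.3) and (4.3.5) up to a fixed order; the helper names and the truncation order are illustrative.

```python
ORDER = 41  # highest power of x = q^(1/2) that is kept


def mul(a, b, order=ORDER):
    """Multiply two truncated power series given as coefficient lists."""
    out = [0] * (order + 1)
    for i, ai in enumerate(a):
        if ai:
            for j, bj in enumerate(b):
                if i + j > order:
                    break
                out[i + j] += ai * bj
    return out


def power_product(sign, start, step, exponent, order=ORDER):
    """Series of prod_k (1 + sign*x^k)^exponent for k = start, start+step, ...

    A negative exponent is only meant for sign = -1, where
    1/(1 - x^k) = 1 + x^k + x^(2k) + ... is used.
    """
    series = [1] + [0] * order
    for k in range(start, order + 1, step):
        if exponent > 0:
            factor = [0] * (order + 1)
            factor[0], factor[k] = 1, sign
        else:
            factor = [1 if i % k == 0 else 0 for i in range(order + 1)]
        for _ in range(abs(exponent)):
            series = mul(series, factor)
    return series


bosons = power_product(-1, 2, 2, -8)            # prod (1 - q^n)^(-8)
ns_plus = power_product(+1, 1, 2, 8)            # prod (1 + q^(n-1/2))^8
ns_minus = power_product(-1, 1, 2, 8)           # prod (1 - q^(n-1/2))^8
half_diff = [(a - b) // 2 for a, b in zip(ns_plus, ns_minus)]
p_ns = mul(half_diff, bosons)[1:]               # dropping x^0 divides by q^(1/2)
p_r = [8 * c for c in mul(power_product(+1, 2, 2, 8), bosons)][:ORDER]

print(p_ns[:9])
print(p_r[:9])
```

The leading coefficients 8 and 128 count the states at the first two mass levels in each sector.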

This formula has great implications for the superstring. Not only does it show 
that the NS-R model is space-time supersymmetric after the GSO projection, 
but it also shows the power of modular invariance. Notice that we inserted the 
projection operator $\tfrac{1}{2}[1 + (-1)^F]$ into the trace, which is precisely the summing 
over the (A, P) and (P, A) sectors. Modular invariance means invariance under 
the interchange and transformation of the homology cycles, which is precisely 
what the insertion of the GSO operator performs. 

For the superstring theory, it means that, in some sense, modular invariance, 
the GSO projection, and space-time supersymmetry are all interdependent. 
Since modular invariance is necessary for a unitary theory, this implies that 
space-time supersymmetry is necessary for the internal consistency of the 
theory. 

Space-time supersymmetry, far from being a luxury for the theory, now 
appears absolutely essential for the self-consistency of the entire theory. 
This fact will become even more important in the next chapter, 
where we will discuss different supersymmetric compactification schemes. 


4.4 Minimal Model Characters 


Let us proceed to the more complicated case of the minimal models. Let us 
calculate the character in two different ways. First, we will take the above 
expressions for the character of an irreducible representation and generalize it 
to the case where null states are present. We will derive the character formula 
by carefully subtracting out all the null states, leaving us with the character of 
the minimal model. Second, we will use the coset construction of the previous 
chapter and calculate the character of the minimal models by reexpressing it in 
terms of the affine $SU(2)_k$ via Eq. (3.3.10). Finally, let us recall that the Ising 
model at criticality is equivalent to the minimal model, with m = 3 in Eq. 
(2.5.24). So, our new results on the character of the minimal models should 
give us an independent check on the previous formula. 

We begin by noting that, if there are no null states at all, then the character 
is given by 

$$ \chi_h(q) = q^{-c/24 + h} \sum_{n=0}^{\infty} p(n)\, q^n = q^{-(c-1)/24}\, \eta(q)^{-1}\, q^h, \tag{4.4.1} $$

since the number of states at level n for this irreducible case equals the partition 
of that integer p(n). (We have inserted a normalization factor $q^h$, which will 
simplify our discussion.) Let us now generalize this formula for the minimal 
model. Recall that the weights of the fields in the minimal models are labeled 
by two integers and are given by Eq. (2.5.7): 

$$ h_{r,s} = \frac{[r(m+1) - sm]^2 - 1}{4m(m+1)}, \qquad (1 \le s \le r \le m - 1). \tag{4.4.2} $$

If we analyze the Verma module, we notice that it contains a null state χ 
at level rs, which therefore has weight given by $h_{rs} + rs$. We must, therefore, 
explicitly remove from the character the states given by the module generated 
by χ. 



Let us now analyze the character x rs of the m th minimal model. If we subtract 
the contribution from the null states, we find 

Xrs (q) = «T (C - I)/2 V<7)-Y"(l - q rs + • • •)• (4.4.3) 

However, we cannot stop here. It turns out that the module $[\chi]$ itself contains
a null state, at level $(m + r)(m + 1 - s)$, because

$$h_{rs} + rs = h_{m+r,\,m+1-s}, \tag{4.4.4}$$

so it would be overcounting to simply remove the module $[\chi]$. Instead, we must
carefully subtract this new null module from the first one.

Subtracting this new null module, we find 

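$$\begin{aligned}
\chi_{rs} &= q^{-(c-1)/24}\, \eta(q)^{-1} \left\{ q^{h_{rs}} - q^{h_{rs}+rs} \left[ 1 - q^{(m+r)(m+1-s)} + \cdots \right] \right\} \\
&= q^{-(c-1)/24}\, \eta(q)^{-1} \left( q^{h_{rs}} - q^{h_{r,-s}} + \cdots \right).
\end{aligned} \tag{4.4.5}$$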

Not surprisingly, this process continues forever, with null states within the 
module generated by the previous null state. Therefore, an infinite succession 
of subtraction of these factors is necessary. The final result can be found by 
summing the various subtractions: 

$$\chi_{rs} = q^{-(c-1)/24}\, \eta(q)^{-1} \sum_{k=-\infty}^{\infty} \left( q^{h_{2mk+r,\,s}} - q^{h_{2mk+r,\,-s}} \right). \tag{4.4.6}$$

This is the desired character for the mth minimal model. 
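
As an illustration of Eq. (4.4.6), the following Python sketch expands the character as a power series, $\chi_{rs} = q^{h_{rs} - c/24} \sum_n a_n q^n$, using $\eta(q) = q^{1/24} \prod_n (1 - q^n)$ and the weights of Eq. (4.4.2) continued to all integer labels. The function names are our own and are introduced only for this example; nothing here is taken from the references. For m = 3 it should reproduce the leading terms of the three Ising characters with h = 0, 1/2, and 1/16.

```python
def partition_numbers(nmax):
    """Return [p(0), ..., p(nmax)], the coefficients of prod_n (1 - q^n)^(-1)."""
    p = [1] + [0] * nmax
    for part in range(1, nmax + 1):
        for n in range(part, nmax + 1):
            p[n] += p[n - part]
    return p


def minimal_character(m, r, s, nmax):
    """Coefficients a_0..a_nmax in chi_rs = q^(h_rs - c/24) * sum_n a_n q^n."""
    denom = 4 * m * (m + 1)
    base = (r * (m + 1) - s * m) ** 2          # numerator of h_rs, up to the -1
    numerator = [0] * (nmax + 1)               # the alternating sum in Eq. (4.4.6)
    for k in range(-nmax - 1, nmax + 2):
        for sign, t in ((+1, s), (-1, -s)):
            shift = (2 * m * (m + 1) * k + r * (m + 1) - t * m) ** 2 - base
            level, rem = divmod(shift, denom)  # level = h_{2mk+r,t} - h_rs
            assert rem == 0
            if 0 <= level <= nmax:
                numerator[level] += sign
    p = partition_numbers(nmax)
    # Multiply the alternating sum by the partition generating function.
    return [sum(numerator[j] * p[n - j] for j in range(n + 1))
            for n in range(nmax + 1)]


# Ising model, m = 3: (r, s) = (1, 1), (2, 1), (2, 2) have h = 0, 1/2, 1/16.
print(minimal_character(3, 1, 1, 8))  # [1, 0, 1, 1, 2, 2, 3, 3, 5]
print(minimal_character(3, 2, 1, 8))  # [1, 1, 1, 1, 2, 2, 3, 4, 5]
print(minimal_character(3, 2, 2, 8))  # [1, 1, 1, 2, 2, 3, 4, 5, 6]
```

The vanishing coefficient of q in the h = 0 character is the level-one null state $L_{-1}|0\rangle$ that was subtracted in Eq. (4.4.3).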

Now let us calculate the transformation properties of this character under S 
and T. It is straightforward to show that 

$$T:\ \chi_h(q) \to e^{2\pi i (h - c/24)}\, \chi_h(q). \tag{4.4.7}$$

A bit more difficult is the calculation of the character under an S transformation 

$$S:\ \chi_{rs}(q) \to \sum_{p,q} S_{rs}^{pq}\, \chi_{pq}(q), \tag{4.4.8}$$

where

$$S_{rs}^{pq} = \left[ \frac{8}{m(m+1)} \right]^{1/2} (-1)^{(r+s)(p+q)} \sin\frac{\pi r p}{m}\, \sin\frac{\pi s q}{m+1}. \tag{4.4.9}$$

This is a rather remarkable formula, because encoded within the S matrix is a
finite-dimensional representation of the modular group. However, notice also
that $\chi_{rs}$ by itself is not modular invariant. It transforms “covariantly” under the
modular group, and hence, it is not an invariant. To form a genuine invariant,
we must take various combinations of bilinear sums of representations in order
to obtain an invariant character as in Eq. (4.2.12).

One set of invariants can be calculated by observing that the matrix obeys
$S^2 = 1$, with real elements. Then, it is easy to show that the following diagonal
form for the N matrix is modular invariant:

$$N_{h\bar h} = \delta_{h\bar h}. \tag{4.4.10}$$
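
The properties of the S matrix quoted above can be checked numerically. The following Python sketch is an illustration only, with helper names of our own: it builds the S matrix of Eq. (4.4.9) on the fields $1 \le s \le r \le m - 1$ and the diagonal T matrix of Eq. (4.4.7), assuming the standard central charge $c = 1 - 6/m(m+1)$ of the unitary minimal series, and tests $S^2 = 1$ and $(ST)^3 = 1$. Since S is real and symmetric, $S^2 = 1$ is exactly the statement that the diagonal sum of $|\chi_{rs}|^2$ is preserved by S.

```python
import cmath
import math
from fractions import Fraction


def labels(m):
    """Primary fields (r, s) with 1 <= s <= r <= m - 1, as in Eq. (4.4.2)."""
    return [(r, s) for r in range(1, m) for s in range(1, r + 1)]


def weight(m, r, s):
    """Conformal weight h_rs of Eq. (4.4.2) as an exact fraction."""
    return Fraction((r * (m + 1) - s * m) ** 2 - 1, 4 * m * (m + 1))


def s_matrix(m):
    """S matrix of Eq. (4.4.9) restricted to the fields listed by labels(m)."""
    norm = math.sqrt(8.0 / (m * (m + 1)))
    return [[norm * (-1) ** ((r + s) * (p + q))
             * math.sin(math.pi * r * p / m)
             * math.sin(math.pi * s * q / (m + 1))
             for (p, q) in labels(m)] for (r, s) in labels(m)]


def t_matrix(m):
    """Diagonal T matrix of Eq. (4.4.7), assuming c = 1 - 6/(m(m+1))."""
    c = 1 - Fraction(6, m * (m + 1))
    phases = [cmath.exp(2j * math.pi * float(weight(m, r, s) - c / 24))
              for (r, s) in labels(m)]
    n = len(phases)
    return [[phases[i] if i == j else 0.0 for j in range(n)] for i in range(n)]


def matmul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]


def is_identity(a, tol=1e-9):
    return all(abs(a[i][j] - (1.0 if i == j else 0.0)) < tol
               for i in range(len(a)) for j in range(len(a)))


for m in (3, 4, 5):
    S, T = s_matrix(m), t_matrix(m)
    ST = matmul(S, T)
    print(m, is_identity(matmul(S, S)), is_identity(matmul(ST, matmul(ST, ST))))
```

For m = 3 the fields are ordered as h = 0, 1/2, 1/16, and the first row of S is (1/2, 1/2, 1/√2).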





As a check on our results, we take the case m = 3, which corresponds to 
the Ising model, with three primary fields. Notice that the diagonal N matrix 
yields the following form for the invariant: 


$$Z = |\chi_0|^2 + |\chi_{1/2}|^2 + |\chi_{1/16}|^2, \tag{4.4.11}$$


which is precisely the form of the modular invariant found earlier in Eq. (4.2.11)
when analyzing the $c = \frac{1}{2}$ fermion system.

We can find other invariants by using some tricks. Notice that for m odd, we
can show that the S-matrix obeys

$$S_{r,\,m+1-s}^{r's'} = (-1)^{s'-1} S_{rs}^{r's'}, \qquad S_{rs}^{r',\,m+1-s'} = (-1)^{s-1} S_{rs}^{r's'}. \tag{4.4.12}$$


This, in turn, implies that we can form modular invariants out of the 
combination 


$$\chi_{rs} + \chi_{r,\,m+1-s}. \tag{4.4.13}$$

Specifically, we can show that the following is also an invariant [5]: 

$$Z = \sum_{r}\ \sum_{s\ \mathrm{odd}} \left| \chi_{rs} + \chi_{r,\,m+1-s} \right|^2. \tag{4.4.14}$$

Thus, we have constructed two infinite series of modular invariant partition 
functions. Similarly, it can be shown that there is only a finite number of other 
possibilities, corresponding to m = 11, 12, 17, 18, 29, 30. Later, we will see
how they give rise to two infinite series and three finite ones, which exhaust 
all possible modular invariant partition functions for the minimal series. 

However, we would like to find a systematic way in which to construct these 
invariants rather than appealing to mathematical tricks. To gain some insight 
into this difficult question, we now turn to the characters over the Kac-Moody 
algebras and will compute the modular invariant functions for both the minimal 
model and $SU(2)_k$.


4.5 Affine Characters 

It can be shown that the minimal series found earlier exhausts all possible 
unitary representations with a finite number of primary fields. However, we 
can enlarge the system to include Kac-Moody algebras. Then, there exist 
representations of affine Lie groups with finite numbers of primary fields where 
the fields are now primary with respect to both the conformal group and the 
affine Lie group. (These affine systems, however, can have an infinite number 
of primaries with respect to just the conformal group.) 

To understand the characters of these Kac-Moody algebras, let us first
calculate the character of the minimal models in another way, using the coset
construction of the previous chapter. We will write the character of the minimal
model in terms of the character of the simplest affine Lie group, $SU(2)_k$.





In general, modules over the affine Lie algebras will be more complicated
than conformal modules because they are generated by isotopic ladder
operators $J^a_{-m}$ as well as $L_{-n}$ operating on the highest weight state. To typify a
state, we must calculate its eigenvalue under both the level operator $L_0$ and the
Cartan subalgebra $H_0$.

Let us consider the level k representation built on the spin-j vacuum state
$|j\rangle$ for $SU(2)_k$. The character will now depend on two variables, $\tau$ (associated
with the Virasoro operators appearing in the module) and $\theta$ (associated with
the Kac-Moody operators). We define

$$\chi_j^{(k)}(\theta, \tau) = q^{-c/24}\, \mathrm{Tr}_{(j),k} \left( q^{L_0} e^{i\theta J_0^3} \right). \tag{4.5.1}$$

For a more general affine Lie algebra, the character now depends on the
parameter $\tau$ as well as $\theta^i$, where i ranges over the elements of the generators
$H_0^i$ of the mutually commuting Cartan subalgebra. We define

$$\chi_\lambda^{(k)}(\theta^i, \tau) = q^{-c_G/24}\, \mathrm{Tr}_{(\lambda),k} \left( q^{L_0} e^{i\theta^i H_0^i} \right). \tag{4.5.2}$$

Fortunately, almost all formulas in the theory of Lie algebras generalize to
the affine case, so we will need the Weyl character formula for an arbitrary 
Lie algebra and its generalization to the affine case: the Weyl-Kac character 
formula. 

To understand the Weyl character formula for Lie algebras and the Weyl-Kac 
character formula for affine Lie algebras, we must first make a few definitions. 
In the Cartan-Weyl basis, we define $\alpha_i$ to be the root vectors. A vector $\rho$ can be
written in terms of these roots as $\rho = \sum_i c_i \alpha_i$. If the first nonzero coefficient
$c_i$ is positive, then we say that $\rho$ is a positive root. (This definition is somewhat
arbitrary, of course, since we can mix up the root vectors. But, once a fixed 
basis for the roots has been chosen, this convention is a useful one.) 

For the group SU(2), for example, we notice that the representations are
indexed by the integral or half-integral l and contain elements that run from
$-l$ to $+l$, that is, they are symmetric if we rotate them 180°, exchanging $L_3$ for
$-L_3$. This symmetry under reflections is called a Weyl reflection. For higher
Lie algebras, the representations (when plotted on a graph whose coordinates
are the independent eigenvalues of the Cartan subalgebra) have a larger discrete
symmetry, which can be generated by a group of reflections called W, the Weyl
group. For example, if we take a root $\lambda$, we can rotate it as follows:

$$w_\alpha(\lambda) = \lambda - \alpha \langle \lambda, \alpha \rangle, \tag{4.5.3}$$

where the Weyl rotation operator $w_\alpha$ within W is associated with the root $\alpha$.

Let us also introduce the convenient notation that $e^\alpha$, where $\alpha$ is a root,
represents an operator that acts on an arbitrary root $\beta$ as follows:

$$e^\alpha(\beta) = e^{(\alpha, \beta)}. \tag{4.5.4}$$





Then, the classical Weyl formula states that the character of a representation
$L(\Lambda)$ (associated with a highest weight vector $\Lambda$) is given by [16]:

$$\mathrm{ch}\, L(\Lambda) = \frac{\sum_{w \in W} \epsilon(w)\, e^{w(\Lambda+\rho)-\rho}}{\sum_{w \in W} \epsilon(w)\, e^{w(\rho)-\rho}}, \tag{4.5.5}$$

where we sum over the elements of the Weyl group, and $\epsilon(w)$ is +1 (-1),
depending on whether the member of the Weyl group w can be expressed in
terms of an even (odd) number of reflections. Here, $\rho$ is half the sum over all
positive roots

$$\rho = \frac{1}{2} \sum_{\alpha > 0} \alpha. \tag{4.5.6}$$


Example: SU(3) 

This formula can be used to calculate the dimension appearing in any
representation R of a Lie group. Let Eq. (4.5.5) operate on a vector, called p, and
then let p go to zero. The limit as this arbitrary vector goes to zero can be
easily computed, and we find the celebrated result of Weyl:

$$\dim R = \prod_{\alpha > 0} \frac{(\Lambda + \rho, \alpha)}{(\rho, \alpha)}, \tag{4.5.7}$$

where the sum over the Weyl group has now been replaced by the product over
the positive roots. This is a very powerful result and can be used to calculate
the dimension of virtually all the representations found in classical Lie group
theory.

For example, for the group SU(3), we can calculate the dimension of a
representation with Dynkin coefficients $(m_1, m_2)$. Inserting this into the above
expression, we find

$$\dim R = \frac{(m_1+1)(m_2+1)(m_1+m_2+2)}{2}. \tag{4.5.8}$$

Inserting various values of $m_1$ and $m_2$ into the equation, we easily compute
the dimension of the well-known representations of SU(3).
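
As a small worked example, the product over positive roots in Eq. (4.5.7) can be evaluated directly for SU(3). The Python sketch below is illustrative only and uses names of our own. It assumes the usual normalization in which each simple root has $(\Lambda, \alpha_i) = m_i$ and $(\rho, \alpha_i) = 1$, so that a positive root written as an integer combination of simple roots contributes a simple ratio, and it compares the result with the closed form of Eq. (4.5.8).

```python
from fractions import Fraction

# Positive roots of SU(3) as integer combinations of the two simple roots:
# alpha_1, alpha_2, and alpha_1 + alpha_2.
POSITIVE_ROOTS = [(1, 0), (0, 1), (1, 1)]


def weyl_dimension(dynkin):
    """Evaluate prod_{alpha > 0} (Lambda + rho, alpha) / (rho, alpha)."""
    dim = Fraction(1)
    for root in POSITIVE_ROOTS:
        numerator = sum(c * (m + 1) for c, m in zip(root, dynkin))  # (Lambda + rho, alpha)
        denominator = sum(root)                                     # (rho, alpha)
        dim *= Fraction(numerator, denominator)
    return int(dim)


def su3_dimension(m1, m2):
    """Closed form of Eq. (4.5.8)."""
    return (m1 + 1) * (m2 + 1) * (m1 + m2 + 2) // 2


for m1, m2 in [(1, 0), (1, 1), (2, 0), (3, 0), (2, 1)]:
    print((m1, m2), weyl_dimension((m1, m2)), su3_dimension(m1, m2))
# (1, 0) 3 3
# (1, 1) 8 8
# (2, 0) 6 6
# (3, 0) 10 10
# (2, 1) 15 15
```

These are the familiar triplet, octet, sextet, decuplet, and 15-dimensional representations.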

Example: $SU(2)_k$

Now that we have treated the classical case, we wish to generalize this
discussion to the affine case. For a Kac-Moody algebra, the Weyl-Kac formula is,
remarkably enough, essentially the same as the Weyl formula, except that the
definitions of a root vector and the Weyl reflection have to be generalized.

We must generalize our previous discussion because the Kac-Moody
algebra differs from the usual Lie algebra in two essential ways. Besides the usual
root vectors, we also describe states by the number operator (eigenvalue of
$L_0$) and the c-number term appearing in the algebra. Thus, a root vector in the
Kac-Moody case actually has three entries

$$\hat\lambda = (\lambda, k, n), \tag{4.5.9}$$





where $\lambda$ is the classical root vector, k is the eigenvalue of the number operator,
and n is a c number.

We take the scalar product and the bracket product of two vectors $\hat\alpha =
(\alpha, k, n)$ and $\hat\beta = (\beta, k', n')$ in the following way:

$$(\hat\alpha, \hat\beta) = (\alpha, \beta) + kn' + nk', \qquad \langle \hat\beta, \hat\alpha \rangle = \frac{2(\hat\beta, \hat\alpha)}{(\hat\alpha, \hat\alpha)}. \tag{4.5.10}$$

We define the Weyl reflection in the same way as before

$$w_{\hat\alpha}(\hat\lambda) = \hat\lambda - \hat\alpha \langle \hat\lambda, \hat\alpha \rangle, \tag{4.5.11}$$

except for the important fact that, because the root vector now has three entries,
the effect of a Weyl reflection consists of a classical Weyl reflection and a
translation.

This translation is easy to see. If we let $\hat\alpha = (\alpha, 0, 1)$ and $\hat\lambda = (\lambda, k, n)$, then,
inserting both expressions into the definition of a Weyl reflection, we have

$$w_{\hat\alpha}(\hat\lambda) = \left( w_\alpha(\lambda + 2k\alpha/\alpha^2),\ k,\ n + \frac{1}{2k} \left[ \lambda^2 - (\lambda + 2k\alpha/\alpha^2)^2 \right] \right). \tag{4.5.12}$$

This Weyl reflection is easily split into two parts, the classical Weyl reflection
(which we denote by W) and a translation by the following vector:

$$\left( \lambda + k\beta,\ k,\ n + \frac{1}{2k} \left[ \lambda^2 - (\lambda + k\beta)^2 \right] \right), \tag{4.5.13}$$

where $\beta = 2\alpha/\alpha^2$. (For example, for the simple case of $SU(2)_k$, this separation
is trivial: the classical Weyl reflection just flips the root $\alpha$ into $-\alpha$, and the
translation produces a shift by $j\alpha$, where j is an integer.)

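Let us perform the sum over translation T first, thereby obtaining a $\Theta$
function, and sum over W later. When we perform this separation, we find

$$\begin{aligned}
\sum_{w \in \hat W} \epsilon(w)\, e^{w(\hat\rho)-\hat\rho} &= e^{-\hat\rho} \sum_{w \in W} \epsilon(w) \sum_{t \in M} e^{t(w(\hat\rho))} \\
&= e^{-\hat\rho + \hat\rho^2 \delta/2g} \sum_{w \in W} \epsilon(w)\, \Theta_{w(\hat\rho)},
\end{aligned} \tag{4.5.14}$$

where M is the lattice generated by translations, $\delta = (0, 0, 1)$, $\hat\rho =
\rho - g(0, 1, 0)$, and the $\Theta$ function comes directly from summation over the
translations

$$\Theta_\lambda = e^{-|\lambda|^2 \delta/2k} \sum_{t \in M} e^{t(\lambda)} = e^{k(0,1,0)} \sum_{\gamma \in M + k^{-1}\lambda} e^{-\frac{1}{2} k |\gamma|^2 (0,0,1) + k\gamma}. \tag{4.5.15}$$

In this form, the character can be written totally in terms of $\Theta$ functions as

$$\mathrm{ch}\, L(\Lambda) = e^{\left[ \frac{|\Lambda+\rho|^2}{2(k+g)} - \frac{|\rho|^2}{2g} \right] \delta}\ \frac{\sum_{w \in W} \epsilon(w)\, \Theta_{w(\Lambda+\rho)}}{\sum_{w \in W} \epsilon(w)\, \Theta_{w(\rho)}}. \tag{4.5.16}$$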





For the case $SU(2)_k$, we find a vast simplification in all our formulas. In this
case, there is only one root $\alpha$, where $\alpha^2 = 2$. The Weyl reflection simply flips
the root, and the sum over translations equals the sum over $j\alpha$ for integer j.
In particular, we find that the $\Theta$ function can be written as

$$\Theta_{n,m}(u, \tau, z) = e^{-2\pi i u} \sum_{j \in Z + n/2m} \exp(2\pi i \tau m j^2 + 2\pi i j z) \tag{4.5.17}$$

(where we set $u = z = 0$). Then everything can be expressed in terms of

$$C_{\Lambda+\rho} \to C_{n,k} = \Theta_{n,k} - \Theta_{-n,k}. \tag{4.5.18}$$

Simplifying the above results for our case, we find the final expression for the
character [17]:

$$\chi_\lambda(\tau) = \frac{C_{2j+1,\,k+2}}{C_{1,2}} = \eta^{-3}(\tau) \sum_{n=-\infty}^{\infty} [2n(k+2) + (2j+1)]\, e^{i\pi\tau [2n(k+2)+2j+1]^2 / 2(k+2)}, \tag{4.5.19}$$

where $\lambda = 2j + 1$. This is our final result for the character of $SU(2)_k$. (To
prove the last step, we used the identity $\eta^3(\tau) = \sum_m (4m+1)\, q^{(4m+1)^2/8}$.)
Under S and T, this formula transforms as


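$$\begin{aligned}
T:\quad & \chi_\lambda(\tau+1) = \exp\left[ 2\pi i \left( \frac{\lambda^2}{2N} - \frac{1}{8} \right) \right] \chi_\lambda(\tau), \\
S:\quad & \chi_\lambda(-1/\tau) = \sum_{\lambda'=1}^{k+1} \left( \frac{2}{k+2} \right)^{1/2} \sin\frac{\pi\lambda\lambda'}{k+2}\ \chi_{\lambda'}(\tau),
\end{aligned} \tag{4.5.20}$$

where $N = 2(k + 2)$.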

Using the modular transformations T and S for $SU(2)_k$, we can now read
the explicit values for the matrices that generate modular transformations

$$S^{(k)}_{jj'} = \left( \frac{2}{k+2} \right)^{1/2} \sin\frac{\pi(2j+1)(2j'+1)}{k+2}, \qquad T^{(k)}_{jj'} = \exp\left\{ 2\pi i \left[ \frac{(2j+1)^2}{4(k+2)} - \frac{1}{8} \right] \right\} \delta_{jj'}, \tag{4.5.21}$$

with $j, j' = 0, \ldots, k/2$. It is straightforward to check that $S^2 = (ST)^3 = 1$,
as they should.
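
The check $S^2 = (ST)^3 = 1$ is easy to carry out numerically. The following Python sketch is only an illustration with helper names of our own. It takes the S matrix of Eq. (4.5.21) and assumes that the diagonal T matrix carries the phase $\exp[2\pi i(h_j - c/24)]$ with the standard values $h_j = j(j+1)/(k+2)$ and $c = 3k/(k+2)$, which is the same as $(2j+1)^2/4(k+2) - 1/8$ in the exponent.

```python
import cmath
import math


def su2_s_matrix(k):
    """S matrix of Eq. (4.5.21); the index a = 2j runs over 0, 1, ..., k."""
    norm = math.sqrt(2.0 / (k + 2))
    return [[norm * math.sin(math.pi * (a + 1) * (b + 1) / (k + 2))
             for b in range(k + 1)] for a in range(k + 1)]


def su2_t_matrix(k):
    """Diagonal T matrix with phases exp(2 pi i ((2j+1)^2 / (4(k+2)) - 1/8))."""
    phases = [cmath.exp(2j * math.pi * ((a + 1) ** 2 / (4.0 * (k + 2)) - 0.125))
              for a in range(k + 1)]
    return [[phases[a] if a == b else 0.0 for b in range(k + 1)]
            for a in range(k + 1)]


def matmul(x, y):
    return [[sum(x[i][n] * y[n][j] for n in range(len(y)))
             for j in range(len(y[0]))] for i in range(len(x))]


def is_identity(x, tol=1e-9):
    return all(abs(x[i][j] - (1.0 if i == j else 0.0)) < tol
               for i in range(len(x)) for j in range(len(x)))


for k in range(1, 7):
    S, T = su2_s_matrix(k), su2_t_matrix(k)
    ST = matmul(S, T)
    print(k, is_identity(matmul(S, S)), is_identity(matmul(ST, matmul(ST, ST))))
```

Without the constant $-1/8$ in the exponent of T, the relation $(ST)^3 = 1$ would hold only up to a phase.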

Now that we have successfully calculated the characters for $SU(2)_k$, we are
in a position to exploit this result and calculate the characters of the minimal
models using the coset construction, giving us an independent check on the
correctness of our formalism. In the last chapter, we showed that the
minimal models are equivalent to the GKO construction for the coset, that is,
$G/H = SU(2)_k \otimes SU(2)_1 / SU(2)_{k+1}$. We now proceed by noticing that the
energy-momentum tensor $T(z)_G$ splits into two commuting sectors $T(z)_{G/H}$
and $T(z)_H$. This allows us to write the Fock space of the theory in terms of the





direct product of the two sectors. Concretely, it means that we can decompose
the character of affine G in terms of the characters associated with $T_{G/H}$.
Symbolically, we can write

$$\chi^{k_G}(\tau) = \chi_{G/H}(\tau)\, \chi^{k_H}(\tau). \tag{4.5.22}$$

Under a modular transformation, we have

$$\chi^{k_G}(\tau') = S^{k_G} \chi^{k_G}(\tau). \tag{4.5.23}$$

This means that

$$\chi^{k_G}(\tau') = \chi_{G/H}(\tau')\, S^{k_H} \chi^{k_H}(\tau). \tag{4.5.24}$$

We can now solve for the transformation of the character of the coset, so
that we have the desired result

$$\chi_{G/H}(\tau') = S^{k_G} \chi_{G/H}(\tau) \left( S^{k_H} \right)^{-1}. \tag{4.5.25}$$

(4.5.25) 


This, in turn, gives us an independent way in which to confirm our previous 
formulas concerning the minimal model. Using the above formula for the 
characters of the cosets, we can calculate the character of the minimal models 
in terms of the characters of the affine $SU(2)_k$. The calculation is not difficult.
We use Eqs. (4.5.25) and (3.3.10) to calculate the character of the minimal 
models and find exact agreement with Eq. (4.4.6), which was derived in an 
entirely different fashion, by subtracting the characters of null state Verma 
modules. 


4.6 A-D-E Classification 

Now that we have successfully computed the character $\chi_\lambda(\tau)$ for $SU(2)_k$, our
next step is to construct a modular invariant combination of such
representations. Because the $\chi_\lambda(\tau)$ are not modular invariant, we will assume that we
can create a genuine modular invariant by analyzing the bilinear expression
[9-11]:

$$Z = \sum_{\lambda, \lambda'} \chi^*_\lambda(\tau)\, N_{\lambda,\lambda'}\, \chi_{\lambda'}(\tau). \tag{4.6.1}$$

Fortunately, with some work, it is possible to write the complete list of solutions
for the $N_{\lambda\lambda'}$ matrix, giving us all modular invariant partition functions for affine
$SU(2)_k$. For example, one trivial solution is given by the diagonal matrix

$$N_{\lambda\lambda'} = \delta_{\lambda\lambda'}. \tag{4.6.2}$$

Then, the sum over $\lambda$ runs from 1 to $k + 1$, and the invariant becomes

$$\sum_{\lambda=1}^{k+1} |\chi_\lambda|^2. \tag{4.6.3}$$




The complete solution to the problem of constructing modular invariants for
$SU(2)_k$, however, is more involved.

Under a general modular transformation, the characters transform as

$$\chi_\lambda(\tau') = \sum_{\lambda'=0}^{N-1} U_{\lambda,\lambda'}(A)\, \chi_{\lambda'}(\tau), \tag{4.6.4}$$

where the $U(A)$ matrix generalizes the S and T matrices found earlier, and A
is an element of the modular group. It satisfies

$$U(A)U(A') = e^{i\phi(A, A')}\, U(AA'), \qquad U(A)U^\dagger(A) = 1. \tag{4.6.5}$$

It can be shown that modular invariance of Z requires the N
matrix of Eq. (4.6.1) to satisfy

$$NU(A) = U(A)N. \tag{4.6.6}$$

Finding the general solution of this equation is rather difficult and not very 
transparent, but the result is quite elegant. What is remarkable is that the final 
classification is so simple, corresponding in a one-to-one fashion with the 
A-D-E classification of Lie algebras. Each solution to the above equation 
corresponds to one of the simply laced Lie groups [18-22]. 

Let us write these modular invariant characters. We use the symbol 
Z(A, D,E) to represent the modular invariant that can be placed in 
correspondence with one of the simply laced Lie groups. Then, we find 
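$$\begin{aligned}
Z(A_{k+1}) &= \sum_{\lambda=1}^{k+1} |\chi_\lambda|^2, \qquad k \ge 1, \\
Z(D_{2\rho+2}) &= \sum_{\lambda\ \mathrm{odd}=1}^{2\rho-1} |\chi_\lambda + \chi_{4\rho+2-\lambda}|^2 + 2|\chi_{2\rho+1}|^2, \qquad k = 4\rho,\ \rho \ge 1, \\
Z(D_{2\rho+1}) &= \sum_{\lambda\ \mathrm{odd}=1}^{4\rho-1} |\chi_\lambda|^2 + |\chi_{2\rho}|^2 + \sum_{\lambda\ \mathrm{even}=2}^{2\rho-2} \left( \chi_\lambda \chi^*_{4\rho-\lambda} + \mathrm{c.c.} \right), \qquad k = 4\rho - 2,\ \rho \ge 2, \\
Z(E_6) &= |\chi_1 + \chi_7|^2 + |\chi_4 + \chi_8|^2 + |\chi_5 + \chi_{11}|^2, \qquad k + 2 = 12, \\
Z(E_7) &= |\chi_1 + \chi_{17}|^2 + |\chi_5 + \chi_{13}|^2 + |\chi_7 + \chi_{11}|^2 + |\chi_9|^2 \\
&\quad + \left[ (\chi_3 + \chi_{15}) \chi^*_9 + \mathrm{c.c.} \right], \qquad k + 2 = 18, \\
Z(E_8) &= |\chi_1 + \chi_{11} + \chi_{19} + \chi_{29}|^2 + |\chi_7 + \chi_{13} + \chi_{17} + \chi_{23}|^2, \qquad k + 2 = 30.
\end{aligned} \tag{4.6.7}$$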
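
The condition of Eq. (4.6.6) can be tested numerically for individual entries of this list. The Python sketch below is an illustration with names of our own. It assumes the $SU(2)_k$ matrices of Eq. (4.5.21), written in terms of $\lambda = 2j + 1$, with the T phase $\lambda^2/4(k+2) - 1/8$, and it encodes each invariant as a list of blocks (coefficient, row labels, column labels) of the matrix N in Eq. (4.6.1). The E_7 entry includes the cross term $(\chi_3 + \chi_{15})\chi^*_9 + \mathrm{c.c.}$, which also appears in the corresponding minimal-model invariant of Eq. (4.6.8).

```python
import cmath
import math


def su2_s(k):
    """S matrix of SU(2)_k with rows and columns labeled by lambda = 1..k+1."""
    norm = math.sqrt(2.0 / (k + 2))
    return [[norm * math.sin(math.pi * a * b / (k + 2))
             for b in range(1, k + 2)] for a in range(1, k + 2)]


def su2_t(k):
    """Diagonal T matrix with phases exp(2 pi i (lambda^2 / (4(k+2)) - 1/8))."""
    return [[cmath.exp(2j * math.pi * (a * a / (4.0 * (k + 2)) - 0.125)) if a == b else 0.0
             for b in range(1, k + 2)] for a in range(1, k + 2)]


def invariant_matrix(k, terms):
    """Build N of Eq. (4.6.1) from blocks (coefficient, rows, columns)."""
    n = [[0] * (k + 1) for _ in range(k + 1)]
    for coeff, rows, cols in terms:
        for a in rows:
            for b in cols:
                n[a - 1][b - 1] += coeff
    return n


def matmul(x, y):
    return [[sum(x[i][m] * y[m][j] for m in range(len(y)))
             for j in range(len(y[0]))] for i in range(len(x))]


def is_modular_invariant(k, terms, tol=1e-9):
    """Check N U = U N for the two generators U = S and U = T."""
    n = invariant_matrix(k, terms)
    for u in (su2_s(k), su2_t(k)):
        nu, un = matmul(n, u), matmul(u, n)
        if any(abs(nu[i][j] - un[i][j]) > tol
               for i in range(k + 1) for j in range(k + 1)):
            return False
    return True


D4 = [(1, [1, 5], [1, 5]), (2, [3], [3])]                                   # k = 4
E6 = [(1, [1, 7], [1, 7]), (1, [4, 8], [4, 8]), (1, [5, 11], [5, 11])]       # k = 10
E7 = [(1, [1, 17], [1, 17]), (1, [5, 13], [5, 13]), (1, [7, 11], [7, 11]),
      (1, [9], [9]), (1, [3, 15], [9]), (1, [9], [3, 15])]                   # k = 16
E8 = [(1, [1, 11, 19, 29], [1, 11, 19, 29]),
      (1, [7, 13, 17, 23], [7, 13, 17, 23])]                                 # k = 28

for name, k, terms in [("D4", 4, D4), ("E6", 10, E6), ("E7", 16, E7), ("E8", 28, E8)]:
    print(name, is_modular_invariant(k, terms))
```

A combination that is not on the list, such as $|\chi_1 + \chi_3|^2$ at k = 4, already fails to commute with T, because the two characters carry different phases.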

The deeper reason why this elegant one-to-one correspondence exists
between the modular invariants of $SU(2)_k$ and the simply laced Lie groups is still
rather obscure and not well understood. (General arguments can be made to 
show that, given a simply laced Lie group, one can construct modular invariant 





combinations for $SU(2)_k$. However, this does not explain why all the modular
invariant combinations should be generated in this way.) 

For completeness, we now present the A-D-E classification of the modular 
invariants for the minimal series mentioned earlier, where we displayed two 
infinite series and mentioned the existence of three exceptional cases. 

This series for the minimal theory can be placed in correspondence with 
pairs of simply laced algebras. For example, the three exceptional cases can 
be placed in correspondence with (A, E). 

The complete set of modular invariants for the minimal series are then given 
by [18-22]: 


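$$\begin{aligned}
Z(A_{p'-1}, A_{p-1}) &= \frac{1}{2} \sum_{r=1}^{p'-1} \sum_{s=1}^{p-1} |\chi_{rs}|^2, \\
Z(D_{2\rho+2}, A_{p-1}) &= \frac{1}{2} \sum_{s=1}^{p-1} \Bigg\{ \sum_{\substack{r\ \mathrm{odd}=1 \\ r \ne 2\rho+1}}^{4\rho+1} |\chi_{rs}|^2 + 2|\chi_{2\rho+1,s}|^2 + \sum_{r\ \mathrm{odd}=1}^{2\rho-1} \left( \chi_{rs} \chi^*_{r,p-s} + \mathrm{c.c.} \right) \Bigg\}, \qquad p' = 4\rho + 2,\ \rho \ge 1, \\
Z(D_{2\rho+1}, A_{p-1}) &= \frac{1}{2} \sum_{s=1}^{p-1} \Bigg\{ \sum_{r\ \mathrm{odd}=1}^{4\rho-1} |\chi_{rs}|^2 + |\chi_{2\rho,s}|^2 + \sum_{r\ \mathrm{even}=2}^{2\rho-2} \left( \chi_{rs} \chi^*_{p'-r,s} + \mathrm{c.c.} \right) \Bigg\}, \qquad p' = 4\rho,\ \rho \ge 2, \\
Z(E_6, A_{p-1}) &= \frac{1}{2} \sum_{s=1}^{p-1} \left( |\chi_{1s} + \chi_{7s}|^2 + |\chi_{4s} + \chi_{8s}|^2 + |\chi_{5s} + \chi_{11s}|^2 \right), \qquad p' = 12, \\
Z(E_7, A_{p-1}) &= \frac{1}{2} \sum_{s=1}^{p-1} \Big\{ |\chi_{1s} + \chi_{17s}|^2 + |\chi_{5s} + \chi_{13s}|^2 + |\chi_{7s} + \chi_{11s}|^2 \\
&\qquad\qquad + |\chi_{9s}|^2 + \left[ (\chi_{3s} + \chi_{15s}) \chi^*_{9s} + \mathrm{c.c.} \right] \Big\}, \qquad p' = 18, \\
Z(E_8, A_{p-1}) &= \frac{1}{2} \sum_{s=1}^{p-1} \left( |\chi_{1s} + \chi_{11s} + \chi_{19s} + \chi_{29s}|^2 + |\chi_{7s} + \chi_{13s} + \chi_{17s} + \chi_{23s}|^2 \right), \qquad p' = 30.
\end{aligned} \tag{4.6.8}$$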


(For these modular invariants, we find that the central charge equals $c =
1 - 6(p - p')^2/pp'$. To construct modular invariant combinations, we can show
that the matrix $N_{rs,r's'}$ factorizes in terms of sums over $N_{rr'}$ and $N_{ss'}$ with levels
$k = p - 2$ and $k' = p' - 2$. Since p and p' are coprime, this means that they
cannot both be even, so that one of the modular invariant combinations must





have N = 1, that is, it must be of the A-type. That is why the A series always 
appears in each pair of modular invariant combinations.) 


4.7 Higher Invariants and Simple Currents 

Although we achieved a complete classification of the modular
invariant combinations for $SU(2)_k$ in the last section, it may seem prohibitive
to generalize this calculation to the higher Kac-Moody algebras $SU(N)_k$.

Actually, there is a trick one can use which considerably cuts down the 
work necessary to generate the higher modular invariant combinations. In fact, 
we will only use part of the information contained within the fusion rules to 
calculate these modular invariant combinations. 

Let us begin by defining a simple current J [23] as a primary field which
has the following fusion rule with all other primary fields $\Phi$:

$$J \times \Phi = \Phi'. \tag{4.7.1}$$

This differs from the usual fusion rules in an important way. The crucial
observation is that just one primary field, rather than a sum, appears on the
right-hand side. We define the conjugate field $J^c$ such that it has the fusion
rule $J \times J^c = 1$.

Now let us multiply J repeatedly with itself n times

$$J^n = J \times J \times \cdots \times J. \tag{4.7.2}$$

Since the fusion rules are associative, $J^n$ is also a simple current, that is,
$J^n \times \Phi = \Phi_n$, where $\Phi_n$ is a single primary field. Sooner or later, we will find
that this process terminates because $J^N = 1$ for some N. Then the order N of
J is the smallest integer N for which $J^N = 1$. The orbit created by repeated
multiplication by J thus has N elements, such that $J_0 = J^N = 1$, $J_1 = J$,
and $J_n^c = J_{N-n}$. Thus, by repeatedly using the fusion rules as a multiplication
operator, we have generated an orbit of simple currents whose elements $\{J_n\}$
form the group $Z_N$.

(The orbit of J may not exhaust all possible simple currents. In general, other
primary fields may generate other orbits of order $N_i$, which in turn generate
the group $Z_{N_1} \otimes Z_{N_2} \otimes \cdots \otimes Z_{N_k}$.)

Now let us analyze the monodromy properties of simple currents a bit more
carefully. The fusion rules give us

$$J(z)J(w) \sim (z - w)^{-\alpha} J^2(w). \tag{4.7.3}$$

Let us now repeatedly multiply this fusion rule with J. Because $J^N = 1$, we
can show that the exponent $\alpha$ must equal $r/N$ for some integer r. We call this
integer r the monodromy parameter, which will label the modular invariants
we construct using this method.

Let the conformal weight of J under $L_0$ equal $h_J$. Since we know the
conformal weight of J, we can calculate the conformal weight $h_n$ of the
element $J^n$:

$$h_n = \frac{rn(N-n)}{2N} \mod 1 \tag{4.7.4}$$

for $r = 0, 1, \ldots, N-1$ for odd N and $r = 0, 1, \ldots, 2N-1$ for N even. (We
have imposed charge conjugation, so that $h_n = h_{N-n}$.)

Now that we have calculated the orbit of operators created by repeated
multiplication by J, let us calculate the properties of the orbit created
by repeatedly fusing another primary field $\Phi$ with J. Its fusion rule gives us
$J(z)\Phi(w) \sim (z - w)^{-t/N} \Phi_1(w)$ for some integer monodromy parameter t.

Using the same arguments as above, we can determine the conformal weight
of $\Phi_n$, denoted by $h(\Phi_n)$, in terms of the conformal weight $h(\Phi)$ of $\Phi$:

$$h(\Phi_n) = h(\Phi) + \frac{rn(N-n)}{2N} - \frac{tn}{N} \mod 1. \tag{4.7.5}$$

It is useful to introduce the concept of a conserved charge. The charge of $\Phi$
with respect to J is defined to be $Q(\Phi) = t/N \mod 1$. Then we have

$$Q(\Phi_n) = \frac{t}{N} + \frac{rn}{N}. \tag{4.7.6}$$

We also define $Q_n(\Phi) = nQ \mod 1$ to be the charge of $\Phi$ with respect to
the primary field $J^n$.

This charge is important for several reasons. First, if we take the fusion rule
$\Phi_i \times \Phi_j = \sum_k c_{ijk} \Phi_k + \cdots$, we find that

$$Q(\Phi_i) + Q(\Phi_j) = Q(\Phi_k) \tag{4.7.7}$$

for all fields which appear in the fusion rule. In other words, the charge is
conserved under the fusion rule.

Second, the charge appears when we carry a field around a twist field.
If we define a twist field as $T(z, \bar z) = J(z)J^c(\bar z)$, then we can calculate the
effect that this twist operator has when we carry a primary field $\Phi_{ij}(z, \bar z)$ (with
left (right) moving conformal weight labeled by i (j)) around the twist field. In
general, we pick up the phase $\exp(2\pi i Q)$, where the total charge Q is defined
as $Q = Q(\Phi_i) + Q(\Phi_j)$.

So far, our discussion of simple currents has been rather formal. We now 
come to the heart of this construction. If we are given any modular invariant 
partition function (e.g., the trivial, diagonal one), we can form yet another 
modular invariant function in a simple way. First, remove all states in the 
diagonal sum which are not invariant under Q (i.e., those which do not have 
integer charge) and then add all twisted sectors (all those obtained from the 
untwisted sector by acting with twist T ). 

Notice that this is a generalization of the usual trick used in constructing 
modular invariant partition functions on orbifold spaces, that is, we begin with 
a known modular invariant function on a given space, and then modify the sum 
by including contributions with different boundary conditions. In this way, new 





modular invariant combinations defined on orbifolds can be defined in terms 
of known modular invariants. 

There are several advantages to this approach. First, we do not have to know 
the entire set of fusion rules, just the set of fusion rules for the orbits. Second, 
the group structure of the orbits is trivial, given by Z N . Third, we can always 
generate new modular invariant combinations starting from a known invariant, 
such as the diagonal one. The resulting modular invariants will, in general, be 
nondiagonal. 

When we employ this trick, we will find that if a particular conformal field
theory has center $Z_N$, then different modular invariant combinations are
generated for every divisor of N if N is odd or if N and r are both even. In the
case where N is even and r is odd, this method generates different
modular invariant combinations for every divisor of N/2. (The standard diagonal
invariant corresponds to the divisor being 1.)

Let us examine how we turn a known modular invariant into another one by
this procedure. For example, if N = 9 and r = 3, we can convert the diagonal
modular invariant into a nondiagonal one.

For $Z_1$, we have

$$\sum_{\text{all orbits}}\ \sum_{i=0}^{8} |\chi_i|^2. \tag{4.7.8}$$

For $Z_3$, we have

$$\sum_{Q_3 = 0\ \text{orbits}} |\chi_0 + \chi_3 + \chi_6|^2 + |\chi_1 + \chi_4 + \chi_7|^2 + |\chi_2 + \chi_5 + \chi_8|^2. \tag{4.7.9}$$

For $Z_9$, we have

$$\sum_{Q_3 = 0\ \text{orbits}} |\chi_0 + \chi_3 + \chi_6|^2 + \left[ (\chi_1 + \chi_4 + \chi_7)(\chi_2 + \chi_5 + \chi_8)^* + \mathrm{c.c.} \right]. \tag{4.7.10}$$

Let us now be more precise. Let $J_n$ be of order N in $Z_N$. We begin by
postulating that the following combination is modular invariant: $M_{ij}\, \chi(\Phi_i)\, \chi^*(\Phi_j)$.

Then we make the appropriate twists, so the modular invariant turns into the
combination

$$\frac{1}{N} \sum_{l=0}^{N-1} e^{2\pi i l [Q_n(\Phi_i) + Q_n(\Phi_j)]} \sum_{k=0}^{N-1} M_{ij} \cdots \tag{4.7.11}$$


Notice that we have made two important changes in the original modular 
invariant combination. First, the sum over l projects onto charge singlet states. 
Second, the sum over k yields the twisted sector. 

From this, we can read off the explicit expression for the new modular matrix 
M corresponding to the new invariant. We find [23]: 


$$M_{J^\alpha a,\, J^\beta b} = \frac{1}{N} \sum_{p=1}^{N} \exp\left[ 2\pi i p \left( Q(a) + \frac{\alpha + \beta}{2N}\, r \right) \right], \tag{4.7.12}$$





where the label $J^\alpha a$ belongs to the orbit of the field $\Phi_a$ when we act on it by the
current $\alpha$ times. The sum over p has the effect that the expression equals one if
the argument within the parentheses equals 0 mod N and equals zero otherwise.
Notice that the argument in the parentheses is equal to $Q(J^\alpha a) + Q(J^\beta b)$.

The previous equation is the desired expression. It yields new modular
invariant combinations based on the method of simple currents. By applying
the S and T matrices corresponding to modular transformations, we find that
the previous expression is indeed modular invariant. For example, by explicit
calculation one can show that $SMS^\dagger = M$, where $S_{AB}$ generates modular
transformations.

These results, in turn, can be generalized to include Kac-Moody algebras.
For example, let us analyze the affine $SU(N)_k$. Primary fields, as in the classical
case, can be characterized by a Young tableau, which is a sequence of boxes
representing how we symmetrize or antisymmetrize the various indices. Let
the symbol $[m_1, m_2, \ldots, m_{N-1}]$ with $m_1 \le k$ represent the Young tableau of
a primary field, where $m_i$ is the length of the ith row in the tableau.

Consider the primary field $Y_1 = [k, 0, \ldots, 0]$. One can show that this
primary field is a simple current. If we multiply $Y_1$ with another primary field,
then we simply increase the length of the first k columns by one, yielding
another Young tableau. $Y_1$ is a simple current, and so are the elements of the
orbit given by $Y_n = [k, k, k, \ldots, 0]$, which has n rows of length k.
Furthermore, with a little work, one can show that the fields given by $Y_n$ are the only
simple currents. The group generated by the orbit is isomorphic to $Z_N$, which
corresponds to the center of $SU(N)_k$, which is also $Z_N$. Thus, the center
generated by the simple currents corresponds to the center of the corresponding
Kac-Moody algebra.

Given the explicit representation of the simple currents of SU(N), we can
now calculate the conformal weights of the operators

$$h_n = \frac{kn(N-n)}{2N}, \tag{4.7.13}$$


which agrees with our previous expression for conformal weights if we identify 
the monodromy parameter r with the level k. 
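
The identification of r with k can be spot-checked with exact arithmetic. The Python sketch below is an illustration with function names of our own. It compares Eq. (4.7.13) with Eq. (4.7.4) at r = k, and for N = 2 it compares the weight of the simple current with the spin $j = k/2$ field of $SU(2)_k$, assuming the standard weight $h_j = j(j+1)/(k+2)$.

```python
from fractions import Fraction


def orbit_weight(r, N, n):
    """h_n of Eq. (4.7.4): r n (N - n) / (2N), reduced mod 1."""
    return Fraction(r * n * (N - n), 2 * N) % 1


def su_n_current_weight(k, N, n):
    """h_n of Eq. (4.7.13) for the simple current Y_n of SU(N) at level k."""
    return Fraction(k * n * (N - n), 2 * N)


def su2_weight(k, two_j):
    """Assumed standard SU(2)_k weight h_j = j(j+1)/(k+2), with two_j = 2j."""
    j = Fraction(two_j, 2)
    return j * (j + 1) / (k + 2)


# For SU(2)_k the only nontrivial simple current is the spin j = k/2 field.
for k in range(1, 6):
    print(k, su_n_current_weight(k, 2, 1), su2_weight(k, k))
# 1 1/4 1/4
# 2 1/2 1/2
# 3 3/4 3/4
# 4 1 1
# 5 5/4 5/4
```

The symmetry $h_n = h_{N-n}$ imposed by charge conjugation is manifest in both formulas.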

Continuing in this way, we can derive all known results for the modular 
invariants of the various groups, such as the complete classification for $SU(2)_k$,
as well as partial results for $SU(N)_k$. This method also generates new modular
invariants. 


4.8 Diagonalizing the Fusion Rules 

At this point, it may appear that the discussion of characters and modular properties
is divorced from the fusion rules discussed earlier. However, because of
the highly restrictive nature of conformal symmetry and because a great deal of
information is encoded within the S matrix, there is a rather remarkable formula





found by Verlinde [24] and proven by Moore and Seiberg [25] concerning
the relationship between these two concepts. In fact, given the modular S
matrix, which governs the modular properties of the characters, we can actually
calculate the fusion rules! Specifically, it turns out that the fusion rules, which
are determined by the matrix $N_{ij}^k$, can be written in terms of the S matrix in
the following fashion.

First, we will show that S diagonalizes the fusion rules, that is,

$$N_{ij}^k = \sum_n S_j^n\, \lambda_i^{(n)}\, (S^\dagger)_n^k, \tag{4.8.1}$$

where the $\lambda_i^{(n)}$'s are the eigenvalues of the N matrix. Using this, we can now
present the full statement

$$N_{ij}^k = \sum_n \frac{S_j^n\, S_i^n\, (S^\dagger)_n^k}{S_0^n}. \tag{4.8.2}$$


This allows us to calculate the fusion rules by inserting the modular S matrix
into this equation, giving us an independent check on previous results and also
giving us new fusion relations. For example, for $SU(2)_k$, we can use the fact
that the S matrix is [see Eq. (4.5.21)]:

$$S_{jj'} = \left( \frac{2}{k+2} \right)^{1/2} \sin\frac{\pi(2j+1)(2j'+1)}{k+2}. \tag{4.8.3}$$

Inserting this expression into our formula for the N matrix via Eq. (4.8.2), we
find that the fusion rules are given by

$$SU(2)_k:\quad [\phi_j] \times [\phi_{j'}] = \sum_{j''=|j-j'|}^{\min(j+j',\, k-j-j')} [\phi_{j''}]. \tag{4.8.4}$$

[In passing, we remark that the fusion rules for the minimal model, found in
Eq. (2.6.12), can be seen to be related to the $SU(2)_k$ fusion rules if we make
the substitution $p_i = 2j_i + 1$ and $q_i = 2j'_i + 1$.]
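
The step from Eq. (4.8.2) to Eq. (4.8.4) can be verified numerically for small levels. The following Python sketch is illustrative, with function names of our own. It inserts the real, symmetric S matrix of Eq. (4.8.3) into Eq. (4.8.2), so that $S^\dagger = S$, rounds the result to integers, and compares the nonzero entries with the truncated Clebsch-Gordan series of Eq. (4.8.4). Fields are labeled by the integer 2j.

```python
import math


def su2_s_matrix(k):
    """S matrix of Eq. (4.8.3); the index a = 2j runs over 0, 1, ..., k."""
    norm = math.sqrt(2.0 / (k + 2))
    return [[norm * math.sin(math.pi * (a + 1) * (b + 1) / (k + 2))
             for b in range(k + 1)] for a in range(k + 1)]


def verlinde_coefficients(k):
    """N[a][b][c] from Eq. (4.8.2), with S real and symmetric."""
    s = su2_s_matrix(k)
    size = k + 1
    return [[[round(sum(s[a][n] * s[b][n] * s[c][n] / s[0][n] for n in range(size)))
              for c in range(size)] for b in range(size)] for a in range(size)]


def truncated_series(k, a, b):
    """Values of 2j'' allowed by Eq. (4.8.4) for 2j = a and 2j' = b."""
    return list(range(abs(a - b), min(a + b, 2 * k - a - b) + 1, 2))


k = 2
N = verlinde_coefficients(k)
for a in range(k + 1):
    for b in range(a, k + 1):
        fused = [c for c in range(k + 1) if N[a][b][c] == 1]
        print(a, b, fused, truncated_series(k, a, b))
```

For k = 2 the output reproduces the Ising-like rules: the spin-1/2 field fused with itself gives spins 0 and 1, while the spin-1 field fused with itself gives only the identity.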

To prove this remarkable result, Eq. (4.8.2), which shows that the S matrix
diagonalizes the fusion rules, we first remind ourselves that the character is
obtained by tracing over the Verma module associated with the ith primary
field $[\phi_i]$:

$$\chi_i(\tau) = \mathrm{Tr}_{[\phi_i]} \left( q^{L_0 + \epsilon} \right) \tag{4.8.5}$$

for $\epsilon = -c/24$ and $q = e^{2\pi i \tau}$. Notice that the S operation changes the ith
character into a sum over the jth characters

$$S:\ \chi_i(-1/\tau) = \sum_j S_i^j\, \chi_j(\tau). \tag{4.8.6}$$



Our goal is to change the summation over the ith module to the jth module.
To do this, we are going to manipulate this expression by changing the basis
of the summation in the trace.

Within the trace, let us insert the number 1, which does nothing. The trick,
however, is to rewrite the number 1 as the operator product expansion of a
primary field $\phi_i$ and its conjugate field. In the trace operation, let the a cycle
represent the line of equal $\tau$, and the b cycle the line of equal $\sigma$.

Now, move the $\phi_i$ field along the $\tau$ direction, until it hits the summation over
the $\phi_j$ primary field. When $\phi_i$ and $\phi_j$ come close to each other, we must use
the fusion rules. Notice that the effect of using the fusion rules is to change the
basis of the summation in the trace.

Let us define this operation as $\phi_i(b)$, which is an operator, not a field. Thus,
we have

$$\phi_i(b)\chi_j = N_{ij}^k \chi_k, \qquad \phi_i(a)\chi_j = \lambda_i^{(j)} \chi_j. \tag{4.8.7}$$

In the second equation, we have inserted $\phi_i$ into the trace and then moved it
along an a cycle, that is, a line of equal $\tau$. This does not change the basis of
the states at all, but simply inserts a constant matrix into the trace.

There is a big difference between these two expressions. The transformation 
along the b cycle turns the jth character into a sum over the kth characters, 
while a transformation along the a cycle simply maps the jth character back 
into itself. 

The important step is now to perform the S operation on both of the above 
equations. The a cycle and b cycle are interchanged, while the characters 
change via the S matrix. Thus, the S operation changes these two relations 
into the following: 

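$$\phi_i(a)S_j^{\,k}\chi_k = N_{ij}^{\,k}S_k^{\,l}\chi_l, \qquad \phi_i(b)S_j^{\,k}\chi_k = \lambda_i^{(j)}S_j^{\,k}\chi_k. \qquad (4.8.8)$$

By simple manipulations of these two equations, we now have 

$$N_{ij}^{\,k} = \sum_n S_j^{\,n}\,\lambda_i^{(n)}\,(S^{\dagger})_n^{\,k}. \qquad (4.8.9)$$

We also know that $N_{i0}^{\,k} = \delta_i^k$ (because the fusion of the ith Verma module 
with the identity again yields the ith Verma module). Putting j = 0 into the 
previous expression, we get 

$$\lambda_i^{(n)} = \frac{S_i^{\,n}}{S_0^{\,n}}. \qquad (4.8.10)$$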

Inserting this expression into the previous one, we now have the desired 
expression. [We also note that Eq. (4.8.2) can be proven more rigorously using 
the pentagon and hexagon formula [25] of rational conformal field theory.] It 
will be helpful to illustrate this with some examples. 

Let us take the case when there are no primary fields at all, except for the 
identity. This may seem trivial, but it is actually quite illustrative of several 
important principles. In this case, the only primary field is the identity, and we 





have a simple expression for the action of T and S: 

$$T:\ \chi \to e^{2\pi i(-c/24)}\chi, \qquad S:\ \chi \to \chi, \qquad (4.8.11)$$

so S = 1. 

We must also satisfy the relation $(ST)^3 = 1$; therefore, we have the constraint 
$[e^{2\pi i(-c/24)}]^3 = 1$, or 

$$c = 0 \bmod 8. \qquad (4.8.12)$$

For example, for k = 1, this can be satisfied for the affine groups $E_8$ (where 
c = 8) and for $SO(16)$ (where c = 16). 

Let us now examine the case when there is only one nontrivial primary field. 
In this case, the only possible fusion rule is 

$$[\phi] \times [\phi] = 1 + n[\phi]. \qquad (4.8.13)$$

Then, a representation of the S and T matrices is as follows: 

$$S = \begin{pmatrix} \cos\theta & \sin\theta \\ \sin\theta & -\cos\theta \end{pmatrix}, \qquad T = \begin{pmatrix} e^{2\pi i(-c/24)} & 0 \\ 0 & e^{2\pi i(h-c/24)} \end{pmatrix}. \qquad (4.8.14)$$

Now let us impose two constraints. The first is that the S matrix diagonalizes 
the fusion rules. This easily leads us to $\tan\theta = \lambda$, where $\lambda$ is an eigenvalue 
of the fusion matrix and hence satisfies $\lambda^2 = 1 + n\lambda$. Then, the second 
constraint $(ST)^3 = 1$ reduces to 

$$12h - c = 2 \pmod 8, \qquad \cos 2\pi h = -\tfrac{1}{2}n\lambda. \qquad (4.8.15)$$

These values are only defined modulo factors of 8. (This is because we can 
always tensor any conformal field theory with an independent affine $E_8$ theory, 
which has c = 8, which does not change the value of h or the fusion rules.) 

For n = 0, one example is the level 1 SU(2) model. For n = 1, some 
examples include the level 1 $G_2$ and $F_4$ WZW models. (For higher n, we find 
that there are no consistent solutions to the various modular constraints.) 
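
The two constraints can be checked numerically for the examples just named. The sketch below builds the 2 x 2 matrices S and T with $\tan\theta = \lambda$, where $\lambda$ is taken to be the positive root of $\lambda^2 = 1 + n\lambda$, and tests both that S diagonalizes the fusion matrix and that $(ST)^3 = 1$. The values (h, c) = (1/4, 1), (2/5, 14/5), and (3/5, 26/5) for level 1 SU(2), $G_2$, and $F_4$ are standard values supplied here as assumptions; they are not quoted in the text above.

```python
import cmath
import math


def matmul(A, B):
    """Product of two 2 x 2 matrices given as nested lists."""
    return [[sum(A[i][m] * B[m][j] for m in range(2)) for j in range(2)]
            for i in range(2)]


def two_field_matrices(n, h, c):
    """S and T for a theory with one nontrivial primary field of weight h."""
    lam = (n + math.sqrt(n * n + 4)) / 2  # positive root of lam^2 = 1 + n*lam
    theta = math.atan(lam)
    S = [[math.cos(theta), math.sin(theta)],
         [math.sin(theta), -math.cos(theta)]]
    T = [[cmath.exp(2j * math.pi * (-c / 24)), 0],
         [0, cmath.exp(2j * math.pi * (h - c / 24))]]
    return S, T


def check(n, h, c):
    """Return (deviation of (ST)^3 from 1, size of off-diagonal part of S N S)."""
    S, T = two_field_matrices(n, h, c)
    ST = matmul(S, T)
    st_cubed = matmul(ST, matmul(ST, ST))
    deviation = max(abs(st_cubed[i][j] - (1 if i == j else 0))
                    for i in range(2) for j in range(2))
    fusion_matrix = [[0, 1], [1, n]]            # phi x phi = 1 + n phi
    D = matmul(S, matmul(fusion_matrix, S))     # S is its own inverse
    off_diagonal = max(abs(D[0][1]), abs(D[1][0]))
    return deviation, off_diagonal


# Assumed level 1 data: (n, h, c) for SU(2), G_2, and F_4.
EXAMPLES = [(0, 1 / 4, 1.0), (1, 2 / 5, 14 / 5), (1, 3 / 5, 26 / 5)]
```

Calling `check(n, h, c)` on each entry of `EXAMPLES` should return two numbers at the level of floating-point rounding error.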


4.9 RCFT: Finite Number of Primary Fields 

In this section, we will briefly review the arguments used to show that if the 
number of primary fields is finite, then the values of h and c must be rational 
numbers. We call these theories rational conformal field theories (RCFT). Let 
us begin with a sphere with four punctures, with primary fields $\phi_i$ located at 
each of the punctures, and then analyze the Dehn twists that one can make on 
them [26]. 

Let $\tau_i$ equal a Dehn twist where we twist around a circle that wraps around 
the ith external puncture. Let $\tau_{ij}$ equal a Dehn twist where we twist around a 




circle that wraps around both the ith and jth punctures. By explicitly performing 
the Dehn twists on a sphere, we can show that the following relation holds: 

$$\tau_1\tau_2\tau_3\tau_4 = \tau_{12}\tau_{13}\tau_{23}. \qquad (4.9.1)$$

Our strategy is now simple: we will perform the Dehn twists on a tensor 
defined on the product space of four primary fields. By equating the action 
of the left-hand side with the action of the right-hand side, we will have an 
enormously powerful restriction on both the values of h and c. 

The action of each $\tau_i$ is trivial. The operator $e^{2\pi i(L_0 - \bar L_0)}$ generates the twist. 
We fix the eigenvalue of $L_0$ to be equal to the eigenvalue of $\bar L_0$ modulo integers. 
Then, we define the phase 

$$\alpha_i = e^{2\pi i h_i}. \qquad (4.9.2)$$

The action of the Dehn twist $\tau_i$ on the product space of the primary fields is 
then just a multiplication by $\alpha_i$, since the eigenvalue of $L_0$ (appearing within 
the twist operator) is $h_i$. 

Let the dimensionality of the product space for the product of four primary 
fields be labeled $N_{ijkl}$. Then, the action of the Dehn twists is 

$$\tau_1\tau_2\tau_3\tau_4 \to \alpha_1\alpha_2\alpha_3\alpha_4\,I, \qquad (4.9.3)$$

where I is the identity operator, which is an $N_{ijkl} \times N_{ijkl}$ matrix. If we take the 
determinant of this matrix, the answer is 

$$(\alpha_1\alpha_2\alpha_3\alpha_4)^{N_{ijkl}}. \qquad (4.9.4)$$

The action of the other Dehn twists on the right-hand side of Eq. (4.9.1), 
however, is more complicated because the action of $\tau_{ij}$ is not diagonal. In 
general, the action of this Dehn twist mixes up the representations, so that 
the resulting matrix cannot be simultaneously diagonalized for all such Dehn 
twists. 

The answer is to diagonalize each Dehn twist, one at a time. Take $\tau_{12}$. Now, 
slice the sphere in half, so that the pairs of punctures 12 and 34 appear on 
opposite sides of the slice. Place a primary field $\phi_r$ at the slice. Notice that we 
now have two smaller spheres, each with three punctures, with primary fields 
1, 2, r on one sphere and 3, 4, r on the other sphere. The Dehn twist $\tau_{12}$ is 
then represented by $\exp(2\pi i L_0)$, where $L_0$ acts on the rth primary field. The 
contribution of this Dehn twist (in the basis where the rth space is diagonal) 
is given by 

(4.9.5) 

where $N_{ijr}$ are the usual fusion coefficients. 

If we take any other Dehn twist and slice along any other channel, we can 
still use the rth Hilbert space if we use a matrix U that changes basis. Thus, 

$$\tau_{23} \to U\tau_{23}U^{-1}. \qquad (4.9.6)$$





The action of all three Dehn twists, over all slices, will contain many U 
matrices. However, if we take the determinant of the resulting product, all of 
them will conveniently drop out. Thus, taking the determinant of the right-hand 
side and setting it equal to the determinant of the left-hand side, we 
arrive at the final formula 

$$(\alpha_i\alpha_j\alpha_k\alpha_l)^{N_{ijkl}} = \prod_r \alpha_r^{\,N_{ijkl,r}}, \qquad (4.9.7)$$

where 

$$N_{ijkl,r} = N_{ijr}N_{klr} + N_{jkr}N_{ilr} + N_{ikr}N_{jlr}. \qquad (4.9.8)$$

(Each term in the expansion on the right-hand side corresponds to a slice that 
bisects the sphere in half, such that we place the rth space of primary fields at 
the slice.) 
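
To see the determinant relation at work, we can test it on a family of theories whose data are known in closed form. The sketch below uses $SU(2)_k$, assuming the weights $h_j = j(j+1)/(k+2)$ and the truncated fusion rules; both are inputs to the check rather than consequences of it. Writing $\alpha_r = e^{2\pi i h_r}$, the relation says that $N_{ijkl}(h_i+h_j+h_k+h_l) - \sum_r N_{ijkl,r}h_r$ must be an integer, which exact rational arithmetic can confirm.

```python
from fractions import Fraction


def fusion(k):
    """Assumed SU(2)_k fusion coefficients N[a][b][c], with a = 2j."""
    n = k + 1
    N = [[[0] * n for _ in range(n)] for _ in range(n)]
    for a in range(n):
        for b in range(n):
            for c in range(abs(a - b), min(a + b, 2 * k - a - b) + 1, 2):
                N[a][b][c] = 1
    return N


def weight(a, k):
    """Assumed conformal weight h_j = j(j+1)/(k+2), written with a = 2j."""
    return Fraction(a * (a + 2), 4 * (k + 2))


def dehn_twist_exponent(i, j, m, l, k, N):
    """Exponent of exp(2*pi*i) in (left side)/(right side) of the relation."""
    fields = range(k + 1)
    n_ijml = sum(N[i][j][r] * N[m][l][r] for r in fields)
    lhs = n_ijml * (weight(i, k) + weight(j, k) + weight(m, k) + weight(l, k))
    rhs = sum((N[i][j][r] * N[m][l][r]      # slice separating (ij) from (ml)
               + N[j][m][r] * N[i][l][r]    # slice separating (jm) from (il)
               + N[i][m][r] * N[j][l][r])   # slice separating (im) from (jl)
              * weight(r, k) for r in fields)
    return lhs - rhs


def relation_holds(k):
    """True if the exponent is an integer for every choice of four fields."""
    N = fusion(k)
    fields = range(k + 1)
    return all(Fraction(dehn_twist_exponent(i, j, m, l, k, N)).denominator == 1
               for i in fields for j in fields for m in fields for l in fields)
```

For example, at level 2 with i = j = 1/2 and k = l = 1 the exponent works out to 11/8 - 3/8 = 1.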

Now, we make the crucial assumption: the number of primary fields is finite. 
Assume that there are N primary fields. Then, the previous equation is highly 
overconstrained. There are only N unknowns (corresponding to the $\alpha_r$), but 
there are 

$$(N+1)(N+2)(N+3)(N+4)/4! \qquad (4.9.9)$$

relations among them. Our goal is to show that, unless h and c are rational, 
there is no solution for finite N. 

Let us first show that h must be rational. Let us set i = j = k = l. Then, 
there are N equations in N unknowns. 

The ith equation reduces to 

$$(\alpha_i)^{4N_{iiii} - N_{iiii,i}} \prod_{r \neq i} (\alpha_r)^{-N_{iiii,r}} = 1. \qquad (4.9.10)$$

If we define the matrix 

$$M_{ir} = \delta_{ri}\,(4N_{iiii} - N_{iiii,i}) + (1 - \delta_{ri})(-N_{iiii,r}), \qquad (4.9.11)$$

then our constraint equation reads 

$$Mh = 0 \bmod 1, \qquad (4.9.12)$$

where h is now a column matrix with entries given by $h_i$. It is not difficult to 
show that M is invertible and has a nonzero determinant. Then, 

$$kh = 0 \bmod 1, \qquad (4.9.13)$$

where k = det M. This implies that the $h_i$ are multiples of the inverse of k, so 
that the $h_i$ are rational, as desired. Next, let us show that c is also rational. 
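
A small worked case makes the argument for h concrete. The sketch below builds the matrix M for $SU(2)_k$, again assuming the truncated fusion rules and the weights $h_j = j(j+1)/(k+2)$, and computes det M exactly. The helper names are illustrative and the level `k` in the code is the Kac-Moody level, not the determinant.

```python
from fractions import Fraction


def fusion(k):
    """Assumed SU(2)_k fusion coefficients N[a][b][c], with a = 2j."""
    n = k + 1
    N = [[[0] * n for _ in range(n)] for _ in range(n)]
    for a in range(n):
        for b in range(n):
            for c in range(abs(a - b), min(a + b, 2 * k - a - b) + 1, 2):
                N[a][b][c] = 1
    return N


def weights(k):
    """Assumed conformal weights h_j = j(j+1)/(k+2), indexed by a = 2j."""
    return [Fraction(a * (a + 2), 4 * (k + 2)) for a in range(k + 1)]


def constraint_matrix(k):
    """M[i][r] = delta_ri (4 N_iiii - N_iiii,i) - (1 - delta_ri) N_iiii,r."""
    N = fusion(k)
    n = k + 1
    M = [[0] * n for _ in range(n)]
    for i in range(n):
        n_iiii = sum(N[i][i][r] ** 2 for r in range(n))
        for r in range(n):
            n_iiii_r = 3 * N[i][i][r] ** 2  # three equal slicings when i = j = k = l
            M[i][r] = 4 * n_iiii - n_iiii_r if r == i else -n_iiii_r
    return M


def determinant(M):
    """Exact determinant by Gaussian elimination over the rationals."""
    A = [[Fraction(x) for x in row] for row in M]
    n = len(A)
    det = Fraction(1)
    for col in range(n):
        pivot = next((r for r in range(col, n) if A[r][col] != 0), None)
        if pivot is None:
            return Fraction(0)
        if pivot != col:
            A[col], A[pivot] = A[pivot], A[col]
            det = -det
        det *= A[col][col]
        for r in range(col + 1, n):
            factor = A[r][col] / A[col][col]
            for c in range(col, n):
                A[r][c] -= factor * A[col][c]
    return det
```

At level 1 the matrix is [[1, 0], [-3, 4]], so det M = 4 and the weights must be multiples of 1/4, consistent with h = 0 and h = 1/4.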

We recall that S generates modular transformations that interchange the a 
and b cycles, while T twists the a cycle. They satisfy 

$$(ST)^3 = 1. \qquad (4.9.14)$$





As before, let us take the determinant. One complication is that the 
representations may or may not be self-conjugate. If they are self-conjugate, 
then $S^2 = 1$. If they are not, then $S^4 = 1$. 

Thus, let us raise the previous equation to the second or fourth power and 
then take the determinant. We find, in these two distinct cases, 

$$\det(T)^6 = 1, \qquad \det(T)^{12} = 1. \qquad (4.9.15)$$

Now, treat this equation as an operator equation, operating on the product 
space of primary fields of the sphere with punctures. Since the eigenvalues of 
the T matrix are given by $\exp[2\pi i(h_i - c/24)]$, we know that 

$$\det T = \prod_r \alpha_r \exp\left(-\frac{2\pi i c}{24}\right). \qquad (4.9.16)$$

Raising this to the sixth or twelfth power, we then find 

$$\det(T)^6 = \exp\left[-\frac{(N+1)\pi i c}{2}\right]\prod_{r=0}^{N}(\alpha_r)^6 = 1,$$
$$\det(T)^{12} = \exp\left[-(N+1)\pi i c\right]\prod_{r=0}^{N}(\alpha_r)^{12} = 1. \qquad (4.9.17)$$


Solving for c, we find that it must be rational if h is rational, as desired [26]. 
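
The determinant condition is easy to test on explicit data. As a minimal sketch, assume the $SU(2)_k$ values $c = 3k/(k+2)$ and $h_j = j(j+1)/(k+2)$, whose representations are self-conjugate; then det T = exp(2 pi i x) with x the sum of $h_r - c/24$ over all primary fields, and the condition $\det(T)^6 = 1$ says that 6x is an integer.

```python
from fractions import Fraction


def su2_data(k):
    """Assumed central charge and conformal weights of SU(2)_k."""
    c = Fraction(3 * k, k + 2)
    weights = [Fraction(a * (a + 2), 4 * (k + 2)) for a in range(k + 1)]
    return c, weights


def det_t_exponent(k):
    """Return x such that det T = exp(2*pi*i*x)."""
    c, weights = su2_data(k)
    return sum(h - c / 24 for h in weights)
```

For these data the sum collapses to x = k(k+1)/12, so 6x = k(k+1)/2 is always an integer.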

Last, before ending this chapter, let us remark on the completeness of our 
analysis. For tree graphs, it can be shown that the B and F operators and the 
hexagon and pentagon identities they obey enable us to completely determine 
the rational conformal field theory. We have replaced the infinite set of Virasoro 
generators and the infinite set of elements within Verma modules with a finite set 
of equations given by the 6j formalism. In some sense, they can be treated as 
the defining relations for a rational conformal field theory. 

At the higher loop level, we see that this is not enough. We must also define 
the S and T operators. However, it can be shown that if the one-loop graph 
for a rational conformal field theory is modular invariant, then all higher loop 
graphs must also be modular invariant using the combined set of operators 
[27]. This is a gratifying result, because it shows that the braiding and modular 
operators that we have so patiently constructed in the past few chapters are 
actually enough to define the entire modular invariant multiloop series. No 
new operators are necessary. We have, in some sense, finished the program of 
defining the perturbation series for the rational conformal field theories. 


4.10 Summary 

Because of the large number of conformal field theories that one can write, 
we wish to impose physical conditions on them. One of the most important 
is modular invariance, that is, we wish to subtract those global conformal 





transformations that cannot be continuously deformed to the identity. 
Perturbatively, when we sum over inequivalent Riemann surfaces, we must divide 
by the modular group, or else we will have infinite overcounting. 

A torus, for example, can be represented by a parallelogram whose opposite 
sides are identified. If this parallelogram is placed on the x axis, with one 
corner at the origin, then the complex parameter $\tau$ uniquely specifies the torus. 
However, if $\tau$ is mapped into 

$$T:\ \tau \to \tau + 1, \qquad S:\ \tau \to -1/\tau, \qquad (4.10.1)$$

then the torus is mapped into itself. These two transformations, in turn, generate 
the modular group SL(2, Z), defined by 

$$\tau \to \frac{a\tau + b}{c\tau + d} \qquad (a, b, c, d \in \mathbb{Z};\ ad - bc = 1). \qquad (4.10.2)$$
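
The two generators can be represented by integer 2 x 2 matrices acting on $\tau$ by fractional linear transformations. The short sketch below (matrix conventions chosen here for illustration) checks that $S^2$ and $(ST)^3$ both equal minus the identity matrix, which acts trivially on $\tau$.

```python
def matmul(A, B):
    """Product of two 2 x 2 integer matrices."""
    return [[sum(A[i][m] * B[m][j] for m in range(2)) for j in range(2)]
            for i in range(2)]


def act(M, tau):
    """Fractional linear action tau -> (a*tau + b) / (c*tau + d)."""
    (a, b), (c, d) = M
    return (a * tau + b) / (c * tau + d)


T = [[1, 1], [0, 1]]    # tau -> tau + 1
S = [[0, -1], [1, 0]]   # tau -> -1/tau

ST = matmul(S, T)
ST_CUBED = matmul(ST, matmul(ST, ST))
S_SQUARED = matmul(S, S)
```

Because a matrix and its negative give the same map on $\tau$, the relations $S^2 = 1$ and $(ST)^3 = 1$ hold for the action on the torus parameter.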

We wish to calculate the effect of the modular group on a representation 
of the conformal group. Specifically, we wish to calculate how the characters, 
which count how many states there are at each level, transform under the 
modular group. 

The character is defined as 

$$\mathrm{Tr}\,x^{L_0} = \sum_{n=0}^{\infty} x^n \dim V_n = \sum_{n=0}^{\infty} x^n p(n). \qquad (4.10.3)$$

In general, we will find that modular invariant combinations are constructed 
out of the characters as follows: 

$$Z = \sum_{a,b} \chi_a^{*}\,N_{ab}\,\chi_b \qquad (4.10.4)$$

for some constant matrix $N_{ab}$. 

The characters for a free fermion field, c = 1/2, can be calculated by taking 
into account the periodic (or antiperiodic) boundary conditions. There are 
four ways in which to specify the boundary conditions, depending on whether 
the boundary conditions on opposite sides of the parallelogram are periodic 
(Ramond) or antiperiodic (Neveu-Schwarz). 

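We can, however, also rearrange them according to the conformal weights. 
We know that a c = 1/2 system has three primary fields, with weights 0, 1/2, 
and 1/16. 

By rearranging the above characters, we can write the characters for each 
conformal weight: 

$$\chi_0 = \tfrac12[\chi(A,A) + \chi(P,A)] = \tfrac12\left(\sqrt{\vartheta_3/\eta} + \sqrt{\vartheta_4/\eta}\right),$$
$$\chi_{1/2} = \tfrac12[\chi(A,A) - \chi(P,A)] = \tfrac12\left(\sqrt{\vartheta_3/\eta} - \sqrt{\vartheta_4/\eta}\right),$$
$$\chi_{1/16} = \tfrac12[\chi(A,P) \pm \chi(P,P)] = \tfrac{1}{\sqrt2}\sqrt{\vartheta_2/\eta}. \qquad (4.10.5)$$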





The object of this exercise is to write a modular invariant combination of 
characters. Since a modular transformation reverses the a and b cycles of a 
torus, it is easy to calculate how the characters transform. Thus, a modular 
invariant combination is given by: 

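$$Z_{\mathrm{Ising}} = \tfrac12\left(|\vartheta_3/\eta| + |\vartheta_4/\eta| + |\vartheta_2/\eta| \pm |\vartheta_1/\eta|\right)$$
$$\phantom{Z_{\mathrm{Ising}}} = \chi_0\bar\chi_0 + \chi_{1/2}\bar\chi_{1/2} + \chi_{1/16}\bar\chi_{1/16}. \qquad (4.10.6)$$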
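
Modular invariance of this combination can be checked numerically. The sketch below evaluates the three characters from their infinite-product forms, which we take as standard inputs: $\chi_0 \pm \chi_{1/2} = q^{-1/48}\prod_n (1 \pm q^{n-1/2})$ and $\chi_{1/16} = q^{1/24}\prod_n (1 + q^n)$ with $q = e^{2\pi i\tau}$. The truncation at 60 factors and the sample value of $\tau$ are arbitrary choices.

```python
import cmath
import math


def ising_characters(tau, terms=60):
    """Characters of the c = 1/2 theory from truncated product formulas."""
    def q_power(x):
        # q**x with q = exp(2*pi*i*tau), computed without branch ambiguity
        return cmath.exp(2j * math.pi * tau * x)

    plus = minus = ramond = 1
    for n in range(1, terms + 1):
        plus *= 1 + q_power(n - 0.5)
        minus *= 1 - q_power(n - 0.5)
        ramond *= 1 + q_power(n)
    chi_0 = 0.5 * q_power(-1 / 48) * (plus + minus)
    chi_half = 0.5 * q_power(-1 / 48) * (plus - minus)
    chi_sixteenth = q_power(1 / 24) * ramond
    return chi_0, chi_half, chi_sixteenth


def ising_partition_function(tau):
    """Diagonal combination |chi_0|^2 + |chi_1/2|^2 + |chi_1/16|^2."""
    return sum(abs(chi) ** 2 for chi in ising_characters(tau))
```

Evaluating `ising_partition_function` at $\tau$, $\tau + 1$, and $-1/\tau$ should give the same number up to rounding error.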

Similarly, we can calculate the characters for the minimal models. This can 
be done by calculating the character for the irreducible Verma module and then 
successively subtracting out the states given by null modules. The answer, after 
an infinite series of subtractions, is 


$$\chi_{r,s} = q^{-(c-1)/24}\,\eta(q)^{-1}\sum_{k=-\infty}^{\infty}\left(q^{h_{2mk+r,s}} - q^{h_{2mk+r,-s}}\right). \qquad (4.10.7)$$

Likewise, we can calculate the properties of the minimal model's characters 
by making modular transformations on them: 

$$T:\ \chi_h(q) \to e^{2\pi i(h - c/24)}\chi_h(q). \qquad (4.10.8)$$

A bit more difficult is the calculation of the character under an S transformation: 

$$S:\ \chi_{rs}(q) \to \sum_{p,q} S_{rs}^{pq}\,\chi_{pq}(q), \qquad (4.10.9)$$

where 

$$S_{rs}^{pq} = \sqrt{\frac{8}{m(m+1)}}\,(-1)^{(r+s)(p+q)}\sin\frac{\pi rp}{m}\,\sin\frac{\pi sq}{m+1}. \qquad (4.10.10)$$

It is now easy to calculate some modular invariant combinations of 
characters. For example, taking a diagonal combination of characters (as in the free 
fermion case) yields a modular invariant. Less obvious are combinations such 
as 

$$Z = \frac12\sum_{r}\sum_{s\ \mathrm{odd}}\left|\chi_{rs} + \chi_{r,\,m+1-s}\right|^2. \qquad (4.10.11)$$

To find more general modular invariant combinations, it is useful to use the 
characters of the Kac-Moody algebras, which can be defined as 

$$\chi_\lambda(\theta, \tau) = q^{-c/24}\,\mathrm{Tr}_{[\lambda]}\left(q^{L_0}e^{i\theta_i H^i}\right). \qquad (4.10.12)$$

Notice that it has more parameters because we can trace over both $L_0$ and the 
Cartan subalgebra. 

The key to constructing these characters is the Weyl-Kac formula, which 
is a generalization of the classical Weyl formula, which allows us to calculate 
the dimension of any representation of any Lie group. 





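The Weyl-Kac character formula is 

$$\mathrm{ch}\,L(\Lambda) = \frac{\sum_{w \in W}\epsilon(w)\,e^{w(\Lambda+\rho)-\rho}}{\sum_{w \in W}\epsilon(w)\,e^{w(\rho)-\rho}}, \qquad (4.10.13)$$

where we sum over Weyl reflections of the root vectors. These reflections are 
defined as 

$$w_\alpha(\lambda) = \lambda - \alpha(\lambda, \alpha), \qquad (4.10.14)$$

where the Weyl rotation operator $w_\alpha$ within W is associated with the root $\alpha$. 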

When applied to $SU(2)_k$, the Weyl group reduces trivially to reflections 
around the L$ axis and translations along this axis. Thus, we find an explicit 
form for the characters 

$$\chi_\lambda(\tau) = \eta^{-3}(\tau)\sum_{n=-\infty}^{\infty}\left[2n(k+2) + (2j+1)\right]e^{i\pi\tau[2n(k+2)+2j+1]^2/2(k+2)}, \qquad (4.10.15)$$

where $\lambda = 2j + 1$. 

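It is now a straightforward exercise to make modular transformations on this 
character and obtain a representation for the T and S matrices: 

$$S^{(k)}_{jj'} = \sqrt{\frac{2}{k+2}}\,\sin\frac{\pi(2j+1)(2j'+1)}{k+2},$$
$$T^{(k)}_{jj'} = \exp\left\{2\pi i\left[\frac{(2j+1)^2}{4(k+2)} - \frac18\right]\right\}\delta_{jj'}, \qquad (4.10.16)$$

with $j, j' = 0, \ldots, k/2$. 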
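
As a consistency check on these matrices, the sketch below builds S and T for $SU(2)_k$ numerically and tests the modular relations $S^2 = 1$ and $(ST)^3 = 1$. It assumes the normalization $\sqrt{2/(k+2)}$ for S and the diagonal phase $\exp\{2\pi i[(2j+1)^2/4(k+2) - 1/8]\}$ for T, and labels the fields by a = 2j.

```python
import cmath
import math


def su2_modular_matrices(k):
    """Assumed S and T matrices of SU(2)_k, indexed by a = 2j = 0, ..., k."""
    n = k + 1
    norm = math.sqrt(2.0 / (k + 2))
    S = [[norm * math.sin(math.pi * (a + 1) * (b + 1) / (k + 2))
          for b in range(n)] for a in range(n)]
    T = [[cmath.exp(2j * math.pi * ((a + 1) ** 2 / (4 * (k + 2)) - 1 / 8))
          if a == b else 0 for b in range(n)] for a in range(n)]
    return S, T


def matmul(A, B):
    """Product of two square matrices given as nested lists."""
    n = len(A)
    return [[sum(A[i][m] * B[m][j] for m in range(n)) for j in range(n)]
            for i in range(n)]


def deviation_from_identity(M):
    """Largest absolute entry of M minus the identity matrix."""
    n = len(M)
    return max(abs(M[i][j] - (1 if i == j else 0))
               for i in range(n) for j in range(n))


def check_modular_relations(k):
    """Return the deviations of S^2 and (ST)^3 from the identity."""
    S, T = su2_modular_matrices(k)
    ST = matmul(S, T)
    return (deviation_from_identity(matmul(S, S)),
            deviation_from_identity(matmul(ST, matmul(ST, ST))))
```

Both deviations should be at the level of rounding error for every level k, reflecting the fact that the $SU(2)_k$ representations are self-conjugate.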

The modular invariants that we can construct for the characters for the 
minimal model and $SU(2)_k$ have a mysterious regularity. In particular, we can place 
them in one-to-one correspondence with the simply laced A-D-E Lie groups. 
The origin of this correspondence is not well understood. 

Because the S matrices contain a large amount of information concerning 
the conformal field theory, we suspect that many of the properties of the field 
theory can be rewritten in terms of the S matrix. Specifically, we find that the 
coefficients found in the fusion rules can be written in terms of the S matrix: 

$$N_{ij}^{\,k} = \sum_n \frac{S_i^{\,n}\,S_j^{\,n}\,(S^{\dagger})_n^{\,k}}{S_0^{\,n}}. \qquad (4.10.17)$$

We see that the S matrix diagonalizes the fusion rules. Because of this 
relationship, we can independently check many of our results previously obtained 
for the fusion rules by inserting the S matrix into the above equation. For 
example, we can derive the fusion rules for $SU(2)_k$: 

$$[\phi_j] \times [\phi_{j'}] = \sum_{j''=|j-j'|}^{\min(j+j',\,k-j-j')} [\phi_{j''}]. \qquad (4.10.18)$$





References 


1. J. Shapiro, Phys. Rev. D5, 1945 (1972). 

2. K. Kikkawa, B. Sakita, and M. A. Virasoro, Phys. Rev. 184, 1701 (1969). 

3. C. S. Hsue, B. Sakita, and M. A. Virasoro, Phys. Rev. D2, 2857 (1970). 

For reviews, see Refs. 4 to 6. 

4. P. Ginsparg, in Fields, Strings, and Critical Phenomena, Elsevier, Amsterdam 
(1989). 

5. J. L. Cardy, in Fields, Strings, and Critical Phenomena, Elsevier, Amsterdam 
(1989). 

6. J.-B. Zuber, in Fields, Strings, and Critical Phenomena, Elsevier, Amsterdam (1989). 

7. A. Rocha-Caridi, in Vertex Operators in Mathematics and Physics, J. Lepowsky, 
S. Mandelstam, and I. M. Singer, eds., Springer-Verlag, Berlin (1984). 

8. A. Rocha-Caridi and N. R. Wallach, Math. Z. 185, 1 (1984). 

9. J. L. Cardy, J. Phys. A17, L385 (1984). 

10. J. L. Cardy, Nucl. Phys. B270 [FS16], 186 (1986); B275, 200 (1986). 

11. D. Friedan and S. Shenker, Nucl. Phys. B281, 509 (1987). 

12. D. Kastor, Nucl. Phys. B280, 304 (1987). 

13. Y. Matsuo and S. Yahikozawa, Phys. Lett. 178B, 211 (1986). 

14. A. Erdelyi, Higher Transcendental Functions, McGraw-Hill, New York (1953). 

15. F. Gliozzi, J. Scherk, and D. Olive, Nucl. Phys. B122, 443 (1983). 

16. V. G. Kac and D. Peterson, Adv. in Math. 53, 125 (1984). 

17. D. Gepner and E. Witten, Nucl. Phys. B278, 493 (1986). 

18. A. Cappelli, C. Itzykson, and J.-B. Zuber, Nucl. Phys. B280, 445 (1987). 

19. A. Cappelli, C. Itzykson, and J.-B. Zuber, Comm. Math. Phys. 113, 1 (1987). 

20. D. Gepner, Nucl. Phys. B280 [FS18], 445 (1987). 

21. C. Itzykson, Nucl. Phys. Suppl. 5B, 150 (1988). 

22. A. Kato, Mod. Phys. Lett. B3, 3918. 

23. A.N. Schellekens and S. Yankielowicz, Phys. Lett. 227B, 387 (1989). 

24. E. Verlinde, Nucl. Phys. B300, 493 (1988); R. Dijkgraaf and E. Verlinde, Nucl. 
Phys. Suppl. 5B, 110 (1988). 

25. G. Moore and N. Seiberg, Phys. Lett. 212B, 451 (1988). 

26. C. Vafa, Phys. Lett. 300B, 360 (1988); G. Anderson and G. Moore, Comm. Math. 
Phys. 117, 441 (1988). 

27. G. Moore and N. Seiberg, Lectures on RCFT, 1989 Trieste Summer School. 



CHAPTER 5 


N = 2 SUSY and Parafermions 


5.1 Calabi-Yau Manifolds 

As we have seen, the critical dimension for the bosonic (super)string is 26 (10); 
therefore, we must compactify the extra dimensions so that we have an 
acceptable four-dimensional phenomenology. Because, to any order in perturbation 
theory, the dimension of space-time seems perfectly stable, we must 
necessarily resort to nonperturbative methods to compactify the unwanted dimensions. 
However, our techniques for analyzing nonperturbative phenomena are 
notoriously primitive, and at present there is no way in which nonperturbative 
phenomena can be systematically analyzed for the string. 

Historically, this undesirable situation may be compared with the 
development of gauge theory itself. Gauge theory was first formulated (using 
Kaluza-Klein methods) by O. Klein in the 1930s. Its present-day incarnation is 
due to Yang and Mills, who reformulated the theory in 1954. However, because 
local gauge invariance was unbroken, it meant that the vector particles were 
massless and hence unacceptable for any weak interaction phenomenology. 
The mathematical mechanisms necessary to convert the theory into a useful 
phenomenological tool were unavailable in the 1950s. 

It was not until 20 years later, with the development of spontaneous 
symmetry breaking, the Higgs mechanism, and the renormalization group, that an 
acceptable phenomenology became possible. We realize now that gauge theory 
is the fundamental theory of all particle interactions. The method of 
spontaneous symmetry breaking made it possible to use $SU(2) \otimes U(1)$ to describe the 
electro-weak interactions, and the method of the renormalization group made 
it possible to use SU(3) color to describe the strong interactions. 

Today, we may be in a similar situation, where the mathematical tools to 
break string theory down to four dimensions are simply not available. We 






will, therefore, have to make certain simple and natural assumptions, without 
any justification whatsoever. What is remarkable is that, with a few simplifying 
assumptions, a rich phenomenology that comes remarkably close to describing 
the real, low-energy world is possible. 

At present, compactification schemes [1] come in a bewildering variety of 
forms, which have given us a rich source of phenomenology: 

(1) free fermions and free bosons [2-5]; 

(2) orbifolds [6]; and 

(3) Calabi-Yau manifolds [7, 8]. 

In this chapter, we will concentrate on the last scheme, the Calabi-Yau 
manifolds. 

To see how these manifolds enter into our discussion, our first assumption is 
that 10-dimensional space-time compactifies to some maximally symmetric 
manifold $M_4$, which satisfies 

$$R_{\mu\nu\alpha\beta} = (R/12)(g_{\mu\alpha}g_{\nu\beta} - g_{\mu\beta}g_{\nu\alpha}) \qquad (5.1.1)$$

and some six-dimensional compact manifold, called K, so that 

$$M_{10} \to M_4 \otimes K. \qquad (5.1.2)$$

The second assumption is that N = 1 space-time supersymmetry survives 
the compactification process. At present, there is absolutely no physical 
evidence for supersymmetry. The bosonic partners of the quarks, neutrino, or 
electron have never been seen. However, supersymmetry is highly desirable 
phenomenologically because it solves the "hierarchy" problem, which plagues any 
unified theory of strong, weak, and electromagnetic interactions. (Briefly, we wish 
to preserve two different energy scales in any grand unified theory (GUT): the 
GUT energy where unification takes place, which is just short of the Planck 
energy, and the energy found in our own low-energy world. However, 
higher-order Feynman diagrams will mix these two scales, so the masses of the 
particles will be unacceptably renormalized. We need a new symmetry, 
containing the scalar Higgs particles, to keep these two energy scales from mixing. 
Only supersymmetry, which can put scalar particles and fermions in the same 
multiplet, can perform this feat.) 

For our third assumption, we will postulate the vanishing of certain 
fields, which we shall not need in our discussion. 

Given these three natural assumptions, we find some powerful results: 

(1) The manifold K is, in fact, a Calabi-Yau manifold, and $M_4$ is a Minkowski 
space. It is possible to find Calabi-Yau manifolds that reproduce the 
necessary $SU(3) \otimes SU(2) \otimes U(1)$ low-energy symmetry group. 

(2) There must be a hidden, global, N = 2 superconformal symmetry. This 
N = 2 symmetry, in turn, is the key to giving us concrete examples of 
conformal field theories compactified on Calabi-Yau manifolds. 





At first, the assumption that N = 1 space-time supersymmetry in four 
dimensions survives the compactification process seems to be an inconsequential 
one, without much physical content. However, we will shortly see that it is 
extremely powerful and gives us enormous restraints on the nature of the four- 
and six-dimensional manifolds. 

Let us begin by analyzing the transformation properties of the massless 
sector of superstring theory [7]. The spin-3/2 particle, the gravitino, which 
is the supersymmetric partner of the graviton, transforms as follows under 
supersymmetry: 

$$\delta\psi_i = \kappa^{-1}D_i\epsilon + \cdots, \qquad (5.1.3)$$

where i = 1, 2, ..., 6. Because the supersymmetric generator Q annihilates 
the vacuum, the vacuum expectation value of $\delta\psi_i$ must vanish: 

$$\langle 0|\delta\psi_i|0\rangle = 0. \qquad (5.1.4)$$

In the classical limit, the variation of the fermionic field and its vacuum 
expectation value are the same, so we now have 

$$\delta\psi_i = \kappa^{-1}D_i\epsilon + \cdots = 0. \qquad (5.1.5)$$

Therefore, $\epsilon$ is a covariantly constant spinor. Usually, in flat space, to say 
that a scalar field is covariantly constant means that it is a constant. However, 
for spinors in curved space, this is not so; being covariantly constant places 
restrictions on the spin connection. 

Let us now take the derivative of this equation, and antisymmetrize. We find 

$$D_{[i}D_{j]}\epsilon \sim R_{ijkl}\Gamma^{kl}\epsilon = 0, \qquad (5.1.6)$$

where the $\Gamma$ are Dirac matrices in six dimensions. By contracting indices, we 
can, in turn, show that 

$$R_{ij}\Gamma^{j}\epsilon = 0, \qquad (5.1.7)$$

that is, the manifold K has a Ricci flat curvature $R_{ij} = 0$. 

Furthermore, the fact that $\epsilon$ is covariantly constant means that there is a 
preferred direction in the six-dimensional tangent space. For example, if we 
take an unconstrained spinor and move it by parallel displacement, we pick 
up a factor $D_i\epsilon$. If we take this spinor and move it completely around in a 
circle, we pick up $D_{[i}D_{j]}\epsilon$. Thus, by being parallel displaced around a circle, 
the spinor has rotated by a certain angle. If we make repeated circular paths, 
each time coming back to our starting point, then we will generate a group 
of displacements, which is nothing but SO(6), which is isomorphic to SU(4). 
The group SO(6) is called the holonomy group of the manifold. 

Normally, a spinor with SO(6) symmetry has eight components. However, 
this can be decomposed as $8 = 4 \oplus \bar 4$. Under SU(4), these two quartets 
transform with opposite chirality, so we will only take one of them. 

The next question is what is the largest subgroup of SU(4) that leaves 
invariant the 4 of SU(4)? By an SU(4) transformation, we can always put $\epsilon$ into 





the following form: 

$$\epsilon = \begin{pmatrix} 0 \\ 0 \\ 0 \\ \epsilon_0 \end{pmatrix}. \qquad (5.1.8)$$

It is now obvious that the largest subgroup of SU(4) that leaves $\epsilon$ invariant is 
the 3 x 3 subgroup of SU(4), that is, SU(3). Furthermore, out of the covariantly 
constant spinor, we can show that the following object is a true tensor: 

$$J^i{}_j = -i g^{ik}\,\bar\epsilon\,\Gamma_{kj}\,\epsilon. \qquad (5.1.9)$$

This tensor plays a key role in the analysis of manifolds, because it has 
interesting properties: 

$$J^i{}_j J^j{}_k = -\delta^i_k, \qquad D_l J^i{}_j = 0. \qquad (5.1.10)$$

The first property of the tensor $J^i{}_j$ is analogous to the number i found in 
ordinary complex variable theory, which squares to -1. In fact, whenever one 
can write the tensor $J^i{}_j$ on a manifold, whose square is -1, it means that the 
manifold is almost complex. (To be fully complex, one has to show that all its 
transition functions are holomorphic.) 

The second property of this tensor, that its covariant derivative is zero, means 
that the manifold is Kahler, that is, its metric (in complex coordinates) can 
always be written as the derivative of a single potential function: 

$$g_{i\bar j} = \frac{\partial^2\Phi(z_k, \bar z_k)}{\partial z_i\,\partial\bar z_j}. \qquad (5.1.11)$$


Thus, from the rather simple assumptions we mentioned earlier, we 
conclude that the manifold K possesses a large set of stringent properties, which 
collectively identifies it as a Calabi-Yau manifold [7, 8]. 

In general, these Calabi-Yau manifolds are quite complicated. However, 
it is possible to write examples of such manifolds and give their topological 
invariants. One way of constructing such manifolds with SU(3) holonomy 
is to consider the complex projective space $CP_N$, which is a complex 
(N + 1)-dimensional space, where the points $Z_i$ are identified with $\lambda Z_i$ for some 
nonzero complex number $\lambda$. The simplest six-dimensional Calabi-Yau is then 
$CP_4$ (which is eight dimensional) with the following complex constraint: 

$$\sum_{i=1}^{5} Z_i^5 = 0. \qquad (5.1.12)$$

In this way, by taking $CP_N$ and properly placing enough constraints, one may 
obtain a series of Calabi-Yau manifolds. 

Although Calabi-Yau manifolds are in general exceedingly complicated, 
without explicit expressions for their metric tensor, what is remarkable is that 
one is able to compute many of their important phenomenological 
properties. We will conclude this section with a short discussion of how to compute 





one of the most important phenomenological properties of these manifolds, 
their Yukawa couplings [9-10]. From this, we can extract a vast number of 
phenomenological predictions from very general arguments. 

We will study one of the most interesting Calabi-Yau manifolds, due to Tian 
and Yau [11], which has precisely three generations of fermions. 

This manifold K is described by two sets of four complex coordinates $x_i$ 
and $y_i$ defined on $CP_3 \otimes CP_3$ (i.e., the point $x_i$ and $y_i$ is identified with $\lambda x_i$ and 
$\lambda' y_i$ for complex $\lambda$ and $\lambda'$). This space is subject to the constraints 

$$p_1 = \sum_{i=0}^{3} x_i^3 = 0, \qquad p_2 = \sum_{i=0}^{3} y_i^3 = 0, \qquad p_3 = \sum_{i=0}^{3} x_i y_i = 0. \qquad (5.1.13)$$

This space has Euler number -18. However, this number can be reduced by 
considering the $Z_3$ symmetry (for $\alpha = e^{2\pi i/3}$): 

$$(x_0, x_1, x_2, x_3) \to (x_0, \alpha^2 x_1, \alpha x_2, \alpha x_3),$$
$$(y_0, y_1, y_2, y_3) \to (y_0, \alpha y_1, \alpha^2 y_2, \alpha^2 y_3). \qquad (5.1.14)$$

Then the reduced manifold $K/Z_3$ has Euler number -18/3 = -6, which gives 
us three fermion generations. 
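
A quick way to see that the $Z_3$ action is a symmetry of the constraints is to track only the power of $\alpha$ picked up by each monomial. In the sketch below, each coordinate is assigned the exponent of $\alpha$ by which it is multiplied; the exponents for the $y_i$ are an assumption, chosen so that every term $x_i y_i$ of $p_3$ is invariant. The last helper uses the standard relation that the number of generations is half the absolute value of the Euler number, which is consistent with the counting quoted above.

```python
# Exponents of alpha = exp(2*pi*i/3) acting on (x0, x1, x2, x3).
X_CHARGES = (0, 2, 1, 1)
# Exponents acting on (y0, y1, y2, y3); assumed so that each x_i * y_i is invariant.
Y_CHARGES = (0, 1, 2, 2)


def constraint_charges():
    """Z_3 charges (mod 3) of the monomials appearing in p1, p2, and p3."""
    p1 = {(3 * a) % 3 for a in X_CHARGES}                      # terms x_i^3
    p2 = {(3 * b) % 3 for b in Y_CHARGES}                      # terms y_i^3
    p3 = {(a + b) % 3 for a, b in zip(X_CHARGES, Y_CHARGES)}   # terms x_i y_i
    return p1, p2, p3


def generations(euler_number):
    """Number of fermion generations, taken as |Euler number| / 2."""
    return abs(euler_number) // 2
```

All three sets returned by `constraint_charges()` contain only the charge 0, so each constraint is invariant, and `generations(-18 // 3)` reproduces the three generations.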

Since it is difficult to calculate the Yukawa couplings without a knowledge 
of the explicit form of the metric tensor, we use a trick, exploiting the fact that 
one can write everything in terms of the three constraint equations $p_i$. 

To extract the Yukawa couplings, we examine the low-energy limit of the 
heterotic superstring theory, which yields ordinary supergravity coupled to 
matter fields. We will find that the Yukawa couplings are contained within the 
point-particle supergravity fermion-boson couplings $\bar\psi\gamma\cdot D\psi$. Written out 
explicitly, this coupling is given by 

$$L = \int d^{10}w\,\sqrt{g}\,\bar\psi^A\gamma^m\psi^B A_m^C f_{ABC}, \qquad (5.1.15)$$

where A, B, C are $E_8 \otimes E_8$ indices and m is a 10-dimensional Lorentz index. 

A vast number of simplifications occurs when we power expand this 
expression in terms of harmonics defined on the product manifold $K \otimes M_4$. The 
original 10-dimensional space, labeled by w, splits into the four-dimensional x 
space and six-dimensional y space. Therefore, the zero modes of a field A(w) 
can be power expanded as 

$$A(w) = \sum_i A^i(x) \otimes A_i(y), \qquad (5.1.16)$$

where we sum over harmonics. Furthermore, because of supersymmetry, we 
can, to lowest order, write $\psi$ in terms of the vector field via $\psi_a = A_{am}\gamma^m\xi_+$, 
where $\gamma\xi_\pm = \pm\xi_\pm$ and $\gamma = i\gamma_5\cdots\gamma_{10}$. 





Splitting the x and y integration, we have 

$$L = g_{ijk}\int d^4x\,\cdots,$$
$$g_{ijk} = \int d^6y\,\sqrt{g}\,\omega^{\bar m\bar n\bar p}A_{\bar m ai}A_{\bar n bj}A_{\bar p ck}\,\epsilon^{abc} = \int \omega \wedge A_{ai} \wedge A_{bj} \wedge A_{ck}\,\epsilon^{abc}, \qquad (5.1.17)$$

where $A_{ai} = A_{\bar m ai}\,dz^{\bar m}$ and where $\omega_{mnp} = \bar\xi_+\gamma_m\gamma_n\gamma_p\xi_+$. A, B, C are now $E_6$ 
indices in the 27 representation, and $d_{ABC}$ is the symmetric cubic invariant in the 
27 of $E_6$. The indices a, b, c are SU(3) tangent space indices; m and $\bar m$ represent 
the space indices of the six-dimensional manifold. 

Notice that all dependence on the fermion field $\psi$ has been collected into 
the term $\omega$, so that the dependence of the Yukawa coupling $g_{ijk}$ rests entirely 
on the gauge field $A_{ai}$. 

Although the problem of calculating the Yukawa coupling $g_{ijk}$ at first seems 
intractable for an arbitrary Calabi-Yau manifold, we can use several more 
tricks. First, the gauge field $A^a$ is a closed one-form, modulo exact forms, and 
hence spans the cohomology space $H^1$. Our task is to rewrite these gauge fields 
in terms of the topological properties of the manifold. To do this, we will use 
a result from deformation theory [10]. 

The key observation from deformation theory is that there is a one-to-one 
correspondence between the elements of $H^1$ and linearly independent 
polynomials or deformations $q^a$ that one can add to the defining polynomials $p_i$. (This 
correspondence can be seen by noting that different choices for the defining 
polynomials give rise to physically distinct but topologically equivalent vacua.) 

Remarkably, we can rewrite the gauge fields $A^a$ in terms of these 
polynomials: 

$$A^a = \chi^a{}_{\bar b c}\,q^c\,dx^{\bar b}, \qquad (5.1.18)$$

where $\chi^a{}_{\bar b c}$ is the extrinsic curvature (which is a known function of the constraint 
polynomials $p^a$). One can show that the right-hand side is a closed one-form 
and also spans $H^1$. We have now made the crucial transition, expressing the 
gauge field $A^a$ in terms of purely geometric quantities, that is, the polynomials 
$q^a$. 

There is some arbitrariness, of course, in how we choose the $q^a$. Since the
physics remains the same when we make a gauge transformation on the gauge
field, the physics must also remain the same if we make a certain change in
the polynomial $q^a$. More precisely, we can always maintain the properties of
$H^1$ by adding to $q^a$ any linear combination of the original constraints $p^a$ and
their derivatives $p^a{}_{,A}$ with respect to $x_i$ and $y_i$:

$$q^a \sim q^a + \lambda^A p^a{}_{,A} + c^{ab}p^b, \tag{5.1.19}$$

where $\lambda^A$ and $c^{ab}$ are constant coefficients.

For the case in question, we find that there are nine linearly independent $q^a$.





Now let us insert the expression for the three gauge fields appearing in Eq.
(5.1.18) in terms of three polynomials $q^a$, $r^a$, and $s^a$ into the definition of
the Yukawa couplings appearing in Eq. (5.1.17). Since we are only interested
in the ratio between different Yukawa couplings, we can factor out
unessential terms, so that Eq. (5.1.17) becomes

g U k~ J ?W4 C -- (5-1.20) 

All Yukawa couplings are now defined in terms of the symmetrized product
of three polynomials $q^{(a}r^bs^{c)}$. Because of the freedom in choosing
$q^a$, we can always add combinations of the original constraints $p^a$ and their
derivatives $p^a{}_{,A}$ to this symmetrized polynomial and still preserve the desired
properties:

$$q^{(a}r^bs^{c)} \sim q^{(a}r^bs^{c)} + \lambda^{A(ab}p^{c)}{}_{,A} + c^{abc}{}_{d}\,p^d. \tag{5.1.21}$$

In general, different choices of the polynomials $q^{(a}r^bs^{c)}$ yield different
Yukawa potentials. However, by repeatedly using these equivalence relations,
we find that they can all be reduced to the same polynomial, given by $\prod_{i=0}^{3}x_iy_i$.
In other words, no matter which combination of symmetric polynomials we
started with, we can use the equivalence relation to reduce them to the same
polynomial, times a constant $\kappa$:

$$q^{(a}r^bs^{c)} \sim \kappa(q,r,s)\prod_{i=0}^{3}x_iy_i. \tag{5.1.22}$$

Notice that the Yukawa coefficients are now just encoded within the
$\kappa(q,r,s)$. Different choices for the symmetric polynomials correspond to
different $\kappa(q,r,s)$.

Our strategy to calculate numerical values for the Yukawa coefficients is now
as follows. A particular choice of lepton and meson fields yields a particular
choice of polynomials $q^a$, $r^a$, $s^a$. We then take the symmetrized product of
these three polynomials, use equivalence relations to reduce it down to the
monomial $\prod_i x_iy_i$, and then calculate the coefficient $\kappa(q,r,s)$. In this way, we
can calculate the numerical ratio between any two Yukawa couplings, which
was our goal. (There is one technical point: we must also calculate the kinetic
terms for each of these fields and diagonalize and normalize them properly.
This is easily done by a simple normalization of the fields.)

In summary, we have now given a geometrical derivation of the Yukawa couplings
without using an explicit form for the metric tensor of the Calabi-Yau
space. The Yukawa couplings $g_{ijk} \sim \kappa(q,r,s)$ depend only on the constraint
polynomials $p^a$, which define the space, and the particular choice of polynomials
$q^a$, which define the isospin of the field we are analyzing. From these
couplings, of course, we can extract a wealth of phenomenological information
about the theory.





As one can see, the Calabi-Yau manifolds are quite complicated and difficult
to construct because of their high degree of nonlinearity. Originally, it
was thought that this complication would prevent any simple analysis of their
conformal properties. Thus, it was quite remarkable when Gepner used a naive
tensoring of the $N = 2$ minimal series to generate conformal field theories
that had all the properties of the Calabi-Yau compactification. A seemingly
intractable, nonlinear problem was reduced to simple representations of $N = 2$
superconformal models. In this chapter, we will discuss the ramifications of
this result. The calculation of Kazama and Suzuki, for example, shows that
Gepner's original construction could be generalized to yield $N = 2$ models
with the gauge group $E_8 \otimes E_6$, which is phenomenologically desirable.


5.2 N = 2 Superconformal Symmetry 

Although the $N = 2$ superconformal symmetry is not a symmetry of physical
states, the study of such models has proven to be a key aspect of realistic
phenomenology. The $N = 2$ superconformal algebra differs from the usual
$N = 1$ algebra in that there are now two distinct fermionic components to the
energy-momentum tensor, $G_\pm(z)$, and that there is also another $U(1)$ current
$J(z)$. The operator product expansion is given by

$$\begin{aligned}
T(w)T(z) &\sim \frac{c/2}{(w-z)^4} + \frac{2T(z)}{(w-z)^2} + \frac{\partial T(z)}{w-z},\\
T(w)J(z) &\sim \frac{J(z)}{(w-z)^2} + \frac{\partial J(z)}{w-z},\\
T(w)G_\pm(z) &\sim \frac{3G_\pm(z)}{2(w-z)^2} + \frac{\partial G_\pm(z)}{w-z},\\
J(w)J(z) &\sim \frac{c/3}{(w-z)^2},\\
J(w)G_\pm(z) &\sim \pm\frac{G_\pm(z)}{w-z},\\
G_+(w)G_-(z) &\sim \frac{2c/3}{(w-z)^3} + \frac{2J(z)}{(w-z)^2} + \frac{2T(z)+\partial J(z)}{w-z}.
\end{aligned} \tag{5.2.1}$$

Written in components, this algebra becomes

$$\begin{aligned}
[L_n, L_m] &= (n-m)L_{n+m} + \tfrac14\tilde c\,(n^3-n)\,\delta_{n+m,0},\\
[L_n, G^i_r] &= (n/2-r)\,G^i_{n+r},\\
[L_m, J_n] &= -n\,J_{m+n},\\
[J_m, J_n] &= \tilde c\,m\,\delta_{m,-n},\\
[J_m, G^i_n] &= i\epsilon^{ij}G^j_{m+n},\\
\{G^i_r, G^j_s\} &= 2\delta^{ij}L_{r+s} + i\epsilon^{ij}(r-s)J_{r+s} + \tilde c\,[r^2-(1/4)]\,\delta^{ij}\delta_{r+s,0},
\end{aligned} \tag{5.2.2}$$

where $\tilde c = c/3$ and $G_\pm \sim G^1 \pm iG^2$.

Normally, we consider $N = 2$ superconformal symmetry to be unphysical.
However, we now come to the interesting but unexpected observation that
$N = 1$ (space-time) supersymmetry can give rise to a global $N = 2$ (world sheet)
superconformal symmetry [12-15]. Since $N = 1$ space-time supersymmetry
is the key to modular invariance (via the GSO projection), we see that
$N = 2$ superconformal symmetry may play a key role in constructing physically
realistic conformal field theories. Not only does the $N = 2$ theory arise from
$N = 1$ space-time supersymmetry, but it is also useful for compactifying on
Calabi-Yau manifolds.

We recall that the NS-R superstring has local $N = 1$ superconformal symmetry
and $N = 1$ space-time supersymmetry. However, after compactification,
we can show that there is a hidden $N = 2$ superconformal symmetry that
emerges. We can see this in several ways, at the level of operators [14] or at
the level of the sigma model [15].

First, let us show operatorially how this hidden $N = 2$ superconformal
symmetry emerges. This operator approach is rather interesting, because
this hidden $N = 2$ symmetry emerges rather unexpectedly when one starts
with operators transforming only under the smaller $N = 1$ superconformal
symmetry.

Let us write the operator expression for the $N = 1$ space-time
supersymmetry operators after compactification:

$$Q_{-1/2;\alpha}(z) = e^{-\phi/2}S_\alpha\Sigma(z), \qquad Q_{-1/2;\dot\alpha}(z) = e^{-\phi/2}S_{\dot\alpha}\Sigma^\dagger(z), \tag{5.2.3}$$

where

$$S_\alpha = e^{i\alpha H}, \qquad S_{\dot\alpha} = e^{i\dot\alpha H}, \tag{5.2.4}$$

and where

$$\alpha = (\pm\tfrac12, \pm\tfrac12), \qquad \dot\alpha = (\pm\tfrac12, \mp\tfrac12). \tag{5.2.5}$$

The four-dimensional $S_\alpha$ only uses two bosonized $H_i$ fields, and hence it
is a reduction of the spinor that we originally used to establish the full
10-dimensional space-time supersymmetry, which required five bosonized fields
in Eq. (3.2.13). Thus, these reduced spinors have conformal weight $\tfrac14$. The
other three bosons $H_i$ contained in the original spinor make up the new field
$\Sigma$. Since the weight of $e^{q\phi}$ is $-\tfrac12 q(q+2)$ [see Eq. (3.2.15)], and the full
operator must have weight 1, the $\Sigma$ fields have weight $\tfrac38$.

Let us now insert these expressions for the four-dimensional spinor $Q$ into
the proper relations for the superalgebra. To preserve supersymmetry, we find
[14]:

$$\Sigma(z)\Sigma^\dagger(w) \sim (z-w)^{-3/4}\,I + (z-w)^{1/4}\,\tfrac12 J(w), \tag{5.2.6}$$

where $J(z)$ is a new field, with weight 1. This new field is the candidate for a
$U(1)$ field, which is necessary to build up the $N = 2$ superconformal symmetry.

Thus, although we started with $N = 1$ superconformal fields, the larger set of
$N = 2$ superconformal fields arose from the product of $N = 1$ superconformal
fields, if we demand $N = 1$ space-time supersymmetry.
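
The weight bookkeeping behind these supersymmetry operators can be checked with exact rational arithmetic. The following is only a sketch under stated assumptions: it takes the weight of $e^{i\alpha H}$ for a single free boson to be $\alpha^2/2$, the weight of $e^{q\phi}$ to be $-\tfrac12 q(q+2)$ as quoted in the text, and it assumes that $\Sigma$ carries charge $\pm\tfrac12$ under each of the three internal bosons. The function names are illustrative, not taken from the text.

```python
from fractions import Fraction

def boson_exponential_weight(alpha):
    """Weight of exp(i*alpha*H) for one free boson H (assumed: alpha^2 / 2)."""
    return alpha * alpha / 2

def ghost_exponential_weight(q):
    """Weight of exp(q*phi) for the bosonized ghost, -q(q+2)/2 as quoted in the text."""
    return -q * (q + 2) / 2

half = Fraction(1, 2)

# S_alpha uses two bosons H_i, each with charge +-1/2.
spinor_weight = 2 * boson_exponential_weight(half)
# Sigma is assumed to use the remaining three bosons, each with charge +-1/2.
sigma_weight = 3 * boson_exponential_weight(half)
# Ghost factor exp(-phi/2).
ghost_weight = ghost_exponential_weight(-half)

total_weight = spinor_weight + sigma_weight + ghost_weight
print(spinor_weight, sigma_weight, ghost_weight, total_weight)
```

Under these assumptions the three pieces add up to weight 1, as required for a current whose contour integral is a conserved charge.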

The operator product of this new field $J(z)$ with the supersymmetric current
gives us a new field $T'_F$ with weight $\tfrac32$:

$$J(z)T_F(w) \sim \frac{T'_F(w)}{z-w}. \tag{5.2.7}$$

Now that we have two fields with weight $\tfrac32$, let us define two fields $T_F^\pm$ by

$$T_F = \frac{1}{\sqrt2}\,(T_F^+ + T_F^-), \qquad T'_F = \frac{1}{\sqrt2}\,(T_F^+ - T_F^-). \tag{5.2.8}$$

By consistency of the algebra, we can then show

$$T_F^+(z)T_F^-(w) \sim \frac{\hat c/4}{(z-w)^3} + \frac{J(w)/2}{(z-w)^2} + \frac{T(w)/2 + \partial_wJ(w)/4}{z-w}, \tag{5.2.9}$$

where $\hat c = 2c/3$. By calculating the other commutators of the algebra, we find that we have the
full $N = 2$ commutation relations.

It may seem surprising that we could start with the usual $N = 1$ fields $T$ and
$T_F$ and the space-time supersymmetry operator, which contained $\Sigma$, and then
construct the $N = 2$ fields and their commutators. The key is that these other
operators arose from operator products of the known fields. The $U(1)$ field $J$
arose from the operator product of $\Sigma$ with $\Sigma^\dagger$, and a new field $T'_F$ arose from
the product of $\Sigma\,\Sigma^\dagger\,T_F$.

There is a second way [15] of seeing how this $N = 2$ superconformal
symmetry emerges from $N = 1$ space-time supersymmetry, and this is through
the sigma model. We begin by noticing that the supersymmetric generators,
like the fields, may be left moving or right moving. Thus, we may have $p$
positive chirality supersymmetries and $q$ negative chirality supersymmetries
in the same theory. We refer to this as $(p, q)$ supersymmetry.

We use, for example, (1, 0) supersymmetry (sometimes called $N = \tfrac12$
supersymmetry) to describe the right-handed supersymmetry of the heterotic
string. In this section, we will be concerned with generating a hidden (2, 0)
supersymmetry starting from an $N = 1$ space-time supersymmetry.

Let us begin with a real scalar superfield $\Phi(x, \theta)$, where the bosonic
coordinate is $x^\pm$ but there is only one positive-chirality Fermi coordinate, $\theta^+ = \theta$.
The generator of (1, 0) supersymmetry is then given by


= ^ + <5 ' 2 ' 10) 
Let us now construct a sigma model with this symmetry based on the
superfield

$$\Phi(x, \theta) = \phi(x) + \theta\lambda(x). \tag{5.2.11}$$




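Let us assume that we have an $N = 1$ superconformal action given by

$$S = -i\int d^2x\,d\theta\,[g_{ij}(\Phi) + b_{ij}(\Phi)]\,D\Phi^i\,\partial_-\Phi^j, \tag{5.2.12}$$

where $D$ is the supercovariant derivative

$$D = \frac{\partial}{\partial\theta} + i\theta\frac{\partial}{\partial x^+},$$

and $g_{ij}$ is symmetric and $b_{ij}$ is antisymmetric.

If we perform the $\theta$ integration, then we arrive at

$$S = \int d^2x\,\Big\{[g_{ij}(\phi) + b_{ij}(\phi)]\,\partial_+\phi^i\,\partial_-\phi^j + i\,g_{ij}(\phi)\,\lambda^i\big(\partial_-\lambda^j + \Gamma^j_{kl}\,\partial_-\phi^k\,\lambda^l\big)\Big\}, \tag{5.2.13}$$

where

$$\Gamma^i_{jk} = \left\{{i \atop jk}\right\} + g^{il}T_{ljk}, \tag{5.2.14}$$

$$T_{ijk} = \tfrac12\,(b_{ij,k} + b_{jk,i} + b_{ki,j}), \tag{5.2.15}$$

where $\left\{{i \atop jk}\right\}$ is the usual Christoffel connection, and $T_{ijk}$ is a totally
antisymmetric torsion.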

The action is invariant under the usual $N = 1$ supersymmetry

$$\delta\phi^i = \epsilon\lambda^i, \qquad \delta\lambda^i = -i\epsilon\,\partial_+\phi^i. \tag{5.2.16}$$

Assume, for the moment, that the action is also invariant under a chiral
symmetry

$$\delta\lambda^i = J^i{}_j(\phi)\,\lambda^j. \tag{5.2.17}$$

Then, the action is also invariant under a second, hidden superconformal
symmetry, given by

$$\delta\phi^i = \epsilon\,J^i{}_j(\phi)\,\lambda^j, \qquad \delta(J^i{}_j\lambda^j) = -i\epsilon\,\partial_+\phi^i. \tag{5.2.18}$$

But this is precisely the other supersymmetry transformation necessary to
obtain $N = 2$ supersymmetry. Therefore, the question is: what conditions do
we have to place on the manifold in order that we have chiral symmetry and
that this new symmetry anticommutes with the first?

In order for the two supersymmetries to anticommute, we must satisfy

$$J^i{}_jJ^j{}_k = -\delta^i{}_k, \qquad N^k{}_{ij} \equiv J^l{}_i\,\partial_{[l}J^k{}_{j]} - J^l{}_j\,\partial_{[l}J^k{}_{i]} = 0. \tag{5.2.19}$$

This tensor $N^k{}_{ij}$ is called the Nijenhuis tensor, and its vanishing means that
the manifold is complex. Also, for the action to be invariant under the chiral
transformation, we must impose

$$g_{kl}\,J^k{}_iJ^l{}_j = g_{ij}, \tag{5.2.20}$$

that is, the metric is Hermitian, and the complex structure is covariantly
constant

$$\nabla_kJ^i{}_j = J^i{}_{j,k} + \Gamma^i_{kl}J^l{}_j - \Gamma^l_{kj}J^i{}_l = 0. \tag{5.2.21}$$

If $b_{ij} = 0$, then $K$ is Kähler. If $b_{ij} \neq 0$, then $K$ is a Hermitian manifold with
torsion.

In conclusion, we see that there are now two supersymmetry generators that
anticommute if the background metric satisfies a few mild constraints. Thus,
$N = 2$ world sheet supersymmetry has emerged from $N = 1$ supersymmetry
at the level of the $\sigma$ model.
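
As a small sanity check of these conditions, one can verify them for the simplest possible background. The sketch below assumes flat $R^4$ with the identity metric, vanishing $b_{ij}$, and the standard constant complex structure (an illustrative choice, not one made in the text). It checks the usual almost-complex-structure condition $J^2 = -1$ and the Hermiticity condition $g_{kl}J^k{}_iJ^l{}_j = g_{ij}$; because this $J$ is constant and the connection vanishes, its derivatives, and hence the Nijenhuis tensor and $\nabla J$, vanish trivially.

```python
def matmul(a, b):
    """Product of two square matrices stored as nested lists."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def transpose(a):
    """Transpose of a matrix stored as nested lists."""
    return [list(row) for row in zip(*a)]

# J[i][j] stands for J^i_j: the standard complex structure on R^4 = C^2
# (an assumed example background).
J = [[0, -1, 0, 0],
     [1, 0, 0, 0],
     [0, 0, 0, -1],
     [0, 0, 1, 0]]

# Flat metric g_ij = delta_ij (assumed).
g = [[1 if i == j else 0 for j in range(4)] for i in range(4)]

# Almost complex structure: J^i_j J^j_k should equal -delta^i_k.
J_squared = matmul(J, J)

# Hermiticity: g_kl J^k_i J^l_j = (J^T g J)_ij should equal g_ij.
hermitian_check = matmul(transpose(J), matmul(g, J))

minus_identity = [[-g[i][j] for j in range(4)] for i in range(4)]
print(J_squared == minus_identity, hermitian_check == g)
```

Since $b_{ij} = 0$ here, this flat example falls in the Kähler case.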


5.3 N = 2 Minimal Series 

As before, we will briefly repeat the same steps used earlier to calculate the 
minimal series of the conformal and N = 1 superconformal series. Recall 
that we first write the Verma modules of the theory, consisting of all ladder 
operators hitting a vacuum state. Then, we take the matrix element of these 
Verma modules and form the Kac determinant. 

By analyzing the Kac determinant, we can determine for which values of $\tilde c$
and $h$ we have irreducible and reducible representations of the $N = 2$ algebra.
(We will use the convention that the $\tilde c$ appearing in the commutation relations
of the $N = 2$ algebra is related to the $c$ and $\hat c$ of the conformal and $N = 1$
algebras by the following: $\tilde c = c/3 = \hat c/2$.)

The irreducible representations usually have an infinite number of primary 
fields. However, we will be interested in the reducible series, where we have 
sequences of null states. By analyzing these representations, we can truncate 
to a finite number of primary fields. 

To construct the Verma modules of the algebra [16, 17], let us first observe
that there are three possible modings of the various oscillator states. In the NS
sector, the $L_n$ and $J_n$ are integer moded, while the $G^i_m$ are half-integral moded.
We will call this the A sector, for antiperiodic boundary conditions. For the R
sector, the $L_n$, $J_n$, and $G^i_m$ are all integral moded. We will call this the P sector,
for periodic boundary conditions. There is, however, a third possibility, and
that is a twisted case when only $L_n$ and $G^1_m$ are integral moded, and $J_m$ and
$G^2_m$ are half-integral moded. We will call this the T sector.

(We should mention that it is also possible to generalize the boundary conditions
so that we interpolate between the R and NS sectors. For example, we
can choose $L_n$ and $J_n$ to be periodic, but $m \in Z + \tfrac12 + \eta$ for $G^i_m$, where $\eta$
is between 0 and 1. Then $G^i(e^{2\pi i}z) = e^{2\pi i\eta}G^i(z)$. This boundary condition
is useful when studying the spectral flow between theories, since by varying
$\eta$ we can go between the R and NS sectors. We will not, however, study this
generalization.)

This means that we will have three sets of coefficients appearing in the
Kac determinant. Let us carefully define each set. Although the details of the
construction are quite messy, the final answer for the conformal weights of the
primary fields is quite simple. (The reader may skip to the end of this section,
where the main results are summarized in Eqs. (5.3.16)-(5.3.18).)

A Sector 


Let us define the A sector partition function $P_A$, which generalizes the usual
partition function for the bosonic or $N = 1$ string. Because we have two
supergenerators $G^i_{-n}$ acting on the highest weight vacuum state, the partition
function for an irreducible Verma module is more complicated than that of the
usual $N = 1$ theory. The partition function is defined by taking the coefficients
$P_A(n, m)$ of a double power expansion of the following power series:

$$\sum_{n,m}P_A(n,m)\,x^ny^m = \prod_{k=1}^{\infty}\frac{(1 + x^{k-1/2}y)(1 + x^{k-1/2}y^{-1})}{(1 - x^k)^2}. \tag{5.3.1}$$

We also introduce the coefficients $\tilde P_A$ and $\tilde P_P$:

$$\sum_{n,m}\tilde P_X(n,m;k)\,x^ny^m = \left[1 + x^{|k|}y^{\mathrm{sign}(k)}\right]^{-1}\sum_{n,m}P_X(n,m)\,x^ny^m, \tag{5.3.2}$$

where $X = A, P$ and

$$\mathrm{sign}(k) = +1,\ k > 0; \qquad \mathrm{sign}(k) = -1,\ k < 0; \qquad \mathrm{sign}(0) = \pm1\ \text{for}\ P^\pm. \tag{5.3.3}$$


For the NS sector, the Verma modules are given by the standard set of ladder 
operators multiplying a highest weight vacuum state. 
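
The first few coefficients $P_A(n, m)$ can be generated by expanding the product form of the A sector partition function directly. The sketch below assumes the generating function $\prod_k (1 + x^{k-1/2}y)(1 + x^{k-1/2}y^{-1})/(1 - x^k)^2$, that is, two fermionic towers $G^\pm_{-k+1/2}$ and two bosonic towers $L_{-k}$, $J_{-k}$. To avoid half-integer exponents, series are stored as dictionaries keyed by (twice the level, charge); the helper names are illustrative.

```python
from collections import defaultdict

def multiply(series, factor, max_2n):
    """Multiply two truncated series keyed by (2*level, charge)."""
    out = defaultdict(int)
    for (n2a, ma), ca in series.items():
        for (n2b, mb), cb in factor.items():
            if n2a + n2b <= max_2n:
                out[(n2a + n2b, ma + mb)] += ca * cb
    return dict(out)

def p_a_coefficients(max_2n):
    """Coefficients P_A(n, m) for all levels n with 2n <= max_2n."""
    series = {(0, 0): 1}
    # Fermionic modes at level k - 1/2 with charge +1 and -1.
    k = 1
    while 2 * k - 1 <= max_2n:
        series = multiply(series, {(0, 0): 1, (2 * k - 1, 1): 1}, max_2n)
        series = multiply(series, {(0, 0): 1, (2 * k - 1, -1): 1}, max_2n)
        k += 1
    # Two neutral bosonic towers: the factor 1/(1 - x^k) applied twice.
    k = 1
    while 2 * k <= max_2n:
        geometric = {(2 * k * j, 0): 1 for j in range(max_2n // (2 * k) + 1)}
        series = multiply(series, geometric, max_2n)
        series = multiply(series, geometric, max_2n)
        k += 1
    return series

coefficients = p_a_coefficients(4)
# Level 1, charge 0: the states L_{-1}, J_{-1}, and G^+_{-1/2} G^-_{-1/2}.
print(coefficients[(2, 0)])
```

At level 1 and zero charge the expansion counts three states, which matches the direct enumeration given in the comment.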

Because the zero modes are given by $L_0$ and $J_0$, the vacuums are labeled
by two indices corresponding to their eigenvalues: $|h, q\rangle$. Then the Kac
determinant for the super-Verma module generalizes the usual Kac determinant of
Eqs. (2.5.6) and (2.7.17):

$$\det M^A_{n,m}(\tilde c,h,q) = \prod_{1\le rs\le 2n}(f^A_{r,s})^{P_A(n-rs/2,\,m)}\times\prod_{k\in Z+1/2}(g^A_k)^{\tilde P_A(n-|k|,\,m-\mathrm{sign}(k);\,k)} \tag{5.3.4}$$

for $s$ even, where

$$\begin{aligned}
f^A_{r,s}(\tilde c,h,q) &= 2(\tilde c-1)h - q^2 - \tfrac14(\tilde c-1)^2 + \tfrac14[(\tilde c-1)r+s]^2,\\
g^A_k(\tilde c,h,q) &= 2h - 2kq + (\tilde c-1)(k^2-\tfrac14),
\end{aligned} \tag{5.3.5}$$

and where $k \in Z + \tfrac12$.





Repeating the same steps that we used for the minimal model, we now look 
for the zeros of the Kac determinant in order to find null states. By eliminating 
these null states, we find the minimal series. 

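For the A sector, the condition for the unitary minimal series is given by

$$\tilde c = 1 - \frac{2}{\tilde m}, \qquad h = \frac{jk - \tfrac14}{\tilde m}, \qquad q = \frac{j-k}{\tilde m}, \tag{5.3.6}$$

for $\tilde m \ge 2$ and $j, k \in Z + \tfrac12$, and $0 < j, k$, $j + k \le \tilde m - 1$.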


P Sector 


We can repeat the same steps for the P sector, with a few modifications. The
partition function for the P sector is given by

$$\sum_{n,m}P_P(n,m)\,x^ny^m = \left(y^{1/2} + y^{-1/2}\right)\prod_{k=1}^{\infty}\frac{(1 + x^ky)(1 + x^ky^{-1})}{(1 - x^k)^2}. \tag{5.3.7}$$


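For the P sector, the zero modes are given by $L_0$, $J_0$, and $G^i_0$. There are two
types of highest weight vacuums, given by $|h, q \mp \tfrac12\rangle_\pm$, which in turn satisfy
$(G^1_0 \pm iG^2_0)\,|h, q \mp \tfrac12\rangle_\pm = 0$.

The Kac determinant is given by

$$\det M^P_{n,m}(\tilde c,h,q) = \prod_{1\le rs\le 2n}(f^P_{r,s})^{P_P(n-rs/2,\,m)}\times\prod_{k\in Z}(g^P_k)^{\tilde P_P(n-|k|,\,m-\mathrm{sign}(k);\,k)}, \tag{5.3.8}$$

where

$$\begin{aligned}
f^P_{r,s}(\tilde c,h,q) &= 2(\tilde c-1)(h-\tilde c/8) - q^2 + \tfrac14[(\tilde c-1)r+s]^2,\\
g^P_k(\tilde c,h,q) &= 2h - 2kq + (\tilde c-1)(k^2-\tfrac14) - \tfrac14,
\end{aligned} \tag{5.3.9}$$

where $k \in Z$.

As before, by looking for the zeros of the Kac determinant, we can find the
unitary minimal series, which is given by

$$\tilde c = 1 - (2/\tilde m), \qquad h = \tilde c/8 + (jk/\tilde m), \qquad q = \mathrm{sign}(0)\,(j-k)/\tilde m, \tag{5.3.10}$$

for $\tilde m \ge 2$, $j, k \in Z$, and $0 \le j - 1, k$, $j + k \le \tilde m - 1$.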





T Sector 


Last, we analyze the T sector. Its partition function is given by

$$\sum_nP_T(n)\,x^n = \prod_{k=1}^{\infty}\frac{(1 + x^k)(1 + x^{k-1/2})}{(1 - x^k)(1 - x^{k-1/2})}. \tag{5.3.11}$$

For the twisted T sector, the zero modes are given by $L_0$ and $G^1_0$. The highest
weight state $|h\rangle$ actually splits into two states of fermion parity $(-1)^F = \pm1$.
Then, we have

$$\det M^T_{\pm,n}(\tilde c,h) = (h-\tilde c/8)^{P_T(n)/2}\prod_{1\le rs\le 2n}(f^T_{r,s})^{P_T(n-rs/2)}, \tag{5.3.12}$$

where

$$f^T_{r,s}(\tilde c,h) = 2(\tilde c-1)(h-\tilde c/8) + \tfrac14[(\tilde c-1)r+s]^2, \tag{5.3.13}$$

where $s$ is odd.

For the T sector, we have

$$\tilde c = 1 - (2/\tilde m), \qquad h = (\tilde c/8) + (\tilde m-2r)^2/16\tilde m, \tag{5.3.14}$$

for integer $\tilde m$ and $r$ such that $2 \le \tilde m$ and $1 \le r \le \tilde m/2$.

Now, let us summarize what we have learned by this exercise. We find that
for all three sectors, we have the important condition for the unitary series:

$$\tilde c = 1 - (2/\tilde m), \tag{5.3.15}$$

which can be rewritten as

$$N = 2\ \text{minimal series:}\qquad c = \frac{3k}{k+2}, \qquad k = 1, 2, \ldots, \tag{5.3.16}$$

so that $c < 3$. As in the bosonic case, we find that the primary fields
corresponding to the minimal models are labeled by two integers.

For the A sector, the conformal weight of the primary field was found to
be $h = (jk - \tfrac14)/\tilde m$. Let us make the following substitution of variables:
$\tilde m \to k + 2$, $j - k \to q$, and $j + k - 1 \to l$ (where the $k$ in $k + 2$ is the integer
of Eq. (5.3.16), not the half-integer $k$ of Eq. (5.3.6)). Then, the conformal weight $h_{l,q}$
and $U(1)$ charge of the minimal primary fields can be rewritten as

$$h_{l,q} = \frac{l(l+2)}{4(k+2)} - \frac{q^2}{4(k+2)}, \qquad Q = \frac{q}{k+2}, \tag{5.3.17}$$

for $l = 0, \ldots, k$ and $q = -l, -l+2, \ldots, l$. In this case, we have $l$, the principal
quantum number, which ranges from 0 to $k$. Also, we have the integer $q$, which
labels the $U(1)$ charge and is defined modulo $2(k + 2)$.
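
The change of variables can be checked by brute force. The sketch below enumerates the A sector weights and charges in both parametrizations and compares the resulting sets. It assumes the allowed range $0 < j, k$ and $j + k \le \tilde m - 1$ with half-integer $j, k$, and $q = -l, -l+2, \ldots, l$; the function names and the argument name `level` (the integer $k$ of the minimal series) are illustrative.

```python
from fractions import Fraction

def ns_spectrum_jk(m_tilde):
    """Pairs (h, charge) from h = (jk - 1/4)/m, q = (j - k)/m, half-integer j, k."""
    spectrum = set()
    half_odd = [Fraction(2 * n + 1, 2) for n in range(m_tilde)]
    for j in half_odd:
        for k in half_odd:
            if j + k <= m_tilde - 1:
                h = (j * k - Fraction(1, 4)) / m_tilde
                charge = (j - k) / m_tilde
                spectrum.add((h, charge))
    return spectrum

def ns_spectrum_lq(level):
    """Pairs (h, charge) with h = [l(l+2) - q^2] / 4(level+2), charge = q/(level+2)."""
    spectrum = set()
    for l in range(level + 1):
        for q in range(-l, l + 1, 2):
            h = Fraction(l * (l + 2) - q * q, 4 * (level + 2))
            spectrum.add((h, Fraction(q, level + 2)))
    return spectrum

for level in range(1, 6):
    same = ns_spectrum_jk(level + 2) == ns_spectrum_lq(level)
    print(level, same, len(ns_spectrum_lq(level)))
```

With these ranges, each level $k$ yields $(k+1)(k+2)/2$ distinct pairs of weight and charge in the A sector.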

Similarly, in the P sector, the conformal weight of a primary minimal field
was given by $h = (\tilde c/8) + jk/\tilde m$. By making a similar change of variables, we
find that the conformal weight and the $U(1)$ charge are given by

$$h_{l,q} = \frac{l(l+2)}{4(k+2)} - \frac{(q\pm1)^2}{4(k+2)} + \frac18, \qquad Q = \frac{q\pm1}{k+2} \mp \frac12. \tag{5.3.18}$$

These last two equations for the conformal weights and $U(1)$ charges for the
primary fields will be crucial for our later discussion.


5.4 N = 2 Minimal Models and Calabi-Yau Manifolds 

One of the remarkable surprises coming from the $N = 2$ superconformal field
theory is the one-to-one relationship between certain Calabi-Yau manifolds
and certain minimal theories [18, 19]. Normally, Calabi-Yau manifolds are so
nonlinear and complicated that any simple representation of them seems out
of the question. However, we will see that by simply tensoring certain minimal
theories that we found earlier, we will find a one-to-one correspondence with
certain Calabi-Yau manifolds. (In hindsight, perhaps this is not that surprising.
Calabi-Yau manifolds, in some sense, make the minimal assumptions that
one can make on a manifold and retain $N = 1$ space-time supersymmetry,
while $N = 2$ superconformal field theories, we saw earlier, are also associated
with manifolds with $N = 1$ space-time supersymmetry. However, it is still
remarkable that this nontrivial correspondence appears so early, at the level
of tensoring minimal models together.) In Chapter 7, we will give a heuristic
argument which helps to explain the origin of this interesting result.

If we take the naive case of simply tensoring several independent minimal
models together, then the central term of this product space is just the sum of
the central terms of the individual minimal models. This sum, in turn, must equal $\tfrac32$ times
the number of compactified dimensions, $10 - D$, when the 10-dimensional
NS-R space is compactified down to $D$ dimensions. Thus, the condition we wish to satisfy is given by [18, 19]:

$$c = \sum_{i=1}^{l}\frac{3k_i}{k_i+2} = \tfrac32\,(10 - D). \tag{5.4.1}$$

Symbolically, we will denote the tensoring of $l$ conformal field theories, each
with $k_i$, by the notation $(k_1, k_2, \ldots, k_l)$. We will now show the equivalence of these
minimal models and the Calabi-Yau manifolds by showing that they have the
same topological properties, that is, they have the same discrete symmetries,
the same fermion generation numbers, etc.
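
The central charge condition is easy to test for candidate tensor products. The following sketch assumes only the minimal series value $c = 3k/(k+2)$ for each factor and the target $\tfrac32(10 - D)$; the function names are illustrative.

```python
from fractions import Fraction

def tensor_central_charge(levels):
    """Sum of 3k/(k+2) over the minimal models in the tensor product."""
    return sum(Fraction(3 * k, k + 2) for k in levels)

def required_central_charge(spacetime_dimension):
    """Target value (3/2)(10 - D) for compactification down to D dimensions."""
    return Fraction(3, 2) * (10 - spacetime_dimension)

# Candidate products for D = 4: five copies of k = 3, four copies of k = 6,
# six copies of k = 2, and nine copies of k = 1.
for levels in ([3] * 5, [6] * 4, [2] * 6, [1] * 9):
    print(levels, tensor_central_charge(levels) == required_central_charge(4))
```

All four products listed in the comment reach $c = 9$, the value needed for $D = 4$.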

We will first calculate the group of symmetries that will leave this conformal
field theory invariant. For each $k_i$, there is a symmetry that leaves the theory
invariant given by $Z_{k_i+2}$. To see this, let us construct the character associated
with the primary fields of the theory, which is indexed by the integers $l$, $q$, and
$s$ (which indicates if the sector is NS or R). Let us call the character associated
with this primary field $\chi^{l\,(s)}_q$, where $s$ is even (odd) for the NS (R) sector.

Then, construct the following partition function: 




(5.4.2) 





where $x$, $y$ are complex numbers. Then, under the standard modular
transformation on $\tau$ in Eq. (4.1.3), we find

$$Z(x, y) \to Z(ax + by,\ cx + dy), \tag{5.4.3}$$

which holds if $x$, $y$ are both members of $Z_{k+2}$. (In the next section, we will
give an explicit form for these characters and partition functions, using the
parafermionic representation, from which this identity can be shown.) Thus,
the theory has a discrete $Z_{k+2}$ symmetry due to the fact that $x$, $y$ can be any of
the elements in $Z_{k+2}$.

For the general case of tensoring many minimal models together, each labeled
by $k_i$, the discrete symmetry of such a product is given by the product
of the individual discrete symmetries

$$G = Z_{k_1+2}\otimes Z_{k_2+2}\otimes\cdots\otimes Z_{k_l+2}. \tag{5.4.4}$$

If all the $k_i$ are the same, then the group $G$ has an additional symmetry group
given by $S_l$, the permutation group of $l$ identical objects. Thus, for $l$ identical
conformal field theories and $k + 2 = l$, the group of symmetries is

$$G = S_l\,\tilde\otimes\,Z_l^{\,l}/Z_l, \tag{5.4.5}$$

where the tilde represents the semidirect product.

Example: $(3^5)$

An example will help to illustrate this construction. Equation (5.4.1) tells us
that if we wish to have physical $D = 4$ theories, then we must set $c = 9$. The
simplest $c = 9$ theory can be obtained by tensoring five copies of the $k = 3$
theory, which we denote by $(3^5)$.

From the above arguments, this theory has partition functions that can
be labeled by elements of $Z_5$. We have five copies of $Z_5$, and therefore, we
also have the symmetry $S_5$ due to interchanges of the various $Z_5$ factors.
Its symmetry group is therefore given by $S_5\,\tilde\otimes\,Z_5^{\,5}/Z_5$. This group has 75,000
elements.
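
The order quoted here follows from simple counting, which the short sketch below reproduces. It assumes that the order of the semidirect product $S_n\,\tilde\otimes\,Z_n^{\,n}/Z_n$ is the product of the orders of its factors, $n!\cdot n^{n-1}$; the function name is illustrative.

```python
from math import factorial

def symmetry_group_order(n):
    """Order of S_n semidirect (Z_n^n / Z_n): n! permutations times n^(n-1) phases."""
    return factorial(n) * n ** (n - 1)

print(symmetry_group_order(5))
```

For $n = 5$ this gives $120 \times 625 = 75{,}000$, in agreement with the text.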

Now, let us compute the generation number (i.e., the number of identical
fermion multiplets) for the theory. We recall that the $U(1)$ charge for the $i$th
theory is given by Eq. (5.3.18):

$$Q_i = \frac{k_i - 2q_i}{2(k_i+2)}. \tag{5.4.6}$$

The Ramond ground state is the tensor product of the ground states of
each of the individual theories. However, the sum of these $U(1)$ charges must
equal $\pm\tfrac12$. Taking the value $+\tfrac12$, we have $\sum_iQ_i = \tfrac12$, or

$$\sum_i\frac{q_i}{k_i+2} = 1, \qquad 0 \le q_i \le k_i, \tag{5.4.7}$$

where we have used the fact that $c = 9 = \sum_i(3k_i)/(k_i+2)$.





It is now a simple matter to compute the generation number for various
theories. One simply counts the number of ways in which we can choose the
integers $q_i$. Each set of integers $\{q_i\}$ that satisfies Eq. (5.4.7) yields an equivalent
fermion generation. For example, the number of ways we can choose five
integers, each between 0 and 3, such that they sum to 5 is equal to 101. Thus,
the generation number for $(3^5)$ is 101. Likewise, the generation number of $(6^4)$
is 149.
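
This counting can be reproduced by direct enumeration. The sketch below assumes the condition $\sum_i q_i/(k_i+2) = 1$ with $0 \le q_i \le k_i$ and simply loops over all allowed integer choices; the function name is illustrative.

```python
from fractions import Fraction
from itertools import product

def generation_number(levels):
    """Count integer sets {q_i}, 0 <= q_i <= k_i, with sum of q_i/(k_i+2) equal to 1."""
    count = 0
    for charges in product(*(range(k + 1) for k in levels)):
        total = sum(Fraction(q, k + 2) for q, k in zip(charges, levels))
        if total == 1:
            count += 1
    return count

print(generation_number([3] * 5), generation_number([6] * 4))
```

The same numbers follow from inclusion-exclusion: for $(3^5)$, the 126 non-negative solutions of $q_1 + \cdots + q_5 = 5$ minus the 25 solutions in which some $q_i \ge 4$ leave 101.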

Now that we have calculated the discrete symmetries and the number of
fermion generations associated with some of these tensored minimal models,
let us do the same for the Calabi-Yau manifolds. First, let us compare them to
the Calabi-Yau manifold in $CP^4$ constrained by

$$\sum_{i=1}^{5}z_i^5 = 0. \tag{5.4.8}$$

This surface enjoys the global symmetry

$$G = S_5\,\tilde\otimes\,Z_5^{\,5}/Z_5. \tag{5.4.9}$$

This symmetry group arises because $S_5$ permutes the different variables $z_i$
within Eq. (5.4.8). Also, the manifold is invariant if we make the following
transformation in the constraint equation:

$$z_i \to e^{2\pi i/5}z_i. \tag{5.4.10}$$

(The last factor of $Z_5$ is due to the fact that the overall phase in $CP^4$ is irrelevant.)
This symmetry group is precisely the same as that found for $(3^5)$. Likewise,
the Calabi-Yau manifold specified by

$$\sum_{i=1}^{n}z_i^n = 0 \tag{5.4.11}$$

has the symmetry group

$$S_n\,\tilde\otimes\,Z_n^{\,n}/Z_n, \tag{5.4.12}$$

which is precisely the same as the one found for the minimal model in Eq.
(5.4.5).

In addition, the generation number for a manifold can also be calculated
topologically. Because the number of fermions that can propagate on a manifold
is topologically fixed, the generation number is directly related to the
Dirac index, Euler number, or Hodge numbers. We find that the generation
number is 101. This agrees exactly with the fermion generation number found
for $(3^5)$.

Similarly, we can proceed to establish a one-to-one relationship between
certain tensor products of minimal models and other Calabi-Yau manifolds by
examining their discrete symmetries, their generation number, their fermion
content, etc. [18, 19]. Although this does not constitute a rigorous proof of
the correspondence between these theories, it gives overwhelming evidence of
their equivalence. In Chapter 7, we will give some arguments which help to
reveal the origin of this fascinating result.


5.5 Parafermions 

As in bosonic theory, we wish to find a specific representation of the $N = 2$
minimal model by which we can calculate the structure constants of the fusion
rules and the correlation functions. In particular, we wish to calculate the
character $\chi^{l\,(s)}_q$ for the primary fields of the minimal model and also the partition
function. In this way, we can confirm that the partition function has the $Z_{k+2}$
discrete symmetry in Eq. (5.4.2).

There is a representation of this algebra in terms of parafermions [20, 21]
that allows us to calculate the main features of this algebra. This representation
of the $N = 2$ algebra can be given in terms of the parafermion field $\psi_1$, used to
build up representations of the $Z_k$ spin models found in statistical mechanics.
(The $Z_k$ symmetry refers to the discrete values that the spins in a lattice can
assume. For example, in an ordinary Ising model, the spins can take on $\pm1$,
so the spin assumes values in $Z_2$. The generalization to the $Z_k$ model is thus
straightforward, depending on the roots of unity.)

Let us postulate the existence of a parafermion field $\psi_1$ and a free boson
field $\phi$. Combine them in the following fashion:

$$G_+(z) = \sqrt{\frac{2k}{k+2}}\;\psi_1\,e^{ia\phi}, \qquad G_-(z) = \sqrt{\frac{2k}{k+2}}\;\psi_1^\dagger\,e^{-ia\phi}, \qquad J(z) = i\sqrt{\frac{k}{k+2}}\;\partial_z\phi, \tag{5.5.1}$$

where $a = \sqrt{(k+2)/k}$.

If we plug this representation back into the $N = 2$ operator product expansion,
then we find that the $N = 2$ algebra is satisfied as long as we
fix

$$\psi_1(w)\,\psi_1^\dagger(z) \sim (w-z)^{-2(k-1)/k} + \cdots. \tag{5.5.2}$$

For $k = 2$, this expression reduces to the usual one for ordinary fermions,
as in Eq. (2.2.15). For other values of $k$, however, this expression shows that
the fields are parafermionic. From Eq. (5.5.2), we see by scale arguments that
the field $\psi_1$ has conformal weight $(k-1)/k$. From Eqs. (3.1.28) and (3.1.29),
we see that the exponential field $e^{ia\phi}$ has weight $a^2/2 = (k+2)/2k$. If we add
them together, we find that the operators $G_\pm$ in Eq. (5.5.1) have weight $\tfrac32$, as
desired.

If we calculate the operator product expansion of two energy-momentum
tensors $T(z)$, then the central charge of the full algebra is the sum of the central
charge of the free boson field $\phi$, which has $c_\phi = 1$, and the parafermion
field, which contributes $c_\psi = 2(k-1)/(k+2)$. (To see this, notice that
the parafermionic theory can be obtained from a GKO construction with
$SU(2)_k/U(1)$, so the central charge is given by $c = [3k/(k+2)] - 1 =
2(k-1)/(k+2)$.) Thus, the full central charge of the $N = 2$ algebra is given
by

$$c = c_\phi + c_\psi = 1 + \frac{2(k-1)}{k+2} = \frac{3k}{k+2}, \tag{5.5.3}$$

which confirms that we have a representation of the minimal series as in Eq.
(5.3.16).
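
Both pieces of arithmetic used here, the weight of $G_\pm$ and the total central charge, can be confirmed for a range of $k$ with exact fractions. The sketch assumes the parafermion weight $(k-1)/k$, the exponential weight $(k+2)/2k$, and the parafermion central charge $2(k-1)/(k+2)$ quoted in the text; the function names are illustrative.

```python
from fractions import Fraction

def supercurrent_weight(k):
    """Weight of G_+-: parafermion weight (k-1)/k plus exponential weight (k+2)/(2k)."""
    return Fraction(k - 1, k) + Fraction(k + 2, 2 * k)

def full_central_charge(k):
    """Free boson (c = 1) plus Z_k parafermions (c = 2(k-1)/(k+2))."""
    return 1 + Fraction(2 * (k - 1), k + 2)

for k in range(1, 8):
    print(k, supercurrent_weight(k), full_central_charge(k))
```

For every $k$ the supercurrent weight comes out as $\tfrac32$ and the central charge as $3k/(k+2)$.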

Now that we have a representation of the minimal $N = 2$ algebra with
the correct central charge in terms of parafermions, we would like to construct
the primary fields and the characters of the $N = 2$ algebra out of these
parafermions and solve for the entire theory. In particular, we will write a
parafermionic representation of the primary field $V^l_m$. To do this, we find it
necessary to introduce the complete set of parafermionic operators, of which
$\psi_1$ is only one member.

In the $Z_k$ model, we have a set of currents $\psi_l$, $l = 1, 2, \ldots, k-1$, which
have an operator product expansion given by

$$\begin{aligned}
\psi_l(w)\,\psi_{l'}(z) &\sim c_{l,l'}\,(w-z)^{-2ll'/k}\,[\psi_{l+l'}(z) + O(w-z)], & l + l' &< k,\\
\psi_l(w)\,\psi^\dagger_{l'}(z) &\sim c_{l,k-l'}\,(w-z)^{-2l'(k-l)/k}\,[\psi_{l-l'}(z) + O(w-z)], & l' &< l,\\
\psi_l(w)\,\psi^\dagger_l(z) &\sim (w-z)^{-2\Delta_l}\,[I + (2\Delta_l/c)(w-z)^2\,T(w) + O((w-z)^3)],
\end{aligned} \tag{5.5.4}$$

where $\psi^\dagger_l = \psi_{k-l}$. In addition, we have the operator product expansion with
regard to the energy-momentum tensor

$$T(w)\,\psi_l(z) \sim \frac{\Delta_l\,\psi_l(z)}{(w-z)^2} + \frac{\partial_z\psi_l(z)}{w-z} + O(1), \tag{5.5.5}$$

where

$$\Delta_l = \frac{l(k-l)}{k}. \tag{5.5.6}$$
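
The singular powers in the parafermion operator products are fixed by the weights $\Delta_l = l(k-l)/k$: the exponent in each expansion is the sum of the weights of the two currents minus the weight of the leading operator on the right-hand side. The sketch below checks this rule, assuming $\psi^\dagger_l = \psi_{k-l}$ and the exponents $2ll'/k$ and $2l'(k-l)/k$; the function name is illustrative.

```python
from fractions import Fraction

def parafermion_weight(l, k):
    """Weight l(k-l)/k of the Z_k parafermion current psi_l (with l taken mod k)."""
    l = l % k
    return Fraction(l * (k - l), k)

def check_exponents(k):
    """Compare the quoted OPE exponents with sums and differences of weights."""
    for l in range(1, k):
        for lp in range(1, k):
            if l + lp < k:
                lhs = (parafermion_weight(l, k) + parafermion_weight(lp, k)
                       - parafermion_weight(l + lp, k))
                if lhs != Fraction(2 * l * lp, k):
                    return False
            if lp < l:
                # psi_l times psi_{l'}^dagger = psi_l times psi_{k-l'} -> psi_{l-l'}
                lhs = (parafermion_weight(l, k) + parafermion_weight(k - lp, k)
                       - parafermion_weight(l - lp, k))
                if lhs != Fraction(2 * lp * (k - l), k):
                    return False
    return True

print(all(check_exponents(k) for k in range(2, 12)))
```

For $k = 2$ the only current has weight $\tfrac12$, which is the ordinary free fermion.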


In addition to the current $\psi_l$, we must also introduce the field $\psi^l_m$, where

$$\begin{aligned}
\psi_1(w)\,\psi^l_m(z) &\sim \sum_{j=-\infty}^{\infty}(w-z)^{-m/k+j-1}\,A_{(m+1)/k-j}\,\psi^l_m(z),\\
\psi^\dagger_1(w)\,\psi^l_m(z) &\sim \sum_{j=-\infty}^{\infty}(w-z)^{m/k+j-1}\,A^\dagger_{(1-m)/k-j}\,\psi^l_m(z),
\end{aligned} \tag{5.5.7}$$

where the $A$'s represent certain operators, whose precise nature is not important
to our discussion, and where the dimension of $\psi^l_m$ is given by

$$\Delta^l_m = \frac{l(l+2)}{4(k+2)} - \frac{m^2}{4k}. \tag{5.5.8}$$

The point of introducing these extra fields $\psi^l_m$ associated with the $Z_k$ model
is that we can now write an explicit representation of the primary fields of the
$N = 2$ theory, which is given by

$$V^l_m = \psi^l_m\,e^{i\alpha_m\phi(z)}. \tag{5.5.9}$$


We demand that the above expression transform as a primary field under the
$N = 2$ algebra, which places a constraint on the constant $\alpha_m$. Since all operator
product expansions for a primary field are completely given, we can calculate
the operator product of $V^l_m$ with the generators $G_\pm$, and the value of $\alpha_m$. The
calculation yields

$$\begin{aligned}
\alpha_m &= \frac{m - [\tfrac12\mathrm{sign}(0) + a]\,k}{\sqrt{k(k+2)}}, & m &= -l, \ldots, l-2, l,\\
\alpha_m &= \frac{m - [\tfrac12\mathrm{sign}(0) + 1 + a]\,k}{\sqrt{k(k+2)}}, & m &= l, l+2, \ldots,
\end{aligned} \tag{5.5.10}$$

where, in the NS sector, we have $a = \tfrac12$ and $\mathrm{sign}(0) = -1$, while in the R
sector, we have $a = 0$ and $\mathrm{sign}(0) = \pm1$.

The final step in the proof that V l m are primary fields is to calculate their 
conformal dimension. Since the conformal dimension of \ ,l m is the sum of the 
dimension of the and the exponential of the free boson field <fi, we find that 
the final dimension is given by the following sum: 


A(0 = A' m + i^ 


1(1 + 2 ) m 2 

4 (k + 2) 4 (k + 2) 


(5.5.11) 


for the NS sector, with / = 0, 1,..., k and m — —l, —l + 2,... ,1. For the R 
sector, we have 


A (0 = *!„ + & 


1(1 +2) (m ± l) 2 1 

4 (k + 2) ~ 4(k + 2) + 8' 


(5.5.12) 


Comparing these weights with the weights given by Eqs. (5.3.17) and 
(5.3.18), we find that we have an exact correspondence, as expected. Thus, 
we now have an explicit representation of the primary fields of the minimal 
series in terms of parafermions via Eq. (5.5.9). 
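The dimension count above can be checked with exact rational arithmetic. The following is a minimal sketch, not the book's own calculation: it assumes the parafermion weight $l(l+2)/4(k+2) - m^2/4k$ and the first line of Eq. (5.5.10), with $a = 1/2$, sign(0) = -1 in the NS sector and $a = 0$, sign(0) = ±1 in the R sector. The function names are illustrative only.

```python
from fractions import Fraction

def parafermion_weight(l, m, k):
    """Weight of psi^l_m: l(l+2)/(4(k+2)) - m^2/(4k)."""
    return Fraction(l * (l + 2), 4 * (k + 2)) - Fraction(m * m, 4 * k)

def alpha_squared(m, k, a, sign0):
    """alpha_m^2 from the first line of Eq. (5.5.10) (assumed form)."""
    shift = (Fraction(sign0, 2) + a) * k
    return (m - shift) ** 2 / Fraction(k * (k + 2))

def total_weight(l, m, k, a, sign0):
    """Parafermion weight plus the free-boson contribution alpha_m^2 / 2."""
    return parafermion_weight(l, m, k) + alpha_squared(m, k, a, sign0) / 2

def ns_weight(l, m, k):
    """Right-hand side of Eq. (5.5.11)."""
    return Fraction(l * (l + 2) - m * m, 4 * (k + 2))

def r_weight(l, m, k, sign0):
    """Right-hand side of Eq. (5.5.12); the sign in (m +/- 1) follows sign(0)."""
    return Fraction(l * (l + 2) - (m + sign0) ** 2, 4 * (k + 2)) + Fraction(1, 8)

if __name__ == "__main__":
    k, l, m = 3, 2, 0
    print(total_weight(l, m, k, Fraction(1, 2), -1), ns_weight(l, m, k))
    print(total_weight(l, m, k, 0, +1), r_weight(l, m, k, +1))
```

In the NS sector the bracket in Eq. (5.5.10) vanishes, so $\alpha_m = m/\sqrt{k(k+2)}$ and the $m^2/4k$ term of the parafermion weight is converted into $m^2/4(k+2)$.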

Our next goal is to calculate the characters and partition functions associated 
with each primary field. Although this may seem an arduous task, there are 
many simplifications because of the relationship between parafermions and 
the WZW model, whose characters and partition functions we calculated in 
Chapter 4. 





We have seen that the $Z_k$ currents are sufficient to give us a representation of
the minimal model of $N = 2$ superconformal symmetry. However, what is also
interesting is that these same parafermion fields can give us a representation
of the WZW model. The quantum numbers $l, m$ found in the parafermion
model, in fact, have a direct counterpart in the quantum numbers found in the
$SU(2)$ model. For example, let us construct the following operators from the
parafermions:

$$
\begin{aligned}
J^+(z) &= \sqrt{k}\,\psi_1(z) : e^{i\phi(z)/\sqrt{k}} : ,\\
J^-(z) &= \sqrt{k}\,\psi^\dagger_1(z) : e^{-i\phi(z)/\sqrt{k}} : ,\\
J^3(z) &= \sqrt{k}\,\partial_z\phi(z).
\end{aligned}
\tag{5.5.13}
$$

Because we know all the operator products, we can then check that the affine
$SU(2)$ operator product relations are satisfied

$$
J^a(w)\,J^b(z) \sim \frac{k\,q^{ab}}{(w-z)^2} + \frac{f^{ab}{}_{c}\,J^c(z)}{w-z},
\tag{5.5.14}
$$


as long as we fix

$$
f^{3+}{}_{+} = -f^{+3}{}_{+} = -f^{3-}{}_{-} = \tfrac{1}{2} f^{+-}{}_{3} = 1, \qquad
q^{33} = \tfrac{1}{2} q^{+-} = \tfrac{1}{2} q^{-+} = 1.
\tag{5.5.15}
$$


Now that we have established a relationship between the $Z_k$ parafermion
fields and the algebra of the WZW $SU(2)$ model, let us establish a relationship
between their primary fields. Let us denote the primary fields of the WZW
model by $G^l_{m,\bar m}$, obeying the usual relations for a primary field

$$
\begin{aligned}
J^a_n\,G^l_{m,\bar m} &= \bar J^a_n\,G^l_{m,\bar m} = 0, \qquad n > 0,\\
\left(J^3_0 - \tfrac{m}{2}\right) G^l_{m,\bar m} &= \left(\bar J^3_0 - \tfrac{\bar m}{2}\right) G^l_{m,\bar m} = 0.
\end{aligned}
\tag{5.5.16}
$$

What is remarkable is that we can establish a direct relationship between the
primary fields found in the $Z_k$ parafermion model and the WZW $SU(2)$ model:

$$
G^l_{m,\bar m} = \psi^l_{m,\bar m} : \exp\left[\, i m\,\phi(z)/2\sqrt{k} + i \bar m\,\bar\phi(\bar z)/2\sqrt{k}\,\right] : ,
\tag{5.5.17}
$$

where $0 \le l \le k$ and $-l \le m \le l$.

Let us now turn to the final goal, which is to calculate the characters and
partition functions of the superconformal minimal model. At first, the thought
of calculating the characters of the $N = 2$ superconformal theory may seem
prohibitive, until we realize that we can use some tricks. We will calculate the
partition function of the $N = 2$ theory in terms of the partition function of the
parafermion theory.

Let us define

$$
Z_{l,m} = \mathrm{Tr}\, e^{2\pi i\tau (L_0 - c/24)},
\tag{5.5.18}
$$





where we only trace over the states in the Verma module given by specific
values of $(l, m)$. The key is to realize that, because the primary field $V^l_m$ in Eq.
(5.5.9) is a product of a bosonic and parafermionic piece, the operator $L_0$ is
just the sum of the $L_0$ over the bosonic and parafermionic pieces.

The trace then splits up into the product of these two factors. The product
over the bosonic piece yields the usual Dedekind $\eta$ function, and the other
piece yields the trace over the parafermionic field

$$
Z_{l,m} = \frac{1}{\eta(\tau)}\, Z^{p}_{l,m},
\tag{5.5.19}
$$

where $Z^{p}_{l,m}$ is the contribution over the parafermionic piece. Using Eqs. (5.5.11)
and (5.5.12), we find

$$
Z_{l,m} = \mathrm{Tr}\,\exp\left[ 2\pi i\tau \left( \frac{l(l+2)}{4(k+2)} + N_c - \frac{c}{24} \right) \right],
\tag{5.5.20}
$$

where $N_c$ is the number operator, where

$$
L_0 = N_c + \frac{l(l+2)}{4(k+2)},
\tag{5.5.21}
$$

and

$$
c^l_m(\tau) = \exp\left[ 2\pi i\tau \left( \frac{l(l+2)}{4(k+2)} - \frac{m^2}{4k} - \frac{c}{24} \right) \right] \sum_{n} p_n\, e^{2\pi i n\tau},
\tag{5.5.22}
$$

where $c = 3k/(k+2)$, $p_n$ is the number of states in the irreducible
representation with highest weight $l$, and $m$ labels the eigenvalue of $J^3_0$.

Given the partition function over the parafermion theory, we can now
construct the partition function over the $N = 2$ theory. It is given by

$$
\chi^{l(s)}_m(\tau, z, u) = \sum_{j \bmod k} c^l_{m+4j-s}(\tau)\,
\Theta_{2m+(4j-s)(k+2),\,2k(k+2)}(\tau, 2kz, u),
\tag{5.5.23}
$$

where

$$
\Theta_{n,m}(\tau, z, u) = e^{-2\pi i u} \sum_{j \in Z + n/2m} e^{2\pi i\tau m j^2 + 2\pi i j z},
\tag{5.5.24}
$$

which is the same $\Theta$ function that appeared in Eq. (4.5.18) for the WZW model.
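To make the $\Theta$ function concrete, here is a small numerical sketch. It assumes the form $\Theta_{n,m}(\tau,z,u) = e^{-2\pi i u}\sum_{j\in Z+n/2m} e^{2\pi i\tau m j^2 + 2\pi i j z}$ as read from Eq. (5.5.24), and truncates the sum at a finite cutoff (the `cutoff` parameter is an assumption of this sketch, not part of the text). Because the sum runs over $j \in Z + n/2m$, shifting $n \to n + 2m$ only relabels $j$, which gives a simple consistency check.

```python
import cmath

def theta(n, m, tau, z, u, cutoff=20):
    """Truncated sum for Theta_{n,m}(tau, z, u), assumed form of Eq. (5.5.24).

    Converges quickly when Im(tau) > 0; `cutoff` bounds the integer part of j.
    """
    prefactor = cmath.exp(-2j * cmath.pi * u)
    total = 0j
    for r in range(-cutoff, cutoff + 1):
        j = r + n / (2 * m)  # j runs over Z + n/(2m)
        total += cmath.exp(2j * cmath.pi * (tau * m * j * j + j * z))
    return prefactor * total

if __name__ == "__main__":
    tau, z, u = 0.1 + 0.5j, 0.3, 0.2
    # n and n + 2m label the same function
    print(abs(theta(1, 3, tau, z, u) - theta(7, 3, tau, z, u)))
```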

We have now succeeded in our goal: an explicit formula for the character 
of the primary fields of the superconformal minimal model, obtained using 
the parafermionic representation. From this, we can, in principle, calculate the 
modular properties of the N = 2 minimal series. 


5.6 Supersymmetric Coset Construction 

One defect of the previous construction is the presence of extra $U(1)$ factors
in the gauge group; these factors are known to cause complications at the loop
level [22-23]. However, these extra $U(1)$ factors are difficult to remove in the
previous construction.

We will now review a more ambitious construction [24] based on conformal 
field theories with c > 3, which generalizes the previous construction. We 
recall that, with minimal models, this is impossible to implement without 
taking tensor products of several minimal models, each with c < 3. However, 
we learned from previous chapters that the GKO coset construction allows 
for representations with c > 3, so we will now investigate supersymmetric 
coset constructions to search for nonminimal models that are not bound by 
this restriction. 

In order to carry out this ambitious plan, we must generalize our previous 
discussion by repeating the steps used for the bosonic case: 

(1) First, we will give the operator product relations for the super-Kac-Moody 
algebra. 

(2) Second, we will give the operator product expansion of the coset 
construction based on these super-Kac-Moody generators. 

(3) Third, we will construct a $U(1)$ operator out of the fields of the
super-Kac-Moody algebra.

(4) Last, we will postulate a very general ansatz for the $N = 2$
superconformal operators in terms of the super-Kac-Moody superalgebra
containing a large number of unspecified coefficients. We then force
these operators to obey the correct operator product expansion for the
$N = 2$ algebra. This will determine all the coefficients appearing in the
ansatz, giving us what are called hermitian symmetric spaces. Among
these spaces, we will find phenomenologically interesting coset
constructions based on the gauge group $E_8 \otimes E_6$ without the unwanted $U(1)$
factors.

We begin this construction by introducing, in addition to the usual
Kac-Moody generator $J^A(z)$, its supersymmetric partner $j^A(z)$. To couple
these two currents, we also introduce a Grassmann variable $\theta$, such
that

$$
J^A(z, \theta) = j^A(z) + \theta J^A(z).
\tag{5.6.1}
$$

Let us now write the operator product expansion of these currents

$$
J^A(z_1, \theta_1)\,J^B(z_2, \theta_2) \sim \frac{k\,\delta^{AB}}{2 z_{12}} + \frac{\theta_{12}}{z_{12}}\, i f^{ABC} J^C(z_2, \theta_2),
\tag{5.6.2}
$$

where, as usual, we define

$$
\theta_{12} = \theta_1 - \theta_2, \qquad z_{12} = z_1 - z_2 - \theta_1\theta_2.
\tag{5.6.3}
$$





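The previous expression is written in shorthand. If we expand the previous
product relation in detail, we find the following three relations:

$$
\begin{aligned}
j^A(w)\,j^B(z) &\sim \frac{(k/2)\,\delta^{AB}}{w-z},\\
J^A(w)\,j^B(z) &\sim \frac{i f^{ABC} j^C(z)}{w-z},\\
J^A(w)\,J^B(z) &\sim \frac{(k/2)\,\delta^{AB}}{(w-z)^2} + \frac{i f^{ABC} J^C(z)}{w-z}.
\end{aligned}
\tag{5.6.4}
$$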
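The expansion into components can be sketched as follows. This is a worked check under stated assumptions, not the book's derivation: the superfield is $J^A(z,\theta) = j^A(z) + \theta J^A(z)$ with $j^A$ fermionic and $J^A$ bosonic, the superfield product has central term $k\delta^{AB}/2z_{12}$ and current term $(\theta_{12}/z_{12})\, i f^{ABC} J^C$, and $z \equiv z_1 - z_2$ is shorthand introduced here.

```latex
% Component expansion of the superfield OPE (a sketch; z := z_1 - z_2).
% Operators on the left sit at z_1, those on the right at z_2.
\begin{align*}
\frac{1}{z_{12}} &= \frac{1}{z} + \frac{\theta_1\theta_2}{z^2},
\qquad
\frac{\theta_{12}}{z_{12}} = \frac{\theta_1 - \theta_2}{z}
\quad\text{(since } (\theta_1-\theta_2)\,\theta_1\theta_2 = 0\text{)},\\[4pt]
\text{LHS} &= j^A j^B + \theta_1\, J^A j^B - \theta_2\, j^A J^B
             + \theta_1\theta_2\, J^A J^B,\\[4pt]
\text{RHS} &= \frac{k\,\delta^{AB}}{2z}
   + (\theta_1 - \theta_2)\,\frac{i f^{ABC} j^C}{z}
   + \theta_1\theta_2 \left[ \frac{k\,\delta^{AB}}{2z^2}
   + \frac{i f^{ABC} J^C}{z} \right].
\end{align*}
% Matching powers of theta:
%   theta^0          : j^A j^B ~ (k/2) delta^{AB} / z
%   theta_1          : J^A j^B ~ i f^{ABC} j^C / z
%   theta_2          : j^A J^B ~ i f^{ABC} j^C / z  (same content as the theta_1 term)
%   theta_1 theta_2  : J^A J^B ~ (k/2) delta^{AB} / z^2 + i f^{ABC} J^C / z
```

The minus sign in front of $\theta_2\, j^A J^B$ comes from moving $\theta_2$ past the fermionic current $j^A$; the three independent coefficients reproduce the three component relations.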


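Let us now express the usual $N = 1$ superconformal algebra in terms of the
super-Kac-Moody algebra via the Sugawara construction, as we did for the
bosonic theory. If we define

$$
T(z, \theta) = \tfrac{1}{2} G(z) + \theta\, T(z),
\tag{5.6.5}
$$

then we have the usual operator product expansion for the $N = 1$ SUSY:

$$
T(z_1, \theta_1)\,T(z_2, \theta_2) \sim \frac{c}{6 z_{12}^3}
+ \left( \frac{3\theta_{12}}{2 z_{12}^2} + \frac{1}{2 z_{12}} D_2 + \frac{\theta_{12}}{z_{12}}\,\partial_2 \right) T(z_2, \theta_2),
\tag{5.6.6}
$$

where $D_2 = \partial/\partial\theta_2 + \theta_2\,\partial/\partial z_2$.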

The important step is to use the Sugawara construction to find a
representation of the $N = 1$ SUSY in terms of the super-Kac-Moody generators
via

$$
T(z, \theta) = \frac{1}{k} : DJ^A(z, \theta)\,J^A(z, \theta) :
+ \frac{2i}{3k^2}\, f^{ABC} : J^A(z, \theta)\,J^B(z, \theta)\,J^C(z, \theta) : .
\tag{5.6.7}
$$

With this Sugawara construction, we can compute the product between the
$N = 1$ SUSY generators and the super-Kac-Moody generators

$$
T(z_1, \theta_1)\,J^A(z_2, \theta_2) \sim \frac{\theta_{12}}{2 z_{12}^2}\, J^A(z_2, \theta_2)
+ \frac{1}{2 z_{12}}\, D_2 J^A(z_2, \theta_2)
+ \frac{\theta_{12}}{z_{12}}\,\partial_2 J^A(z_2, \theta_2).
\tag{5.6.8}
$$

The previous relations show that we do not yet have an acceptable
Kac-Moody algebra. Notice that the second relation shows that the two currents,
$J^A$ and its supersymmetric partner $j^B$, are not independent. To make them
truly independent, we introduce the modified field $\hat J^A(z)$ such that

$$
\hat J^A(z) = J^A(z) + (i/k)\, f^{ABC} : j^B(z)\,j^C(z) :
\tag{5.6.9}
$$

so that $\hat J^A$ and $j^B$ are independent, that is, $\hat J^A(z)\,j^B(w) \sim 0$.
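The decoupling can be verified in a few lines. The following is a sketch under stated assumptions: the component products $j^A(w) j^B(z) \sim (k/2)\delta^{AB}/(w-z)$ and $J^A(w) j^B(z) \sim i f^{ABC} j^C(z)/(w-z)$, totally antisymmetric $f^{ABC}$, and a modified current with a hat, $\hat J^A = J^A + (i/k) f^{ABC} j^B j^C$ (the hat is notation assumed here).

```latex
% Why the shifted current decouples from j^B (a sketch).
\begin{align*}
J^A(w)\, j^B(z) &\sim \frac{i f^{ABC} j^C(z)}{w-z},\\[4pt]
\frac{i}{k}\, f^{ADE} :\! j^D j^E \!:(w)\; j^B(z)
  &\sim \frac{i}{k}\, f^{ADE}\, \frac{k}{2}\,
     \frac{ j^D\,\delta^{EB} - \delta^{DB}\, j^E }{w-z}\\
  &= \frac{i}{2}\, \frac{ f^{ADB} j^D - f^{ABE} j^E }{w-z}
   = -\,\frac{i f^{ABC} j^C(z)}{w-z},\\[4pt]
\hat J^A(w)\, j^B(z) &\sim 0 .
\end{align*}
% The relative minus sign between the two contractions comes from moving
% j^B past the fermion j^E before contracting with j^D; antisymmetry of f
% then turns both terms into -f^{ABC} j^C.
```

The single-pole term generated by the fermion bilinear cancels the one in $J^A(w) j^B(z)$, which is what fixes the coefficient $i/k$.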

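Then, we can rewrite

$$
\begin{aligned}
T(z) &= \frac{1}{k}\left[\, : \hat J^A(z)\,\hat J^A(z) : - : j^A(z)\,\partial j^A(z) : \,\right],\\
G(z) &= \frac{2}{k}\left[\, j^A(z)\,\hat J^A(z) - \frac{i}{3k} : f^{ABC} j^A(z)\,j^B(z)\,j^C(z) : \,\right].
\end{aligned}
\tag{5.6.10}
$$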





The central charge emerging from this construction is given by

$$
c = \tfrac{1}{2}\dim G + \frac{\hat k\,\dim G}{\hat k + g},
\tag{5.6.11}
$$

where

$$
g = c_2(G), \qquad f^{ACD} f^{BCD} = c_2(G)\,\delta^{AB},
\tag{5.6.12}
$$

and $\hat k = k - g$. We have now successfully constructed the Sugawara
representation of the $N = 1$ SUSY in terms of the super-Kac-Moody algebra.
However, our goal is to implement the GKO coset construction so that we may
have $c > 3$ (which is impossible for any minimal model).

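Let $G$ have a subgroup $H$. Let the indices of the generators of $G$ ($H$) be
represented by $A$ ($a$). Also let the generators of the coset $G/H$ be represented
by the indices $\bar a$. Then, as usual, we have the important relationship

$$
c_{G/H} = c_G - c_H.
\tag{5.6.13}
$$

To be specific, we wish to split the supergenerators as follows:

$$
\begin{aligned}
T_G(z) &= T_H(z) + T_{G/H}(z),\\
G_G(z) &= G_H(z) + G_{G/H}(z).
\end{aligned}
\tag{5.6.14}
$$

Let us now make one more redefinition:

$$
\tilde J^a(z) = J^a(z) + (i/k)\, f^{abc} j^b(z)\,j^c(z).
\tag{5.6.15}
$$

Let us redefine

$$
\begin{aligned}
T_H(z) &= (1/k)\left[\, \tilde J^a(z)\,\tilde J^a(z) - j^a(z)\,\partial j^a(z) \,\right],\\
G_H(z) &= (2/k)\left[\, j^a(z)\,\tilde J^a(z) - (i/3k)\, f^{abc} j^a(z)\,j^b(z)\,j^c(z) \,\right].
\end{aligned}
\tag{5.6.16}
$$

With this redefinition, we have

$$
\begin{aligned}
T_{G/H} &= \frac{1}{k}\Big[\, \hat J^{\bar a}\hat J^{\bar a} - \frac{\hat k}{k}\, j^{\bar a}\partial j^{\bar a}
+ \frac{2i}{k}\, \tilde J^a f_{a\bar b\bar c}\, j^{\bar b} j^{\bar c}\\
&\qquad - \frac{1}{k}\, f_{\bar a pq} f_{\bar b pq}\, j^{\bar a}\partial j^{\bar b}
- \frac{1}{k^2}\, f_{a\bar b\bar c} f_{a\bar d\bar e}\, j^{\bar b} j^{\bar c} j^{\bar d} j^{\bar e} \,\Big],\\
G_{G/H} &= \frac{2}{k}\left[\, j^{\bar a}(z)\,\hat J^{\bar a}(z) - \frac{i}{3k}\, f_{\bar a\bar b\bar c}\, j^{\bar a}(z)\,j^{\bar b}(z)\,j^{\bar c}(z) \,\right].
\end{aligned}
\tag{5.6.17}
$$

We find that the Virasoro generators now close correctly, and that

$$
\begin{aligned}
c_G &= \tfrac{1}{2}\dim G + \frac{(k - g)\dim G}{k},\\
c_H &= \tfrac{1}{2}\dim H + \frac{(k - h)\dim H}{k}.
\end{aligned}
\tag{5.6.18}
$$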





5.7 Hermitian Spaces 


So far, we have done almost nothing new. All we have shown is how to write
the operator product expansion of the super-Kac-Moody representation and its
coset construction. In the next step, we add the real physics: out of the $N = 1$
operators, we construct the most general ansatz for the $N = 2$ superconformal
generators. By demanding that these generators have the correct operator
product expansion, we fix the values of the coefficients appearing in the ansatz,
which then fixes the structure of the group manifold.

We will now show how to generalize the $N = 1$ SUSY generators,
given by $\{G(z), T(z)\}$, to form the $N = 2$ SUSY generators, given by
$\{G^i(z), T(z), J(z)\}$, where $J(z)$ is the $U(1)$ generator.

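First, we define $G^0 = G_{G/H}$. Second, we define $G^1$ by writing the most
general dimension $\tfrac{3}{2}$ field, which can be constructed from $j^{\bar a}(z)$ and $\hat J^{\bar a}(z)$:

$$
G^1(z) = \frac{2}{k}\left[\, h_{\bar a\bar b}\, j^{\bar a}(z)\,\hat J^{\bar b}(z) - \frac{i}{3k}\, S_{\bar a\bar b\bar c}\, j^{\bar a}(z)\,j^{\bar b}(z)\,j^{\bar c}(z) \,\right],
\tag{5.7.1}
$$

where $h_{\bar a\bar b}$ and $S_{\bar a\bar b\bar c}$ are arbitrary constants, subject only to the condition that
$G^1$ generate the usual $N = 2$ product expansion.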

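We can also construct the $U(1)$ current, given by

$$
J(z) = \frac{i}{k}\, h_{\bar a\bar b}\, j^{\bar a}(z)\,j^{\bar b}(z)
+ \frac{1}{k}\, h_{\bar c\bar d} f_{\bar c\bar d E} \left[\, J^E(z) - \frac{i}{k}\, f_{E\bar a\bar b}\, j^{\bar a}(z)\,j^{\bar b}(z) \,\right].
\tag{5.7.2}
$$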


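The key step is to force these operators to reproduce the correct commutation
relations of the $N = 2$ algebra. We find the following constraints for the
coefficients [24]:

$$
\begin{aligned}
h_{\bar a\bar b} &= -h_{\bar b\bar a}, \qquad h_{\bar a\bar p}\,h_{\bar p\bar b} = -\delta_{\bar a\bar b},\\
h_{\bar a\bar d}\, f_{\bar d\bar b e} &= f_{\bar a\bar d e}\, h_{\bar d\bar b},\\
f_{\bar a\bar b\bar c} &= h_{\bar a\bar p} h_{\bar b\bar q}\, f_{\bar p\bar q\bar c}
 + h_{\bar b\bar p} h_{\bar c\bar q}\, f_{\bar p\bar q\bar a}
 + h_{\bar c\bar p} h_{\bar a\bar q}\, f_{\bar p\bar q\bar b},\\
S_{\bar a\bar b\bar c} &= h_{\bar a\bar p} h_{\bar b\bar q} h_{\bar c\bar r}\, f_{\bar p\bar q\bar r}.
\end{aligned}
\tag{5.7.3}
$$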


The first two conditions state that $h_{\bar a\bar b}$ defines an almost complex structure.
The third and fourth conditions are satisfied if we set

$$
f_{\bar a\bar b\bar c} = S_{\bar a\bar b\bar c} = 0.
\tag{5.7.4}
$$

Given this last condition, we can solve for these constraints. Notice that these 
coefficients, because they determine the way that vectors contract to give other 
vectors and scalars in Eqs. (5.7.1) and (5.7.2), also determine the structure of 
the group manifold itself. Thus, forcing these operators to obey an N = 2 
superconformal algebra places constraints on the group structure of the theory. 
In particular, a close examination of these constraints shows that they give us 
what are called “hermitian symmetric spaces” [24]. 

Fortunately, mathematicians have given us a complete classification of these
spaces [25]. It is then a simple matter to calculate the $c$ for each of these spaces
and show that solutions exist for $c = 9$ without trivially tensoring minimal
models together, as we did in Eq. (5.4.1). Thus, we have successfully found an
"irreducible" $c = 9$ generalization to the "reducible" $c = 9$ theory discussed
in Section 5.4.

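The hermitian symmetric spaces corresponding to $G/H$ that we have found,
with their central charges, are given by the following [24]:

$$
\begin{array}{ll}
\dfrac{SU(n+m)}{SU(m)\otimes SU(n)\otimes U(1)}, & c_{G/H} = \dfrac{3kmn}{k+m+n},\\[10pt]
\dfrac{SO(n+2)}{SO(n)\otimes SO(2)}, & c_{G/H} = \dfrac{3kn}{k+n},\\[10pt]
\dfrac{SO(3)}{SO(2)}, & c_{G/H} = \dfrac{3k}{k+2},\\[10pt]
\dfrac{SO(2n)}{SU(n)\otimes U(1)}, & c_{G/H} = \dfrac{3kn(n-1)}{2(k+2n-2)},\\[10pt]
\dfrac{Sp(n)}{SU(n)\otimes U(1)}, & c_{G/H} = \dfrac{3kn(n+1)}{2(k+n+1)},\\[10pt]
\dfrac{E_6}{SO(10)\otimes U(1)}, & c_{G/H} = \dfrac{48k}{k+12},\\[10pt]
\dfrac{E_7}{E_6\otimes U(1)}, & c_{G/H} = \dfrac{81k}{k+18},
\end{array}
\tag{5.7.5}
$$

for $k = 1, 2, 3, \ldots$ Notice that we have now broken past the $c = 3$ barrier
found in the minimal case.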

Given this complete classification, we can now begin to look for
phenomenologically acceptable solutions to the string equations of motion. Unfortunately,
after compactification to four dimensions, the usual 10-dimensional Type II
string cannot generate gauge groups large enough to include the minimal
$SU(3) \otimes SU(2) \otimes U(1)$. A careful examination of the central charges of the
Type II theory reveals that, after compactification, the complete set of possible
gauge groups does not include the minimal gauge group [26].

The alternative is to examine the heterotic string, which has a much larger 
gauge group, and use a trick exploited in Refs. 18 and 19. This allows us to 
convert a superstring theory into a heterotic one. 

Normally, modular invariant partition functions for the heterotic superstring
are extremely difficult to construct. In fact, for the heterotic string in 10
dimensions, the only modular invariant combinations come from the groups $E_8 \otimes E_8$
and $Spin(32)/Z_2$. The difficulty arises because the left and right movers are
treated differently in the heterotic string, while modular invariance tends to mix
both sectors. Thus, it is a highly nontrivial result that the heterotic string has two
possible modular invariant isospin groups. We may therefore suspect that the
set of modular invariant partition functions for compactified four-dimensional
heterotic strings is extremely restrictive.

Actually, there is a trick to create modular invariant partition functions for 
four-dimensional heterotic strings starting with modular invariant partition 
functions for the ordinary superstring in 10 dimensions. The trick is based 
upon the fact that the modular invariant partition function for the superstring 
is invariant if we make a subtle interchange between fermions and bosons. 
Specifically, by direct calculation, we can show that the modular invariant 
partition function for the superstring is invariant if we make the following 
series of transformations: 

(1) replace the character of $SO(d)$ with $SO(24 + d)$;

(2) exchange the singlets and vectors appearing in the sum in the partition
function; and

(3) reverse the sign of the spinors appearing in the sum.

We also can show that the partition function remains the same if we
interchange $SO(d)$ with $E_8 \otimes SO(8 + d)$ in this fashion. We are particularly
interested in the physical situation where the dimension of the transverse states
is $d = 2$, which leads to a four-dimensional theory. (In the language of
Calabi-Yau manifolds, this complicated series of transformations is identical to setting
the spin connection and gauge connection equal to each other.)

To use this trick, let us start with an ordinary Type II superstring in 10
dimensions, and then compactify it to four dimensions. Because the left- and
right-moving sectors are symmetrical, there is no problem in finding modular
invariant partition functions in two transverse dimensions in the light cone
gauge, that is, $d = 2$. Now let us use the trick of exchanging fermion and boson
sectors for the left-moving sector only. The transverse group $SO(2)$ becomes
$E_8 \otimes SO(10)$ in the left-moving sector, while the right-moving sector remains
the same.

Let us take $l$ copies of the $N = 2$ conformal field theory, so the basic gauge
group is actually $E_8 \otimes SO(10) \otimes U(1)^l$, where this last $U(1)^l$ factor poses
problems at the higher loop level. However, it will turn out that, upon closer
analysis, one of the $U(1)$ factors can combine with $SO(d + 8)$ to produce
$E_{5+d/2}$. In other words, the basic gauge group of the theory is

$$
E_8 \otimes E_{5+d/2} \otimes U(1)^{l-1}.
\tag{5.7.6}
$$

For $d = 2$, we find the desirable gauge group $E_8 \otimes E_6$, multiplied by $U(1)^{l-1}$.
Since the whole point of this discussion is to get rid of the extra $U(1)$ factors,
the obvious choice is to choose just one $N = 2$ representation. In Section
5.4, when we considered tensoring minimal models together, the choice $l = 1$
was forbidden, since one minimal model by itself could not produce a $c = 9$
theory. However, this restriction no longer applies for the hermitian symmetric
case, where $c = 9$ is easy to obtain. Thus, this more general construction
based on hermitian symmetric spaces gives us a phenomenologically desirable
compactification with $E_8 \otimes E_6$, which is large enough to include the minimal
group $SU(3) \otimes SU(2) \otimes U(1)$. Thus, this new method enjoys considerable
advantages over the earlier one based on tensoring minimal models together.

In summary, although modular invariant heterotic partition functions are
notoriously difficult to construct because modular transformations mix up
left- and right-moving sectors, we have constructed a phenomenologically
desirable compactification to four dimensions. Beginning with the ordinary
Type II superstring compactified to four dimensions with transverse
symmetry $SO(2)$, we have flipped fermion and boson sectors in the left movers to
obtain $E_8 \otimes SO(10)$. With $l$ copies of the $N = 2$ theory, we obtained the gauge group
$E_8 \otimes SO(10) \otimes U(1)^l$. We then absorbed one of the $U(1)$ factors into $SO(10)$
to obtain $E_6$, leaving a factor of $U(1)^{l-1}$ behind. However, for the hermitian
symmetric spaces, we have the freedom to choose $l = 1$ and still maintain $c = 9$
(which is impossible if we tensor minimal models together). Thus, we have
arrived at a phenomenologically acceptable compactification. This sequence
of changes can be summarized as

$$
SO(2) \rightarrow E_8 \otimes SO(10) \otimes U(1)^l \rightarrow E_8 \otimes E_6 \otimes U(1)^{l-1} \rightarrow E_8 \otimes E_6.
\tag{5.7.7}
$$


(There is an important physical difference between the three steps that we 
have outlined here. The first and third steps, in some sense, were implemented 
by hand. However, the second step was a consequence of implementing N = 1 
SUSY.) 

Finally, we remark that once all the constraints are in place in the theory, we
find that the coset construction produces groups that fall into the following 10
coset categories:

$$
\begin{array}{ll}
SO(n+2)/SO(n)\otimes SO(2), & (n, k) = (6, 6),\ (12, 4),\\
SO(2n)/SU(n)\otimes U(1), & (n, k) = (7, 2),\\
Sp(n)/SU(n)\otimes U(1), & (n, k) = (3, 4),\\
U(n+m)/U(n)\otimes U(m), & (n, m, k) = (1, 4, 15),\ (1, 5, 9),\ (1, 6, 7),\\
 & \phantom{(n, m, k) = {}} (2, 2, 12),\ (2, 3, 5),\ (3, 3, 3).
\end{array}
\tag{5.7.8}
$$

All of these coset constructions have $c = 9$ and produce $E_8 \otimes E_6$ as the
fundamental gauge group. Notice that all these models are free of the unwanted
$U(1)$ factors, as desired, which makes them phenomenologically attractive.
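As a sanity check on the list above, the central charges can be evaluated with exact fractions. This is a minimal sketch under stated assumptions: the central-charge formulas are those of Eq. (5.7.5), the $U(n+m)/U(n)\otimes U(m)$ entries are assumed to share the formula $3kmn/(k+m+n)$ of the $SU(n+m)$ family, and the $SO(2n)$ family is taken with denominator $SU(n)\otimes U(1)$. All function names are illustrative.

```python
from fractions import Fraction

def c_grassmannian(n, m, k):
    """U(n+m)/U(n)xU(m): assumed to carry c = 3kmn/(k+m+n)."""
    return Fraction(3 * k * m * n, k + m + n)

def c_quadric(n, k):
    """SO(n+2)/SO(n)xSO(2): c = 3kn/(k+n)."""
    return Fraction(3 * k * n, k + n)

def c_so2n(n, k):
    """SO(2n)/SU(n)xU(1): c = 3kn(n-1)/(2(k+2n-2))."""
    return Fraction(3 * k * n * (n - 1), 2 * (k + 2 * n - 2))

def c_sp(n, k):
    """Sp(n)/SU(n)xU(1): c = 3kn(n+1)/(2(k+n+1))."""
    return Fraction(3 * k * n * (n + 1), 2 * (k + n + 1))

def listed_central_charges():
    """Central charges of the ten cosets listed in Eq. (5.7.8)."""
    charges = [c_quadric(n, k) for (n, k) in [(6, 6), (12, 4)]]
    charges.append(c_so2n(7, 2))
    charges.append(c_sp(3, 4))
    charges += [c_grassmannian(n, m, k) for (n, m, k) in
                [(1, 4, 15), (1, 5, 9), (1, 6, 7),
                 (2, 2, 12), (2, 3, 5), (3, 3, 3)]]
    return charges

if __name__ == "__main__":
    for c in listed_central_charges():
        print(c)
```

The check only confirms that each listed entry has $c = 9$; it does not test the additional constraints that select these ten cases from all $c = 9$ solutions.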


5.8 Summary 

In the search for the perturbative vacuum, the Calabi-Yau manifold has
emerged as one of the most attractive choices. Although a Calabi-Yau manifold
is highly nonlinear, what is surprising is that a naive extension of $N = 2$
superconformal models can generate such complex manifolds.

We begin by postulating that, after compactification, the 10-manifold has
compactified to the direct product of a four-dimensional manifold $M_4$ and a
six-dimensional manifold $K$:

$$
M_{10} \rightarrow M_4 \otimes K,
\tag{5.8.1}
$$

where $M_4$ satisfies

$$
R_{\mu\nu\alpha\beta} = \frac{R}{12}\left( g_{\mu\alpha}\, g_{\nu\beta} - g_{\mu\beta}\, g_{\nu\alpha} \right).
\tag{5.8.2}
$$

Additionally, we assume that $N = 1$ space-time supersymmetry is preserved
after compactification.

The key statement is that the survival of $N = 1$ supersymmetry leads to the
vanishing of the variation of the gravitino field

$$
\delta\psi_i = \kappa^{-1} D_i\,\epsilon + \cdots = 0.
\tag{5.8.3}
$$

Thus, $\epsilon$ is a constant covariant spinor. This statement is highly nontrivial, for
it places enormous constraints on the structure of the manifold. In particular,
by taking two covariant derivatives

$$
[D_i, D_j]\,\epsilon \sim R_{ijkl}\,\Gamma^{kl}\,\epsilon = 0,
\tag{5.8.4}
$$

we see that the manifold is Ricci flat.

Furthermore, the constant covariant spinor, which transforms under $SO(6)$,
can be written as a 4 of $SU(4)$. By an $SU(4)$ transformation, the spinor can be
written as

$$
\epsilon = \begin{pmatrix} 0 \\ 0 \\ 0 \\ \epsilon_0 \end{pmatrix},
\tag{5.8.5}
$$

which shows that it has $SU(3)$ holonomy. Thus, $K$ can be shown to be a
Calabi-Yau manifold. In summary, we see that one of the key ingredients of
this procedure was the assumption that $N = 1$ space-time supersymmetry
survives the compactification to four dimensions.

A second consequence of this assumption is that $N = 1$ space-time
supersymmetry naturally leads to a global $N = 2$ superconformal symmetry on the
world sheet. To see this, remember that the $N = 1$ space-time supersymmetry
generators, after compactification, can be written as

$$
\begin{aligned}
Q_{-1/2,\alpha}(z) &= e^{-\phi/2}\, S_\alpha\, \Sigma(z),\\
Q_{-1/2,\dot\alpha}(z) &= e^{-\phi/2}\, S_{\dot\alpha}\, \Sigma^\dagger(z),
\end{aligned}
\tag{5.8.6}
$$

where an additional field $\Sigma$ is required to get the counting correct.

However, one can show that the operator product expansion of two $\Sigma$'s, in
turn, generates a new field $J$:

$$
\Sigma(z)\,\Sigma^\dagger(w) \sim (z-w)^{-3/4}\, I + (z-w)^{1/4}\, \tfrac{1}{2} J(z).
\tag{5.8.7}
$$

This new field $J$ is precisely the $U(1)$ field of the $N = 2$ superconformal theory.
Furthermore, all the generators of the $N = 2$ theory can be successively
generated in this way. Thus, quite miraculously, a hidden $N = 2$ superconformal
symmetry emerges by the assumption of $N = 1$ space-time supersymmetry.





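The $N = 2$ algebra is written as

$$
\begin{aligned}
[L_n, L_m] &= (n - m)\,L_{n+m} + \tfrac{1}{4}\,c\,(n^3 - n)\,\delta_{n+m,0},\\
[L_n, G^i_r] &= \left(\tfrac{n}{2} - r\right) G^i_{n+r},\\
[L_m, J_n] &= -n\,J_{m+n},\\
[J_m, J_n] &= c\,m\,\delta_{m+n,0},\\
[J_m, G^i_n] &= i\epsilon^{ij}\, G^j_{m+n},\\
\{G^i_r, G^j_s\} &= 2\delta^{ij} L_{r+s} + i\epsilon^{ij}(r - s)\,J_{r+s} + c\left(r^2 - \tfrac{1}{4}\right)\delta^{ij}\delta_{r+s,0},
\end{aligned}
\tag{5.8.8}
$$

where the supercurrent $G^i$ is doubled by the index $i$.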

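Just as in the bosonic case, we now construct the Verma modules of the
$N = 2$ theory and look for zeros of the Kac determinant, which reads (for the
NS sector)

$$
\det M^{NS}_{n,m}(c, h, q) = \prod_{1 \le rs \le 2n} \left( f^{NS}_{r,s} \right)^{P_{NS}(n - rs/2,\, m)}
\times \prod_{k \in Z + 1/2} \left( g^{NS}_k \right)^{\tilde P_{NS}(n - |k|,\, m - \mathrm{sgn}(k);\, k)}
\tag{5.8.9}
$$

for $s$ even, where

$$
\begin{aligned}
f^{NS}_{r,s}(c, h, q) &= 2(c - 1)h - q^2 - \tfrac{1}{4}(c - 1)^2 + \tfrac{1}{4}[(c - 1)r + s]^2,\\
g^{NS}_k(c, h, q) &= 2h - 2kq + (c - 1)\left(k^2 - \tfrac{1}{4}\right),
\end{aligned}
\tag{5.8.10}
$$

and where $k \in Z + \tfrac{1}{2}$.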

As in the case with ordinary minimal models, we set the determinant to zero
in order to find the unitary minimal series

$$
c = \frac{3k}{k+2}, \qquad k = 1, 2, \ldots,
\tag{5.8.11}
$$

so that $c < 3$. By examining the values for $h$, we find that the conformal weight
$\Delta_{l,q}$ and the $U(1)$ charge $Q$ of a primary field for the minimal series are labeled
by the integers $l$ and $q$:

$$
\Delta_{l,q} = \frac{l(l+2)}{4(k+2)} - \frac{q^2}{4(k+2)}, \qquad Q = \frac{q}{k+2},
\tag{5.8.12}
$$

for $l = 0, \ldots, k$ and $q = -l, -l+2, \ldots, l$.

The most naive superconformal theory is the tensoring of several of these
minimal theories, each labeled by $k_i$. The central charge is the sum of the
individual central charges, which in turn must equal $\tfrac{3}{2}$ times the dimension of
the manifold $K$, giving us

$$
c = \sum_i \frac{3k_i}{k_i + 2} = \tfrac{3}{2}(10 - D).
\tag{5.8.13}
$$

Remarkably, this naive tensoring gives us an explicit representation of the
Calabi-Yau manifold. If we tensor $l$ identical superconformal field theories,
for example, the resulting theory has a discrete symmetry. Each individual
theory has a discrete symmetry $Z_{k+2}$, and multiplying $l$ identical ones creates
the permutation symmetry among them, which is $S_l$, so the final symmetry
group is

$$
G = S_l \,\tilde\otimes\, Z^l_{k+2}/Z_{k+2},
\tag{5.8.14}
$$

where the tilde represents the semidirect product.

The number of identical fermion generations can also be calculated for this
theory. The $U(1)$ charge for the product theory equals the sum of the $U(1)$
charges of the individual theories, which in turn must be 1. The total number
of fermion generations is thus the total number of sets of integers $\{q_i\}$ that
are solutions to

$$
\sum_i \frac{q_i}{k_i + 2} = 1, \qquad 0 \le q_i \le k_i.
\tag{5.8.15}
$$

One of the simplest theories is $(3^5)$, which has 101 generations.
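The counting for the $(3^5)$ model is small enough to do by brute force. The sketch below assumes the charge condition $\sum_i q_i/(k_i+2) = 1$ with $0 \le q_i \le k_i$ and the minimal-model central charge $3k/(k+2)$; the function names are illustrative, and the enumeration is only practical for a handful of factors.

```python
from fractions import Fraction
from itertools import product

def central_charge(levels):
    """Total central charge of a tensor product of N = 2 minimal models."""
    return sum(Fraction(3 * k, k + 2) for k in levels)

def count_charge_solutions(levels):
    """Count integer sets {q_i}, 0 <= q_i <= k_i, with sum q_i/(k_i+2) = 1."""
    count = 0
    for qs in product(*(range(k + 1) for k in levels)):
        if sum(Fraction(q, k + 2) for q, k in zip(qs, levels)) == 1:
            count += 1
    return count

if __name__ == "__main__":
    levels = (3, 3, 3, 3, 3)  # the (3^5) model
    print(central_charge(levels))
    print(count_charge_solutions(levels))
```

For five copies of $k = 3$ the condition reduces to $q_1 + \cdots + q_5 = 5$ with each $q_i \le 3$, and the total central charge is $5 \times 9/5 = 9$, as required for $D = 4$.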

Now, compare these superconformal theories to Calabi-Yau manifolds. One
of the simplest is $CP^4$, subject to

$$
\sum_{i=1}^{5} z_i^5 = 0.
\tag{5.8.16}
$$

This surface enjoys the global symmetry

$$
G = S_5 \,\tilde\otimes\, Z_5^5/Z_5
\tag{5.8.17}
$$

and has 101 fermion generations, which strongly suggests that the $CP^4$ and the
$(3^5)$ theories are the same. Similarly, a large number of one-to-one
correspondences can be made between minimal superconformal theories and Calabi-Yau
manifolds.

As surprising as this construction is, its main problem is the presence of
unwanted $U(1)$ factors in the low-energy symmetry. To verify many of the steps
in this construction, such as constructing the characters $\chi^{l(s)}_q$ and the partition
functions of the theory, it is important to have a specific representation of this
$N = 2$ algebra.

One of the most powerful is the parafermion representation. We assume
the existence of a $\psi_l$ parafermion used in constructing $Z_k$ models in statistical
mechanics ($Z_2$ is the Ising model). Then, the superconformal generators can
be written in terms of the parafermion field and a free boson

$$
G^{+}(z) \propto \psi_1(z) : e^{i a\phi(z)} : , \qquad
G^{-}(z) \propto \psi^\dagger_1(z) : e^{-i a\phi(z)} : , \qquad
J(z) \propto \partial_z\phi(z),
\tag{5.8.18}
$$

where $a = \sqrt{(k+2)/k}$. If we calculate the central charge of this
representation, we find

$$
c = c_\phi + c_\psi = 1 + \frac{2(k-1)}{k+2} = \frac{3k}{k+2},
\tag{5.8.19}
$$

which confirms that we have a representation of the minimal series. Then,
by introducing another parafermion field $\psi^l_m$, we can explicitly construct a
representation of the primary fields

$$
V^l_m = \psi^l_m : e^{i\alpha_m\phi(z)} : .
\tag{5.8.20}
$$
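Two pieces of arithmetic behind the parafermion representation can be checked exactly. This is a sketch under stated assumptions: the supercurrents are taken to be $\psi_1$ (or its conjugate) times the exponential $: e^{\pm i a\phi} :$ with $a = \sqrt{(k+2)/k}$, the weight of $\psi_1$ is $(k-1)/k$, and the parafermion central charge is $2(k-1)/(k+2)$. Function names are illustrative.

```python
from fractions import Fraction

def supercurrent_weight(k):
    """Weight of psi_1 times exp(+-i a phi): (k-1)/k + a^2/2 with a^2 = (k+2)/k."""
    delta_1 = Fraction(k - 1, k)
    a_squared = Fraction(k + 2, k)
    return delta_1 + a_squared / 2

def total_central_charge(k):
    """Free boson (c = 1) plus Z_k parafermions (c = 2(k-1)/(k+2))."""
    return 1 + Fraction(2 * (k - 1), k + 2)

if __name__ == "__main__":
    for k in (1, 2, 3, 10):
        print(k, supercurrent_weight(k), total_central_charge(k))
```

The weight comes out as 3/2 for every $k$, which is what a supercurrent requires, and the central charges add up to $3k/(k+2)$.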

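Last, with more work, we can construct the characters of this representation:

$$
\chi^{l(s)}_m(\tau, z, u) = \sum_{j \bmod k} c^l_{m+4j-s}(\tau)\,
\Theta_{2m+(4j-s)(k+2),\,2k(k+2)}(\tau, 2kz, u),
\tag{5.8.21}
$$

where

$$
\Theta_{n,m}(\tau, z, u) = e^{-2\pi i u} \sum_{j \in Z + n/2m} e^{2\pi i\tau m j^2 + 2\pi i j z}.
\tag{5.8.22}
$$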

jeZ+n/2m 


Yet another compactification scheme is given by relaxing the condition of
tensoring minimal models with $c < 3$. The advantage of this new scheme
is that we avoid unnecessary factors of $U(1)$ symmetry that persist in the
naive tensoring of minimal models. This new scheme begins with an $N = 1$
superconformal coset theory, and then it postulates that the $N = 2$ generators
can be constructed from the $N = 1$ currents as follows:

$$
G^1(z) = \frac{2}{k}\left[\, h_{\bar a\bar b}\, j^{\bar a}(z)\,\hat J^{\bar b}(z) - \frac{i}{3k}\, S_{\bar a\bar b\bar c}\, j^{\bar a}(z)\,j^{\bar b}(z)\,j^{\bar c}(z) \,\right],
\tag{5.8.23}
$$

where $h_{\bar a\bar b}$ and $S_{\bar a\bar b\bar c}$ are arbitrary constants, subject to the condition that they
generate the usual $N = 2$ product expansion operators. We also have the $U(1)$
current given by

$$
J(z) = \frac{i}{k}\, h_{\bar a\bar b}\, j^{\bar a}(z)\,j^{\bar b}(z)
+ \frac{1}{k}\, h_{\bar c\bar d} f_{\bar c\bar d E} \left[\, J^E(z) - \frac{i}{k}\, f_{E\bar a\bar b}\, j^{\bar a}(z)\,j^{\bar b}(z) \,\right].
\tag{5.8.24}
$$

By forcing the above operators to generate the $N = 2$ superconformal series,
we find that the constants define an almost complex structure and that

$$
f_{\bar a\bar b\bar c} = S_{\bar a\bar b\bar c} = 0.
\tag{5.8.25}
$$

The solution to these constraints gives us hermitian symmetric spaces, which
can produce $c = 9$ with just one irreducible representation, without tensoring
several of them together.

For the heterotic string, we can tensor $l$ of these representations and find
the gauge group $E_8 \otimes SO(10) \otimes U(1)^l$, where the last factors are
unwanted because they cause problems at the loop level. If $l = 1$, then we can
eliminate the last $U(1)$ by combining this factor with $SO(10)$, giving us $E_6$.
For the minimal models considered earlier, this choice is not possible since
one minimal model cannot give us a $c = 9$ theory. However, for hermitian symmetric
spaces, there is no problem in getting $c = 9$ with just one copy, that is, $l = 1$. In
summary, by this procedure we have now produced the gauge group $E_8 \otimes E_6$,
which is large enough to contain the minimal group $SU(3) \otimes SU(2) \otimes U(1)$.

Last, we mention that the groups appearing among the hermitian symmetric
spaces, once all the constraints are taken into consideration, are given by

$$
\begin{array}{ll}
SO(n+2)/SO(n)\otimes SO(2), & (n, k) = (6, 6),\ (12, 4),\\
SO(2n)/SU(n)\otimes U(1), & (n, k) = (7, 2),\\
Sp(n)/SU(n)\otimes U(1), & (n, k) = (3, 4),\\
U(n+m)/U(n)\otimes U(m), & (n, m, k) = (1, 4, 15),\ (1, 5, 9),\ (1, 6, 7),\\
 & \phantom{(n, m, k) = {}} (2, 2, 12),\ (2, 3, 5),\ (3, 3, 3).
\end{array}
\tag{5.8.26}
$$


References 


1. E. Cremmer and J. Scherk, Nucl. Phys. B108, 409 (1976); B118, 61 (1977). 

2. K. S. Narain, Phys. Lett. 169B, 41 (1986). 

3. W. Lerche, D. Lust, and A. N. Schellekens, Nucl. Phys. B287, 477 (1987).

4. H. Kawai, D. C. Lewellen, and S.-H. Tye, Phys. Rev. Lett. 57, 1832 (1986); Phys.
Rev. D34, 3794 (1986); Nucl. Phys. B288, 1 (1987).

5. I. Antoniadis, C. P. Bachas, and C. Kounnas, Nucl. Phys. B289, 87 (1987).

6. L. Dixon, J. Harvey, C. Vafa, and E. Witten, Nucl. Phys. B261, 651 (1985); B274,
285 (1986).

7. P. Candelas, G. Horowitz, A. Strominger, and E. Witten, Nucl. Phys. B285, 56
(1985).

8. S. T. Yau, Proc. Natl. Acad. Sci. 74, 1978 (1977).

9. A. Strominger and E. Witten, Comm. Math. Phys. 101, 341 (1985); A. Strominger, 
Phys. Rev. Lett. 55, 2547 (1985). 

10. P. Candelas and S. Kalara, Nucl. Phys. B298, 357 (1988); P. Candelas, Nucl. 
Phys. B298, 458 (1988). 

11. S. T. Yau, in Proceedings of the Argonne Symposium on Anomalies, Geometry 
and Topology, World Scientific, Singapore (1985). 

12. D. Friedan, A. Kent, S. Shenker, and E. Witten, unpublished. 

13. C. M. Hull and E. Witten, Phys. Lett. 160B, 398 (1985). 

14. T. Banks, L. J. Dixon, D. Friedan, and E. Martinec, Nucl. Phys. B299, 613 (1988).

15. A. Sen, Nucl. Phys. B278, 289 (1986); Nucl. Phys. B284, 423 (1987). 






16. W. Boucher, D. Friedan, and A. Kent, Phys. Lett. B172, 316 (1986).

17. S. Nam, Phys. Lett. 172B, 323 (1986).

18. D. Gepner, Phys. Lett. 199B, 380 (1987); Nucl. Phys. B296, 380, 757 (1987);
Nucl. Phys. B311, 191 (1988-1989).

19. D. Gepner, in Proceedings of the Spring School on Superstrings, Trieste, Italy,
1989.

20. A. B. Zamolodchikov and V. A. Fateev, Soviet Phys. JETP 62, 215 (1985); Soviet
Phys. JETP 63, 912 (1986).

21. D. Gepner and Z. Qiu, Nucl. Phys. B285, 423 (1987).

22. J. J. Atick, L. J. Dixon, and A. Sen, Nucl. Phys. B292, 109 (1987).

23. M. Dine, I. Ichinose, and N. Seiberg, Nucl. Phys. B292, 253 (1987).

24. Y. Kazama and H. Suzuki, Phys. Lett. 216B, 112 (1989); Nucl. Phys. B321, 232
(1989).

25. J. A. Wolf, in Symmetric Space, Dekker, New York (1972); Spaces of Constant
Curvature, Publish or Perish, Berkeley (1984).

26. L. Dixon, V. Kaplunovsky, and C. Vafa, Nucl. Phys. B294, 43 (1987). 



CHAPTER 6 


Yang-Baxter Relation 


6.1 Statistical Mechanics and Critical Exponents 

Throughout the previous chapters, we have seen the close relationship between 
conformal field theory and two-dimensional statistical mechanics. In fact, at 
criticality, the detailed behavior of a statistical mechanical system gets washed 
out, and universality sets in. Since we have a complete classification of certain 
classes of conformal field theories, we should be able to catalog the models 
of statistical mechanics at criticality according to known representations of 
conformal field theories. 

Before we proceed with a discussion of classifying conformal field theories, 
it thus becomes important to analyze this remarkable relationship in more 
detail. In this chapter, we make this relationship between conformal field theory 
and critical systems explicit, and we also point out why these statistical 
mechanical models are exactly solvable: they satisfy the Yang-Baxter 
relation [1,2]. Surprisingly, we will see the Yang-Baxter relation crop up 
in numerous other ways in later chapters, such as in our discussion of knot 
theory and Chern-Simons Yang-Mills theory. 

Let us begin our discussion of statistical mechanics by making a few basic 
definitions [3-6]. When analyzing the properties of a solid, liquid, or gas, the 
starting point of our discussion will be the Boltzmann partition function 


$$ Z = \sum_n \exp\left[ -\frac{E(n)}{kT} \right], \tag{6.1.1} $$


where E(n) represents the energy of the nth state, k represents the Boltzmann 
constant, and T represents the temperature. 





Notice the similarity between this partition function and the generating 
functional found in relativistic quantum field theory 

$$ Z = \int D\phi \, \exp\left[ i \int L(\phi)\, d^4x \right]. \tag{6.1.2} $$

There is a correspondence between these two widely divergent formalisms 
if we take the Euclidean version of quantum field theory (so that the factor 
of $i$ in the exponent becomes $-1$). In particular, the high-(low-)temperature 
limit found in statistical mechanics corresponds to the weak (strong) coupling 
limit of quantum field theory. 

The fundamental quantity we wish to calculate for any statistical mechanical 
system is called the free energy, and it is defined by 

$$ F = -kT \ln Z. \tag{6.1.3} $$


In addition, the statistical average of any observable X is given by 

$$ \langle X \rangle = Z^{-1} \sum_n X(n) \exp\left[ -\frac{E(n)}{kT} \right]. \tag{6.1.4} $$
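The three definitions above, Eqs. (6.1.1), (6.1.3), and (6.1.4), can be tried out on any system with a finite list of states. The following is a minimal sketch; the function name `boltzmann_stats` and the two-state example (a single spin in a field, with E = -H*sigma) are illustrative assumptions, not taken from the text.

```python
import math

def boltzmann_stats(energies, observable, kT):
    """Return (Z, F, <X>) for a finite list of states.

    energies[n] is E(n) and observable[n] is X(n) for the nth state.
    """
    weights = [math.exp(-E / kT) for E in energies]
    Z = sum(weights)                                   # Eq. (6.1.1)
    F = -kT * math.log(Z)                              # Eq. (6.1.3)
    avg = sum(x * w for x, w in zip(observable, weights)) / Z  # Eq. (6.1.4)
    return Z, F, avg

# Hypothetical example: one spin sigma = +1 or -1 with energy E = -H * sigma.
H, kT = 0.5, 1.0
spins = [+1, -1]
Z, F, m = boltzmann_stats([-H * s for s in spins], spins, kT)
```

For this toy system the sums can be done by hand: Z = 2 cosh(H/kT) and the average spin is tanh(H/kT), which gives a quick check on the code.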


We say that a two-dimensional statistical model is exactly solvable if we can 
solve for an explicit expression for the free energy. There are remarkably few 
exactly solvable two-dimensional models, such as the Ising model, ferroelectric 
six-vertex model, eight-vertex model, three-spin model, and hard hexagon 
model. 

Let us say that we have a collection of spins $\sigma_i$ arranged in some regular 
two-dimensional lattice. Then, define the correlation between the $i$th and $j$th 
spin as 

$$ g_{ij} = \langle \sigma_i \sigma_j \rangle - \langle \sigma_i \rangle \langle \sigma_j \rangle. \tag{6.1.5} $$

In general, we find that the function $g_{ij}$ will depend on the distance $x$ separating 
the two spins, and at large distances, it will behave like some decreasing power of 
$x$ multiplied by some exponential 

$$ g_{ij} \sim x^{-\tau} e^{-x/\xi}, \tag{6.1.6} $$

where $\xi$ is called the correlation length. 

At the critical temperature, we find that the correlation length becomes 
infinite, that is, the system loses all dependence on any fundamental length 
scale, so the correlation function exhibits a power behavior 

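$$ g_{ij} \sim x^{-d+2-\eta} = x^{-2\Delta}, \tag{6.1.7} $$

where $\eta$ is called a critical exponent, $d$ is the dimension of the lattice, and 
$\Delta$ is the conformal weight of the field. 
Likewise, one can also define the "energy operator" as a product of two fields, 
$\epsilon_n = \sigma_n \sigma_{n+1}$, whose critical behavior is governed by another critical exponent 
$\nu$: 

$$ \langle \epsilon_n \epsilon_0 \rangle \sim x^{-2(d - 1/\nu)}. \tag{6.1.8} $$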





For the Ising model, we can actually compute these critical exponents for the 
spin field and the energy field, and we find 

$$ \eta = \tfrac{1}{4}, \qquad \nu = 1. \tag{6.1.9} $$

Because $\Delta$, the conformal weight of the field, can be written as $h + \bar{h}$ for 
the minimal model, we can write the correspondence between Ising fields and 
the minimal model for $m = 3$: 

$$ \sigma \leftrightarrow \phi_{1/16,\,1/16}, \qquad \epsilon \leftrightarrow \phi_{1/2,\,1/2}. \tag{6.1.10} $$
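The identification can be checked with exact arithmetic. The sketch below assumes that the power laws for the spin and energy correlators are both read as x^(-2*Delta) with Delta = h + hbar and h = hbar; the variable names are illustrative.

```python
from fractions import Fraction

d = 2                                   # lattice dimension
eta, nu = Fraction(1, 4), Fraction(1)   # Ising exponents, Eq. (6.1.9)

# Spin field: x^(-d+2-eta) = x^(-2*Delta)  =>  Delta = (d - 2 + eta) / 2
delta_spin = (d - 2 + eta) / 2
# Energy field: x^(-2(d - 1/nu)) = x^(-2*Delta)  =>  Delta = d - 1/nu
delta_energy = d - 1 / nu

# With h = hbar, each conformal weight is half of Delta.
h_spin = delta_spin / 2
h_energy = delta_energy / 2
```

The results, h = 1/16 for the spin field and h = 1/2 for the energy field, are the two nontrivial weights of the m = 3 minimal model quoted in Eq. (6.1.10).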

Because the correlation length becomes infinite at criticality, the properties 
of the system can be roughly described by the critical exponents for various 
physical quantities. 

For systems with a magnetic field, for example, the magnetization M is 
defined to be the average of the magnetic moment per site 

$$ M(H, T) = N^{-1} \langle \sigma_1 + \cdots + \sigma_N \rangle, \tag{6.1.11} $$

where $T$ is the temperature of the system, $H$ is proportional to the magnetic 
field, and the energy is given by 

$$ E = E_0 - H \sum_n \sigma_n, \tag{6.1.12} $$

where $E_0$ is the energy in the zero-field limit $H = 0$. 

In the limit that $N \to \infty$, we can describe the magnetization as 

$$ M(H, T) = -\frac{\partial}{\partial H} f(H, T), \tag{6.1.13} $$

where $f = F/N$ is the free energy per site, because taking the derivative with 
respect to $H$ simply brings down $\sigma_n$ into the 
sum. The susceptibility of a magnet is then defined as: 


$$ \chi(H, T) = \frac{\partial M(H, T)}{\partial H}. \tag{6.1.14} $$


Using this formalism, we will make contact with the conformal field theories 
described in previous chapters. 


6.2 One-Dimensional Ising Model 

Let us first solve the simplest system, the one-dimensional Ising model. This 
theory is not very physical because it exhibits no phase transition at all, but it has 
many of the mathematical ingredients useful for more complicated systems. 

We begin by placing a series of spins $\sigma_i$ along a line, each of which can take the 
values $\pm 1$. The energy of the system can be described as 

$$ E(\sigma) = -J \sum_{j=1}^{N} \sigma_j \sigma_{j+1} - H \sum_{j=1}^{N} \sigma_j, \tag{6.2.1} $$





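where we assume that the $j$th spin only interacts with its nearest neighbors at 
the $j - 1$ and $j + 1$ sites, and we impose periodic boundary conditions, 
$\sigma_{N+1} = \sigma_1$. Then, the partition function can be written as 

$$ Z_N = \sum_{\sigma} \exp\left( K \sum_{j=1}^{N} \sigma_j \sigma_{j+1} + h \sum_{j=1}^{N} \sigma_j \right), \tag{6.2.2} $$

where we have rescaled the parameters via $K = J/kT$ and $h = H/kT$. 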

Let us make a most important observation about this system, by rewriting 
the partition function as a sum over a series of matrices 

$$ Z_N = \sum_{\sigma} V(\sigma_1, \sigma_2) V(\sigma_2, \sigma_3) \cdots V(\sigma_{N-1}, \sigma_N) V(\sigma_N, \sigma_1), \tag{6.2.3} $$


where 


$$ V(\sigma, \sigma') = \exp\left[ K \sigma \sigma' + \frac{h}{2} (\sigma + \sigma') \right]. \tag{6.2.4} $$


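Now, let us regard the elements $V(\sigma, \sigma')$ as the entries of a two-by-two matrix 
$V$, called the transfer matrix, whose rows and columns are labeled by whether the 
spins are $+1$ or $-1$, that is, 

$$ V = \begin{pmatrix} V(+,+) & V(+,-) \\ V(-,+) & V(-,-) \end{pmatrix} = \begin{pmatrix} e^{K+h} & e^{-K} \\ e^{-K} & e^{K-h} \end{pmatrix}. \tag{6.2.5} $$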


Therefore, the partition function can now be succinctly rewritten as 


$$ Z_N = \mathrm{Tr}\, V^N. \tag{6.2.6} $$
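The statement that the sum over spin configurations collapses into a trace can be checked directly on a short chain. The sketch below compares a brute-force sum over all 2^N configurations of a periodic chain with the trace of the Nth power of the two-by-two transfer matrix; the helper names are illustrative, and the values of K and h are arbitrary.

```python
import itertools
import math

def brute_force_Z(N, K, h):
    """Sum exp(K * sum s_j s_{j+1} + h * sum s_j) over all periodic chains."""
    total = 0.0
    for s in itertools.product([+1, -1], repeat=N):
        bond = sum(s[j] * s[(j + 1) % N] for j in range(N))
        total += math.exp(K * bond + h * sum(s))
    return total

def transfer_matrix(K, h):
    """Two-by-two transfer matrix; row/column 0 is spin +1, 1 is spin -1."""
    return [[math.exp(K + h), math.exp(-K)],
            [math.exp(-K), math.exp(K - h)]]

def matmul2(A, B):
    return [[sum(A[i][k] * B[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def trace_power(V, N):
    """Return Tr V^N by repeated multiplication."""
    P = [[1.0, 0.0], [0.0, 1.0]]
    for _ in range(N):
        P = matmul2(P, V)
    return P[0][0] + P[1][1]

Z_sum = brute_force_Z(6, 0.3, 0.2)
Z_trace = trace_power(transfer_matrix(0.3, 0.2), 6)
```

The two numbers agree to rounding error, while the cost of the trace grows only linearly with N instead of exponentially.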


On one hand, we have done nothing. We have merely reshuffled the summation 
within $Z_N$ by rewriting it as a sum over the two-by-two transfer matrix $V$. 
On the other hand, we have made an enormous conceptual difference, because 
we can now diagonalize the transfer matrix in terms of its eigenvalues, that is, 
there exists a matrix $P$ that diagonalizes $V$: 

$$ V = P \begin{pmatrix} \lambda_1 & 0 \\ 0 & \lambda_2 \end{pmatrix} P^{-1}. \tag{6.2.7} $$

Substituting this into our original expression for the partition function, we 
now find 

$$ Z_N = \mathrm{Tr} \begin{pmatrix} \lambda_1 & 0 \\ 0 & \lambda_2 \end{pmatrix}^N = \lambda_1^N + \lambda_2^N. \tag{6.2.8} $$



Let $\lambda_1$ be the larger of the two eigenvalues, which will then dominate the sum 
in the limit as $N \to \infty$. We then have 

$$ f(H, T) = -kT \lim_{N \to \infty} N^{-1} \ln Z_N = -kT \ln \lambda_1 = -kT \ln\left[ e^{K} \cosh h + \sqrt{e^{2K} \sinh^2 h + e^{-2K}} \right]. \tag{6.2.9} $$





In addition to having an exact expression for the free energy, we also have 
an exact expression for the magnetization 


$$ M(H, T) = \frac{e^{K} \sinh h}{\sqrt{e^{2K} \sinh^2 h + e^{-2K}}}. \tag{6.2.10} $$
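As a consistency check, the magnetization formula should follow from the free energy per site by differentiating with respect to the field. The sketch below assumes units with kT = 1 (so that H = h) and the sign convention M = -df/dH; the function names are illustrative.

```python
import math

def lambda_max(K, h):
    """Larger eigenvalue of the transfer matrix of the 1D Ising chain."""
    return (math.exp(K) * math.cosh(h)
            + math.sqrt(math.exp(2 * K) * math.sinh(h) ** 2 + math.exp(-2 * K)))

def free_energy_per_site(K, h):
    """f in units of kT, i.e. f = -ln(lambda_max)."""
    return -math.log(lambda_max(K, h))

def magnetization(K, h):
    """Closed-form magnetization of the 1D Ising chain."""
    root = math.sqrt(math.exp(2 * K) * math.sinh(h) ** 2 + math.exp(-2 * K))
    return math.exp(K) * math.sinh(h) / root

def magnetization_numeric(K, h, eps=1e-6):
    """M = -df/dh estimated with a central finite difference."""
    return -(free_energy_per_site(K, h + eps)
             - free_energy_per_site(K, h - eps)) / (2 * eps)
```

Two limits are easy to read off: at h = 0 the magnetization vanishes for every finite K, which is the absence of spontaneous order, and at K = 0 the formula reduces to tanh(h), the result for independent spins.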


This is a truly remarkable result, first found by Ising, who also proposed the 
model in 1925 [7]. Because we have an analytic expression for the free energy 
in terms of K and h, it shows that the model is exactly solvable. Moreover, it 
also shows, unfortunately, that the system does not exhibit any phase transition 
for any positive temperature. 

Because we have an exact expression for the transfer matrix, we can now 
solve for the correlation length and show that it goes to infinity when 
$H = T = 0$, although this latter point is not a critical point. To do this, we need to 
calculate the averages $\langle \sigma_i \rangle$ and $\langle \sigma_i \sigma_j \rangle$. We begin by defining the matrix $S$ in 
spin space as 

$$ S = \begin{pmatrix} 1 & 0 \\ 0 & -1 \end{pmatrix}, \tag{6.2.11} $$

which has elements 

$$ S(\sigma, \sigma') = \sigma\, \delta(\sigma, \sigma'). \tag{6.2.12} $$

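Therefore, the average can be written as 

$$ \langle \sigma_1 \sigma_3 \rangle = Z_N^{-1} \sum_{\sigma} \sigma_1 V(\sigma_1, \sigma_2) V(\sigma_2, \sigma_3) \sigma_3 \cdots = Z_N^{-1}\, \mathrm{Tr}\, S V^2 S V^{N-2}. \tag{6.2.13} $$

So, 

$$ \langle \sigma_i \sigma_j \rangle = Z_N^{-1}\, \mathrm{Tr}\, S V^{j-i} S V^{N+i-j}, \qquad \langle \sigma_i \rangle = Z_N^{-1}\, \mathrm{Tr}\, S V^N. \tag{6.2.14} $$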


Now let the matrix $P$, which diagonalizes the transfer matrix, be parametrized 
by an angle $\phi$: 

$$ P = \begin{pmatrix} \cos\phi & -\sin\phi \\ \sin\phi & \cos\phi \end{pmatrix}. \tag{6.2.15} $$


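Then, in the limit $N \to \infty$, we have 

$$ g_{ij} = \langle \sigma_i \sigma_j \rangle - \langle \sigma_i \rangle \langle \sigma_j \rangle = \cos^2 2\phi + \sin^2 2\phi \, (\lambda_2/\lambda_1)^{j-i} - \cos^2 2\phi = \sin^2 2\phi \, (\lambda_2/\lambda_1)^{j-i}. \tag{6.2.16} $$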


So, we have the desired result 

$$ \xi = [\ln(\lambda_1/\lambda_2)]^{-1}, \tag{6.2.17} $$

which tends to $\infty$ as $H, T \to 0$. Thus, all reference to a mass scale has 
disappeared. 
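The growth of the correlation length at low temperature is easy to see numerically. The sketch below evaluates the two eigenvalues of the transfer matrix in closed form and takes the correlation length to be the inverse of ln(lambda_1/lambda_2); the function names are illustrative, and K = J/kT is assumed positive.

```python
import math

def eigenvalues(K, h):
    """Both eigenvalues of the 1D Ising transfer matrix, larger one first."""
    a = math.exp(K) * math.cosh(h)
    r = math.sqrt(math.exp(2 * K) * math.sinh(h) ** 2 + math.exp(-2 * K))
    return a + r, a - r

def correlation_length(K, h):
    """xi = 1 / ln(lambda_1 / lambda_2), in units of the lattice spacing."""
    lam1, lam2 = eigenvalues(K, h)
    return 1.0 / math.log(lam1 / lam2)

# Zero field, decreasing temperature (increasing K = J/kT).
lengths = [correlation_length(K, 0.0) for K in (0.5, 1.0, 2.0, 3.0)]
```

At h = 0 the eigenvalues are 2 cosh K and 2 sinh K, so the correlation length is 1/ln(coth K), which grows roughly like exp(2K)/2 as K becomes large but stays finite at any nonzero temperature.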





6.3 Two-Dimensional Ising Model 

Now that we have some experience using the transfer matrix technique, let us 
tackle a nontrivial problem, the two-dimensional Ising model [8]. 

We place the spins $\sigma_i$ on a two-dimensional lattice, except that the lattice 
sites are placed diagonally. Let $W$ and $V$ represent the lattice sites that are 
arranged along a horizontal line. The sites along $W$ and $V$ alternate as we 
descend the lattice. Our strategy is to rewrite the partition function once again 
in terms of transfer matrices, except that we will perform the sums over spins 
in a particular fashion. 

First, we will sum the spins horizontally, which will give us expressions for 
$W$ and $V$. Then, we will sum the lattice vertically, which will give us sums 
over the product $VWVW \cdots$. 

To sum the lattice sites horizontally, let $\phi = \{\sigma_1, \sigma_2, \ldots, \sigma_n\}$, that is, the 
lattice sites arranged horizontally along the top of Fig. 6.1, and let $\phi'$ be the 
lattice sites arranged below them. Then, define $W$ and $V$ as follows: 


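$$ V_{\phi,\phi'} = \exp\left[ \sum_{j=1}^{n} \left( K \sigma_{j+1} \sigma'_j + L \sigma_j \sigma'_j \right) \right], \qquad W_{\phi,\phi'} = \exp\left[ \sum_{j=1}^{n} \left( K \sigma_j \sigma'_j + L \sigma_j \sigma'_{j+1} \right) \right], \tag{6.3.1} $$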


where $W$ and $V$ are now $2^n \times 2^n$ matrices. As before, we can perform the sum 
over the two transfer matrices by summing vertically over the lattice: 

$$ Z = \mathrm{Tr}\, (VW)^{m/2} = \sum_{i=1}^{2^n} \Lambda_i^m, \tag{6.3.2} $$

where the $\Lambda_i^2$ are the eigenvalues of $VW$. In the thermodynamic limit, as we let 
the number of points $n, m \to \infty$, 
the partition function is once again dominated by the largest eigenvalue of the 
transfer matrix $VW$: 

$$ \lim_{n,m \to \infty} Z \sim (\Lambda_{\max})^m. $$

We see that the one-dimensional and two-dimensional Ising models are therefore 
closely related to each other and that the calculation of the free energy 
reduces to calculating the largest eigenvalue of the transfer matrix. 

The actual solution for the free energy in the continuum limit, however, is 
quite involved for the two-dimensional Ising model, so we will just present the 
final result. Define a function 

$$ F(\theta) = \ln\left\{ 2 \left[ \cosh 2K \cosh 2L + k^{-1} \left( 1 + k^2 - 2k \cos 2\theta \right)^{1/2} \right] \right\}, \tag{6.3.3} $$

where $k = (\sinh 2K \sinh 2L)^{-1}$. 






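Then, the largest eigenvalue can be written as 

$$ \ln \Lambda_{\max} = \frac{1}{2} \sum_{l=1}^{n} F(\theta_l), \tag{6.3.4} $$

where the $\theta_l$ are $n$ evenly spaced angles between $0$ and $\pi$. 
In the thermodynamic limit, the summation over the evenly spaced $\theta_l$ becomes 
an integral, so we can write [6]: 

$$ f = -\frac{kT}{2\pi} \int_0^{\pi} F(\theta)\, d\theta. \tag{6.3.5} $$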

This is our final result for the free energy of the two-dimensional Ising model. 
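The integral for the free energy is straightforward to evaluate numerically. The sketch below uses a simple midpoint rule and assumes the standard definition k = 1/(sinh 2K sinh 2L) for the modulus appearing in F(theta); the function names and the number of integration points are illustrative choices.

```python
import math

def F(theta, K, L):
    """Integrand of the free energy, with k = 1 / (sinh 2K * sinh 2L) assumed."""
    k = 1.0 / (math.sinh(2 * K) * math.sinh(2 * L))
    root = math.sqrt(max(0.0, 1.0 + k * k - 2.0 * k * math.cos(2.0 * theta)))
    return math.log(2.0 * (math.cosh(2 * K) * math.cosh(2 * L) + root / k))

def minus_f_over_kT(K, L, n=20000):
    """Return -f/kT = (1 / 2 pi) * integral of F from 0 to pi (midpoint rule)."""
    dtheta = math.pi / n
    total = sum(F((j + 0.5) * dtheta, K, L) for j in range(n))
    return total * dtheta / (2.0 * math.pi)

# Isotropic lattice at its critical coupling, sinh(2 K_c) = 1.
K_c = 0.5 * math.log(1.0 + math.sqrt(2.0))
critical_value = minus_f_over_kT(K_c, K_c)
```

At the isotropic critical point the integral can be done in closed form, giving -f/kT = (ln 2)/2 + 2G/pi, where G is Catalan's constant, or about 0.92970. At very low temperature the result approaches 2K, the ground-state energy of two bonds per site.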


6.4 RSOS and Other Models 

There are a number of models [2-6] that generalize the behavior of the Ising 
model and are exactly solvable. More important, there are a number of models 
that, although they may not be exactly solvable, exhibit critical behavior that 
can be described by the known conformal field theories. Let us list some of 
these models and their properties. 

Spherical Model 

One defect of the Ising model is that it is only solvable in the zero external 
field limit, which is a feature of many ferromagnetic models. However, one 
model that can be solved exactly, even in the presence of a field, is the spherical 
model. 

This model is similar to the Ising model, except for several important differences. 
The spin $\sigma_i$ can take on real values, not just $+1$ and $-1$, and it can 
interact with all spins in the lattice, subject to the constraint 

$$ \sum_{j=1}^{N} \sigma_j^2 = N. \tag{6.4.1} $$





The partition function is now replaced by an integral (not a sum) of the Ising 
model’s partition function, with a delta function insertion that guarantees the 
constraint 

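$$ Z_N = \int_{-\infty}^{\infty} \cdots \int_{-\infty}^{\infty} d\sigma_1 \cdots d\sigma_N \, \exp\left[ K \sum_{(j,l)} \sigma_j \sigma_l + h \sum_j \sigma_j \right] \delta\left( N - \sum_j \sigma_j^2 \right). \tag{6.4.2} $$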

This model may be criticized because it is unphysical, that is, the constraint implies a 
coupling between all spins on the lattice, no matter how far apart they are. 
However, this model is exactly solvable and exhibits normal phase transitions, 
despite its unphysical appearance. This puzzling result has been explained: the 
spherical model has been shown to be a special limiting case (the limit $n \to \infty$) 
of the $n$-vector model with only nearest-neighbor interactions. 


Ice-Type, Six-Vertex Model 

The ice-type model was introduced to model the behavior of ferroelectrics. 
It differs from the usual Ising model, whose spins are located at the lattice 
sites, because the energy of the ice-type model is defined on the links or edges 
connecting the sites, not on the sites themselves. 

Ice-type models may describe the behavior of crystals with hydrogen bonds, 
such as ice, with oxygen and hydrogen atoms within the water molecule. We assume 
that the molecule is placed at each lattice site and that the edges represent 
the electric dipole field, and hence, the model represents a ferroelectric. 

In general, since the electric dipole can assume two directions along the 
edges, there are $2^4 = 16$ ways in which arrows can be placed along the edges, 
arranged around a lattice site. To be more concrete, we assume the ice rule, 
which states that there must be two arrows going into, and two arrows going 
out of, each site. Thus, the 16 possible configurations of arrows surrounding 
each site are reduced to six (see Fig. 6.2). 
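The counting behind the ice rule can be reproduced by listing all arrow assignments around one vertex. In the sketch below, each of the four edges carries +1 if its arrow points into the vertex and -1 if it points out; this encoding and the variable names are illustrative.

```python
from itertools import product

# All 2^4 ways of orienting the arrows on the four edges meeting at a site.
configs = list(product([+1, -1], repeat=4))

# Ice rule: exactly two arrows point in (and therefore two point out).
ice_configs = [c for c in configs if c.count(+1) == 2]

n_all = len(configs)        # all arrow assignments
n_ice = len(ice_configs)    # assignments allowed by the ice rule
```

The surviving count is the binomial coefficient "4 choose 2", which is where the name six-vertex model comes from.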

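Let $\epsilon_i$ represent the energy associated with each of the six possible 
configurations of arrows surrounding each site. Let $n_i$ be the number of times the $i$th 
configuration is repeated throughout the lattice. Then, the partition function is 
represented by 

$$ Z = \sum \exp\left[ -\frac{n_1 \epsilon_1 + n_2 \epsilon_2 + \cdots + n_6 \epsilon_6}{kT} \right], \tag{6.4.3} $$

where the sum is over all arrangements of arrows allowed by the ice rule. 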

For different values of $\epsilon_i$, we have different physical structures. For example, 
it is thought that the potassium dihydrogen phosphate crystal, KH$_2$PO$_4$, can 
be represented by the following choice: 

$$ \epsilon_1 = \epsilon_2 = 0, \qquad \epsilon_3 = \epsilon_4 = \epsilon_5 = \epsilon_6 > 0. \tag{6.4.4} $$





FIGURE 6.2. 

On the other hand, it is thought that an antiferroelectric can be modeled by the 
choice: 


$$ \epsilon_1 = \epsilon_2 = \epsilon_3 = \epsilon_4 > 0, \qquad \epsilon_5 = \epsilon_6 = 0. \tag{6.4.5} $$

Because the ice-type, six-vertex model is exactly solvable, we can find analytic 
expressions for its free energy and solve for its critical exponents. There 
are, however, some defects with the six-vertex model. It turns out that the model 
has a ferroelectric ordered state that is frozen, that is, the ordering is complete 
even at nonzero temperatures, and that the antiferroelectric properties do not 
diverge or vanish at criticality as simple powers of the critical temperature. 


Eight-Vertex Model 

Because of the oversimplification present in the six-vertex model, leading to 
nonphysical results, it was generalized to the eight-vertex model. This model 
places constraints on the 16 possible configurations of arrows surrounding a 
lattice site by assuming that there are an even number of arrows going into and 
out of each site. Thus, we will sum over eight possible configurations, each 
with its own energy $\epsilon_i$. 
The partition function is 

$$ Z = \sum \exp\left[ -\frac{n_1 \epsilon_1 + n_2 \epsilon_2 + \cdots + n_8 \epsilon_8}{kT} \right]. \tag{6.4.6} $$



Like the six-vertex model, the eight-vertex model is also exactly solvable. 
However, the eight-vertex model is considerably more sophisticated than the 
six-vertex model. In fact, the eight-vertex model, for various choices of the 
physical parameters, can describe both ferromagnets as well as ferroelectrics. 
In addition, it can contain the six-vertex model and the Ising model as special 
cases. 

The first statement is easy to understand, because we can always set two of 
the Boltzmann weights to zero (that is, let $\epsilon_7, \epsilon_8 \to \infty$) to obtain the 
six-vertex model. However, the second statement 
is surprising, because the eight-vertex model and the Ising model have very 
different physical structures. The vertex models have their energy based on the 
edges connecting the sites, while the Ising model has its energy based on the 
sites themselves. 

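The fact that the eight-vertex model contains the Ising model, however, can 
be seen by a change of parameters. Let $\sigma_{ij}$ be associated with the edge that 
links the $i$th and $j$th lattice site, and let it assume values of $+1$ or $-1$. Now 
rewrite the energy as follows: 

$$ E = -\sum_{i=1}^{M} \sum_{j=1}^{N} \left( J_v \sigma_{ij} \sigma_{i,j+1} + J_h \sigma_{ij} \sigma_{i+1,j} + J \sigma_{i,j+1} \sigma_{i+1,j} + J' \sigma_{ij} \sigma_{i+1,j+1} + J'' \sigma_{ij} \sigma_{i,j+1} \sigma_{i+1,j} \sigma_{i+1,j+1} \right). \tag{6.4.7} $$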

Notice that we have done nothing; we have merely rewritten the eight-vertex 
model in a way such that its dependence on Ising-type spins is more apparent. 
Now define 

$$ \alpha_{ij} = \sigma_{ij} \sigma_{i,j+1}, \qquad \mu_{ij} = \sigma_{ij} \sigma_{i+1,j}. \tag{6.4.8} $$

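Then, the energy can be written as 

$$ E = -\sum_{i=1}^{M} \sum_{j=1}^{N} \left( J_v \alpha_{ij} + J_h \mu_{ij} + J \alpha_{ij} \mu_{ij} + J' \alpha_{i+1,j} \mu_{ij} + J'' \alpha_{ij} \alpha_{i+1,j} \right) \tag{6.4.9} $$

with the condition 

$$ \mu_{ij} \alpha_{ij} \alpha_{i+1,j} \mu_{i,j+1} = 1. \tag{6.4.10} $$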

Notice that we have now split the partition function into two pieces, each 
representing a distinct Ising model. 
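The change of variables can be verified on a random spin configuration. The sketch below assumes the energy has the five-coupling form with two-spin terms along rows, columns, and both diagonals plus a four-spin term, and it uses periodic boundary conditions; the lattice size, coupling values, and function names are all illustrative.

```python
import random

M, N = 4, 5                      # lattice size (hypothetical)
random.seed(0)
sigma = [[random.choice([+1, -1]) for _ in range(N)] for _ in range(M)]

def s(i, j):
    return sigma[i % M][j % N]   # periodic boundaries (an assumption)

def alpha(i, j):
    return s(i, j) * s(i, j + 1)

def mu(i, j):
    return s(i, j) * s(i + 1, j)

def energy_sigma(Jv, Jh, J, Jp, Jpp):
    """Energy written directly in terms of the spins sigma_ij."""
    return -sum(Jv * s(i, j) * s(i, j + 1) + Jh * s(i, j) * s(i + 1, j)
                + J * s(i, j + 1) * s(i + 1, j) + Jp * s(i, j) * s(i + 1, j + 1)
                + Jpp * s(i, j) * s(i, j + 1) * s(i + 1, j) * s(i + 1, j + 1)
                for i in range(M) for j in range(N))

def energy_alpha_mu(Jv, Jh, J, Jp, Jpp):
    """The same energy written in terms of the bond variables alpha and mu."""
    return -sum(Jv * alpha(i, j) + Jh * mu(i, j) + J * alpha(i, j) * mu(i, j)
                + Jp * alpha(i + 1, j) * mu(i, j) + Jpp * alpha(i, j) * alpha(i + 1, j)
                for i in range(M) for j in range(N))

couplings = (0.3, -0.7, 1.1, 0.4, -0.2)
constraint_ok = all(mu(i, j) * alpha(i, j) * alpha(i + 1, j) * mu(i, j + 1) == 1
                    for i in range(M) for j in range(N))
```

Both expressions agree for any choice of couplings, and the product of the four bond variables around a face is always +1 because every spin in it appears exactly twice.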

In fact, the explicit relations between the eight-vertex $\epsilon_i$ and the $J$'s of the 
Ising model are given by 

$$ \epsilon_1 = -J_h - J_v - J - J' - J'', \qquad \epsilon_2 = J_h + J_v - J - J' - J'', $$
$$ \epsilon_3 = -J_h + J_v + J + J' - J'', \qquad \epsilon_4 = J_h - J_v + J + J' - J'', \tag{6.4.11} $$
$$ \epsilon_5 = \epsilon_6 = J - J' + J'', \qquad \epsilon_7 = \epsilon_8 = -J + J' + J''. $$

With this choice, we can show that the partition function of the Ising model is 
just twice the partition function of the eight-vertex model 

$$ Z_{\text{Ising}} = 2 Z_{\text{eight-vertex}}. \tag{6.4.12} $$


$Z_N$ Model 

Potts originally introduced two different types of models, the $Z_N$ model and 
what is usually called the Potts model. The $Z_N$ model is a straightforward 
generalization of the usual Ising model. The spins in the Ising model assume 
only the values of $+1$ or $-1$. However, we can easily generalize this to the 
case where the spin $\sigma_i$ points in $N$ equally spaced directions. Then, the energy 
associated with the model is the scalar product between nearest neighbors, that 
is, 

$$ E = -\sum_{i,j} \left\{ \sigma_{i,j} \cdot \sigma_{i,j+1} + \sigma_{i,j} \cdot \sigma_{i+1,j} \right\}. \tag{6.4.13} $$

Obviously, the Ising model corresponds to the case of $N = 2$. A parafermionic 
representation of the primary fields of this model was studied in Chapter 5. 


Potts Model 


The Potts model is defined by letting the spin $\sigma_i$ at the $i$th lattice site take on 
values from 1 to $q$. Two nearest-neighbor spins interact via the delta function, 
defined as 

$$ \delta(\sigma, \sigma') = 1 \ \text{if} \ \sigma = \sigma', \qquad \delta(\sigma, \sigma') = 0 \ \text{if} \ \sigma \neq \sigma'. \tag{6.4.14} $$

The energy is then defined to be 

$$ E = -J \sum_{(i,j)} \delta(\sigma_i, \sigma_j). \tag{6.4.15} $$
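For q = 2 the Potts energy is an Ising energy in disguise, because the delta function of two two-valued spins equals (1 + s s')/2 when the values 1 and 2 are relabeled as s = +1 and s = -1. The sketch below checks this on a small periodic square lattice; the lattice, the relabeling, and the function names are illustrative assumptions.

```python
def delta(a, b):
    return 1 if a == b else 0

def bonds(rows, cols):
    """Nearest-neighbor bonds of a periodic square lattice, each listed once."""
    for i in range(rows):
        for j in range(cols):
            yield (i, j), (i, (j + 1) % cols)
            yield (i, j), ((i + 1) % rows, j)

def potts_energy(spins, J):
    rows, cols = len(spins), len(spins[0])
    return -J * sum(delta(spins[a[0]][a[1]], spins[b[0]][b[1]])
                    for a, b in bonds(rows, cols))

def ising_energy(spins, J):
    rows, cols = len(spins), len(spins[0])
    return -J * sum(spins[a[0]][a[1]] * spins[b[0]][b[1]]
                    for a, b in bonds(rows, cols))

potts = [[1, 2, 1], [2, 2, 1], [1, 1, 2]]             # a q = 2 configuration
ising = [[3 - 2 * s for s in row] for row in potts]   # 1 -> +1, 2 -> -1
n_bonds = sum(1 for _ in bonds(3, 3))
```

With these definitions, the Potts energy with coupling J equals the Ising energy with coupling J/2 minus the constant J times half the number of bonds, so the two models have the same thermodynamics up to a rescaling of the coupling.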


This model can be solved at criticality. The case $q = 1$ is trivial. The case 
$q = 2$ is the Ising model, which is equivalent to a minimal model with $m = 3$. 
It can be shown to have critical exponents given by 

$$ \alpha = 0, \qquad \beta = \tfrac{1}{8}, \qquad \delta = 15, \tag{6.4.16} $$

which can then be compared with the minimal model. The case $q = 3$ is also 
equivalent to a minimal model. Its critical exponents are given by 

$$ \alpha = \tfrac{1}{3}, \qquad \beta = \tfrac{1}{9}, \qquad \delta = 14. \tag{6.4.17} $$


XYZ Heisenberg Model 

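Closely related to the Ising and the eight-vertex models is the XYZ Heisenberg 
model. Here, we replace the spin $\sigma_i$ with a real Pauli spinor. The Hamiltonian 
is given by 

$$ H = -\frac{1}{2} \sum_{j=1}^{N} \left\{ J_x \sigma^x_j \sigma^x_{j+1} + J_y \sigma^y_j \sigma^y_{j+1} + J_z \sigma^z_j \sigma^z_{j+1} + \cdots \right\}, \tag{6.4.18} $$

where the ellipses represent the interactions in the vertical direction. 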

Not only is $\sigma_j$ a Pauli spinor, it also carries the indices of all the spins in the 
system, that is, 

$$ \sigma_j = 1 \otimes \cdots \otimes \sigma \otimes \cdots \otimes 1, \tag{6.4.19} $$

where 1 is a $2 \times 2$ unit matrix, and the only nontrivial entry in the tensor 
product is at the $j$th site. 

If $J_x = J_y = J_z$, then this is the usual Heisenberg model. 

If $J_x = J_y = 0$, then only $J_z$ survives, and hence, we obtain the usual Ising 
model. 

If $J_z = 0$, then we have the XY model. 

If $J_x = J_y$, then we have the Heisenberg-Ising model. 

It can be shown that the Hamiltonian, for any value of the J ’s, can be written 
as the logarithmic derivative of an eight-vertex transfer matrix. 


Ashkin-Teller Model 

The Ashkin-Teller model, like the previous models, was based on a generalization 
of the Ising model. In this model, there are four types of atoms, called A, 
B, C, and D. There are four values of the energy $\epsilon_i$, given the different possible 
nearest-neighbor pairings of these atoms. The following energies correspond 
to the given pairings 

$$ \epsilon_0: AA, BB, CC, DD, \qquad \epsilon_1: AB, CD, \qquad \epsilon_2: AC, BD, \qquad \epsilon_3: AD, BC. \tag{6.4.20} $$

It is possible, however, to rewrite this model in terms of the usual Ising-type 
spins. Let us introduce two types of spins, $s_i$ and $\sigma_i$. Let the pair $(s_i, \sigma_i)$ equal 
$(+, +)$ if there is an A atom at any site $i$; $(+, -)$ if there is a B atom; $(-, +)$ 
if there is a C atom; and $(-, -)$ if there is a D atom. Then, the energy can be 
written as 

$$ E(i, j) = -J s_i s_j - J' \sigma_i \sigma_j - J_4 s_i s_j \sigma_i \sigma_j - J_0, \tag{6.4.21} $$

where 

$$ -J = (\epsilon_0 + \epsilon_1 - \epsilon_2 - \epsilon_3)/4, $$
$$ -J' = (\epsilon_0 + \epsilon_2 - \epsilon_3 - \epsilon_1)/4, $$
$$ -J_4 = (\epsilon_0 + \epsilon_3 - \epsilon_1 - \epsilon_2)/4, \tag{6.4.22} $$
$$ -J_0 = (\epsilon_0 + \epsilon_1 + \epsilon_2 + \epsilon_3)/4. $$
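The spin form of the energy can be checked against the table of pairings for every pair of atoms. The sketch below uses exact rational arithmetic; the numerical values chosen for the four energies are arbitrary, and the dictionary and function names are illustrative.

```python
from fractions import Fraction
from itertools import product

# Arbitrary (hypothetical) values for eps_0 ... eps_3.
e = [Fraction(1), Fraction(3), Fraction(-2), Fraction(5)]

# Atom type -> pair of Ising spins (s, sigma).
atoms = {"A": (+1, +1), "B": (+1, -1), "C": (-1, +1), "D": (-1, -1)}

J  = -(e[0] + e[1] - e[2] - e[3]) / 4
Jp = -(e[0] + e[2] - e[3] - e[1]) / 4
J4 = -(e[0] + e[3] - e[1] - e[2]) / 4
J0 = -(e[0] + e[1] + e[2] + e[3]) / 4

def spin_energy(X, Y):
    """Pair energy written in terms of the two kinds of Ising spins."""
    (s1, t1), (s2, t2) = atoms[X], atoms[Y]
    return -J * s1 * s2 - Jp * t1 * t2 - J4 * s1 * s2 * t1 * t2 - J0

def table_energy(X, Y):
    """Pair energy read off from the table of pairings."""
    if X == Y:
        return e[0]
    pair = frozenset((X, Y))
    if pair in (frozenset("AB"), frozenset("CD")):
        return e[1]
    if pair in (frozenset("AC"), frozenset("BD")):
        return e[2]
    return e[3]          # AD or BC

all_match = all(spin_energy(X, Y) == table_energy(X, Y)
                for X, Y in product("ABCD", repeat=2))
```

All sixteen ordered pairs agree, which confirms that the four couplings are just a change of basis for the four pairing energies.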


(There is yet another representation of the Ashkin-Teller model, as a staggered 
eight-vertex model.) 

The Ashkin-Teller model is not solvable, but its properties at criticality are 
known. It is known that its phase structure is surprisingly rich. It has five phases, 
including phases that are ferromagnetically and antiferromagnetically ordered, 
and one that corresponds to the m = 4 superconformal unitary minimal series 
at criticality. 





Hard Hexagon Model 

The hard hexagon model is exactly solvable, and it represents a two-dimensional 
lattice model of a gas of hard, that is, nonoverlapping, molecules. 
For example, it can be compared to a two-dimensional helium monolayer 
adsorbed onto a graphite surface. 

Imagine that our lattice consists of an infinite series of hexagons, each adjacent 
to each other and without any spaces in between. The only rule is that 
a particle may occupy the center of each hexagon. The model is hard, that is, 
the hexagons do not overlap. The partition function is then defined to be 

$$ Z = \sum_{n=0}^{N/3} z^n g(n, N), \tag{6.4.23} $$

where $z$ is the activity and $g(n, N)$ is the number of allowed ways in which $n$ 
particles can be placed on the lattice. There are $N$ sites, and hence, at maximum, only 
$N/3$ sites can be occupied. 
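The counting function g(n, N) can be tabulated by brute force on a very small lattice. The sketch below assumes the usual lattice-gas formulation of hard hexagons, in which particles sit on the sites of a triangular lattice and no two particles may occupy neighboring sites; the 3 x 3 periodic lattice and all names are illustrative.

```python
from itertools import combinations

L = 3                                        # 3 x 3 periodic triangular lattice
sites = [(x, y) for x in range(L) for y in range(L)]
# Six nearest neighbors of a site on a triangular lattice (sheared coordinates).
steps = [(1, 0), (-1, 0), (0, 1), (0, -1), (1, 1), (-1, -1)]

def neighbors(site):
    x, y = site
    return {((x + dx) % L, (y + dy) % L) for dx, dy in steps}

def g(n):
    """Number of ways to place n particles with no two on neighboring sites."""
    return sum(1 for chosen in combinations(sites, n)
               if all(b not in neighbors(a) for a, b in combinations(chosen, 2)))

counts = [g(n) for n in range(len(sites) // 3 + 1)]

def Z(z):
    """Grand partition function as a polynomial in the activity z."""
    return sum(c * z ** n for n, c in enumerate(counts))
```

On this lattice of N = 9 sites the polynomial stops at n = 3 = N/3: each site has six neighbors, so a particle leaves only two other sites available to the remaining particles.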

RSOS 

One of the most general solvable models is the RSOS (restricted solid-on-solid) 
model, which is directly related at criticality to the infinite sequence of 
minimal models found in Chapter 2, as we saw in Eq. (2.5.24). The RSOS 
model is defined by the plaquettes (squares). At each site $i$ in a square lattice, 
define an integer $l_i$, which represents the "height" of that point. The height is 
restricted to the interval 

$$ 1 \le l_i \le r - 1 \tag{6.4.24} $$

for a fixed integer $r$ ($r \ge 4$). (For the unrestricted solid-on-solid model, the value of 
$l_i$ has no restrictions, that is, $-\infty < l_i < \infty$.) The relative heights of nearest-neighbor 
sites can only differ by unity, that is, 

$$ |l_i - l_j| = 1 \tag{6.4.25} $$

if $i$ and $j$ are nearest neighbors. 
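The two height restrictions are easy to explore on a one-dimensional ring of sites, which is the kind of row on which a transfer matrix acts. The sketch below counts the allowed height assignments by brute force; the function name and the choice of a ring are illustrative.

```python
from itertools import product

def count_height_rings(r, n):
    """Count height assignments on a ring of n sites with 1 <= l <= r - 1
    and neighboring heights differing by exactly one."""
    heights = range(1, r)
    return sum(1 for ls in product(heights, repeat=n)
               if all(abs(ls[i] - ls[(i + 1) % n]) == 1 for i in range(n)))

ring_r4_n4 = count_height_rings(4, 4)
ring_r4_n3 = count_height_rings(4, 3)
```

For r = 4 the heights are 1, 2, 3, and every allowed ring alternates between the height 2 and a free choice of 1 or 3, which is why half of the sites behave like Ising spins. A ring with an odd number of sites admits no configuration at all, because neighboring heights must have opposite parity.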

To each plaquette, assign a Boltzmann weight $W$, which has the symmetries 

$$ W(l_1, l_2, l_3, l_4) = W(l_3, l_2, l_1, l_4) = W(l_1, l_4, l_3, l_2) = W(r - l_1, r - l_2, r - l_3, r - l_4). \tag{6.4.26} $$

The partition function is then given by the product of the weights 

$$ Z = \sum \prod_{(i,j,m,n)} W(l_i, l_j, l_m, l_n), \tag{6.4.27} $$

where the sum is over all allowed arrangements of heights on the lattice, and 
the product is over all faces of the lattice. 





The RSOS model exhibits several phases, and contains a wide variety of 
known models as specific examples. For example, if $r$ is even, we can translate 
it into an Ising-type model by changing to spin variables 

$$ S_i = (r - 2 l_i)/4, \tag{6.4.28} $$

which produces spin-$(r - 2)/4$ Ising spins on the odd sublattice and spin-$(r - 4)/4$ 
on the even one. 
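The spin content of the two sublattices follows from listing the values of S = (r - 2l)/4 for odd and for even heights separately. The sketch below does this with exact fractions; the function name and the example r = 6 are illustrative.

```python
from fractions import Fraction

def spin_values(r):
    """Values of S = (r - 2l)/4 for odd heights and for even heights."""
    odd = [Fraction(r - 2 * l, 4) for l in range(1, r) if l % 2 == 1]
    even = [Fraction(r - 2 * l, 4) for l in range(1, r) if l % 2 == 0]
    return odd, even

odd, even = spin_values(6)
```

For r = 6 the odd heights give S = 1, 0, -1 and the even heights give S = 1/2, -1/2. In general the values run in integer steps from +(r - 2)/4 down to -(r - 2)/4 on odd heights and from +(r - 4)/4 down to -(r - 4)/4 on even heights.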

By comparing the critical exponents of the RSOS model with those found 
for the minimal model, we can show that the correspondence between the two 
models is established for 


$$ r = m + 1, \tag{6.4.29} $$

that is, the m = 3 minimal model, the r = 4 RSOS model, and the Ising model 
at criticality are all the same. For higher values of r, the critical exponents of 
the RSOS model can be shown to include those of the Ashkin-Teller model 
and the hard hexagon model. In particular, we have the series in Eq. (2.5.24). 


6.5 Yang-Baxter Relation 

It is possible to bring some order to the rapid proliferation of models. We find 
that most of these models fall into two types: 

(1) vertex models; and 

(2) IRF (interaction around a face) models. 

The vertex models are like the ones we have studied, where the energy 
is defined by the arrows on the edges that surround a given site. The IRF 
models, like the Ising models, have spins located at each lattice site, with 
nearest-neighbor interactions. If we take one plaquette of a lattice, we can 
place the spins around the corners of the plaquette, hence the name. 

It will turn out that the reason for the exact solvability of these models 
is that the transfer matrices, which define the partition function and free energy, 
commute. When expressed mathematically, this relationship becomes the 
celebrated Yang-Baxter relation. In fact, mutually commuting transfer matrices, 
or equivalently the Yang-Baxter relation, are sufficient conditions for the 
solvability of any two-dimensional model. 

To get a better understanding of the Yang-Baxter relation, let us study the 
ice-type six-vertex model. For example, in ice, we have the molecules of water 
held together by electric dipole moments. Let us place water molecules on a 
square two-dimensional lattice, such that the line segments forming the lattice 
correspond to the electric fields, represented by arrows. 

These arrows only have two directions on any given line segment. Thus, 
from any lattice site, there are six different possible orientations of the arrows. 
Each of these six different orientations will have an energy associated with 
it, called $\epsilon_i$ for $i = 1, 2, \ldots, 6$. Thus, if $\phi$ represents the lattice sites along a 
horizontal line, then we have 

$$ Z = \sum_{\phi_1} \sum_{\phi_2} \cdots \sum_{\phi_M} V(\phi_1, \phi_2) V(\phi_2, \phi_3) \cdots V(\phi_M, \phi_1) = \mathrm{Tr}\, V^M, \tag{6.5.1} $$

where 

$$ V(\phi, \phi') = \sum \exp\left[ -\frac{m_1 \epsilon_1 + m_2 \epsilon_2 + \cdots + m_6 \epsilon_6}{kT} \right], \tag{6.5.2} $$

and $m_i$ is the number of vertices of type $i$ in the row. 


One can proceed to solve the system in this fashion. However, for our purposes, 
let us propose an alternative method in which we see the origin of the Yang- 
Baxter equation. 

The partition function can be totally rewritten in terms of the Boltzmann weights 

$$ w(i, j | k, l) = \exp[-\epsilon(i, j, k, l)/kT]. \tag{6.5.3} $$

Different values of $\epsilon(i, j, k, l)$ correspond to different models. 

Now, let $w(\mu_i, \alpha_i | \beta_i, \mu_{i+1})$ represent the contribution to the sum from the 
$i$th site. Each Greek index, in turn, can have values of $\pm 1$, such that there are 
only six possible orientations. Let us perform the sum horizontally, as before: 

$$ V_{\alpha\beta} = \sum_{\mu_1} \cdots \sum_{\mu_N} w(\mu_1, \alpha_1 | \beta_1, \mu_2)\, w(\mu_2, \alpha_2 | \beta_2, \mu_3) \cdots w(\mu_N, \alpha_N | \beta_N, \mu_1). \tag{6.5.4} $$

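Let $V'$ represent another transfer matrix, defined in the same way except that 
the weight $w$ is replaced by a different weight $w'$. Then 

$$ (VV')_{\alpha\beta} = \sum_{\gamma} V_{\alpha\gamma} V'_{\gamma\beta} = \sum_{\mu_1 \cdots \mu_N} \sum_{\nu_1 \cdots \nu_N} \prod_{i=1}^{N} S(\mu_i, \nu_i | \mu_{i+1}, \nu_{i+1} | \alpha_i, \beta_i), \tag{6.5.5} $$

where 

$$ S(\mu, \nu | \mu', \nu' | \alpha, \beta) = \sum_{\gamma} w(\mu, \alpha | \gamma, \mu')\, w'(\nu, \gamma | \beta, \nu'). \tag{6.5.6} $$

We can therefore write 

$$ (VV')_{\alpha\beta} = \mathrm{Tr}\, S(\alpha_1, \beta_1) S(\alpha_2, \beta_2) \cdots S(\alpha_N, \beta_N), $$
$$ (V'V)_{\alpha\beta} = \mathrm{Tr}\, S'(\alpha_1, \beta_1) S'(\alpha_2, \beta_2) \cdots S'(\alpha_N, \beta_N). \tag{6.5.7} $$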

We wish to show that $V$ and $V'$ commute, so that the two previous expressions 
are identical. This is obviously possible if there exists a four-by-four 
matrix $M$ such that 

$$ S(\alpha, \beta) = M S'(\alpha, \beta) M^{-1}. \tag{6.5.8} $$







FIGURE 6.3. 


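Let us multiply the previous relation from the right by $M$, whose elements we can also 
represent as a weight called $w''$. Then, we have the relationship 

$$ \sum_{\gamma, \mu'', \nu''} w(\mu, \alpha | \gamma, \mu'')\, w'(\nu, \gamma | \beta, \nu'')\, w''(\nu'', \mu'' | \nu', \mu') = \sum_{\gamma, \mu'', \nu''} w''(\nu, \mu | \nu'', \mu'')\, w'(\mu'', \alpha | \gamma, \mu')\, w(\nu'', \gamma | \beta, \nu'). \tag{6.5.9} $$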

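If we regard the three weights as a single function $S$ of a spectral parameter, 
evaluated at three different arguments, 

$$ w \to S(u), \qquad w' \to S(u + v), \qquad w'' \to S(v), \tag{6.5.10} $$

with the arrow labels of each weight becoming the lower and upper indices of $S$, 
then we can write the Yang-Baxter relationship in the form 

$$ \sum_{\alpha\beta\gamma} S^{\alpha\beta}_{ij}(u)\, S^{p\gamma}_{\alpha k}(u + v)\, S^{qr}_{\beta\gamma}(v) = \sum_{\alpha\beta\gamma} S^{\beta\gamma}_{jk}(v)\, S^{\alpha r}_{i\gamma}(u + v)\, S^{pq}_{\alpha\beta}(u). \tag{6.5.11} $$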

If we graphically represent this relationship, then we find the pattern expressed 
in Fig. 6.3, which pictorially displays the Yang-Baxter relation. 
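A relation of this type can be tested numerically for any candidate solution. The sketch below does so for one well-known solution that is not discussed in the text, the rational matrix R(u) = u*1 + P, where P exchanges two spin-1/2 spaces; it checks the operator form R12(u) R13(u+v) R23(v) = R23(v) R13(u+v) R12(u) on three spins. The choice of solution and all function names are assumptions made for illustration.

```python
from itertools import product

DIM = 8
basis = list(product([0, 1], repeat=3))          # states of three two-valued spins
index = {b: n for n, b in enumerate(basis)}

def identity():
    return [[1.0 if i == j else 0.0 for j in range(DIM)] for i in range(DIM)]

def permutation(a, b):
    """Matrix that exchanges tensor factors a and b (0-based)."""
    P = [[0.0] * DIM for _ in range(DIM)]
    for state in basis:
        swapped = list(state)
        swapped[a], swapped[b] = swapped[b], swapped[a]
        P[index[tuple(swapped)]][index[state]] = 1.0
    return P

def R(a, b, u):
    """Rational solution u*1 + P acting on factors a and b (assumed example)."""
    I, P = identity(), permutation(a, b)
    return [[u * I[i][j] + P[i][j] for j in range(DIM)] for i in range(DIM)]

def matmul(A, B):
    return [[sum(A[i][k] * B[k][j] for k in range(DIM)) for j in range(DIM)]
            for i in range(DIM)]

def yang_baxter_defect(u, v):
    """Largest entry of LHS - RHS; zero means the relation holds."""
    lhs = matmul(matmul(R(0, 1, u), R(0, 2, u + v)), R(1, 2, v))
    rhs = matmul(matmul(R(1, 2, v), R(0, 2, u + v)), R(0, 1, u))
    return max(abs(lhs[i][j] - rhs[i][j]) for i in range(DIM) for j in range(DIM))

defect = yang_baxter_defect(0.3, 1.1)
```

The defect vanishes to rounding error for any u and v. The middle argument must be the sum u + v: replacing it by an unrelated value generally spoils the equality, which is the operator statement behind the addition formulas mentioned later in this section.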

The second type of exactly solvable model is the IRF, which includes the 
Ising model and many of the other exactly solvable models. (However, we 
should stress that, as in the eight-vertex model, there are ways in which certain 
models can be formulated in both languages.) If we place four spins $a$, $b$, $c$, and 
$d$ (which can equal $+1$ or 0) around the four corners of a plaquette, the energy 
associated with the plaquette will be $\epsilon(a, b, c, d)$, so we define the Boltzmann 
weight of the plaquette as 

$$ w(a, b, c, d) = \exp[-\epsilon(a, b, c, d)/kT]. \tag{6.5.12} $$





For different choices of $\epsilon(a, b, c, d)$, we can represent a wide variety of 
models. For example, the Ising model can be represented as 

$$ \epsilon(a, b, c, d) = -\tfrac{1}{2} J [(2a - 1)(2b - 1) + (2c - 1)(2d - 1)] - \tfrac{1}{2} J' [(2c - 1)(2b - 1) + (2d - 1)(2a - 1)], \tag{6.5.13} $$

and the eight-vertex model can be written as 

$$ \epsilon(a, b, c, d) = -J (2a - 1)(2c - 1) - J' (2b - 1)(2d - 1) - J_4 (2a - 1)(2b - 1)(2c - 1)(2d - 1) \tag{6.5.14} $$

for $a, b, c, d = 0, 1$. 

The Hamiltonian is then represented as 

$$ H = \sum_{\text{faces}} \epsilon(\sigma_i, \sigma_j, \sigma_k, \sigma_l), \tag{6.5.15} $$

and the partition function for the IRF model is given by 

$$ Z = \sum_{\sigma_1} \cdots \sum_{\sigma_N} \prod_{(i,j,k,l)} w(\sigma_i, \sigma_j, \sigma_k, \sigma_l). \tag{6.5.16} $$

We will now repeat the same steps that we used in studying the Ising model. 
We wish to express the partition function as a trace over the transfer matrix 
and then isolate the condition for commuting transfer matrices. Let us define 
the partial sum 

$$ V_{\sigma\sigma'} = \prod_{j=1}^{n} w(\sigma_j, \sigma_{j+1}, \sigma'_{j+1}, \sigma'_j), \tag{6.5.17} $$

where $\sigma$ and $\sigma'$ are shorthand for the spins on two adjacent rows, 

$$ \sigma = \{\sigma_1, \sigma_2, \ldots, \sigma_n\}, \qquad \sigma' = \{\sigma'_1, \sigma'_2, \ldots, \sigma'_n\}, \tag{6.5.18} $$

and $\sigma_{n+1} = \sigma_1$ and $\sigma'_{n+1} = \sigma'_1$. 

We similarly define $V'$ by replacing $w$ with $w'$: 

$$ V'_{\sigma\sigma'} = \prod_{j=1}^{n} w'(\sigma_j, \sigma_{j+1}, \sigma'_{j+1}, \sigma'_j). \tag{6.5.19} $$

This allows us to form the product $VV'$ defined by 


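$$ (VV')_{\sigma\sigma'} = \sum_{\sigma''} V_{\sigma\sigma''} V'_{\sigma''\sigma'} = \sum_{\sigma''} \prod_{j=1}^{n} X(\sigma_j, \sigma''_j, \sigma'_j | \sigma_{j+1}, \sigma''_{j+1}, \sigma'_{j+1}), \tag{6.5.20} $$

where we introduce the quantity 

$$ X(a, b, c | a', b', c') = w(a, a', b', b)\, w'(b, b', c', c). \tag{6.5.21} $$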






FIGURE 6.4. 


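The whole point of performing this decomposition is to be able to write the 
sums as traces over transfer matrices 

$$ (VV')_{\sigma\sigma'} = \mathrm{Tr}\, X(\sigma_1, \sigma'_1 | \sigma_2, \sigma'_2)\, X(\sigma_2, \sigma'_2 | \sigma_3, \sigma'_3) \cdots X(\sigma_n, \sigma'_n | \sigma_1, \sigma'_1). \tag{6.5.22} $$

Similarly, we now define $X'$ with $w$ and $w'$ interchanged 

$$ (V'V)_{\sigma\sigma'} = \mathrm{Tr}\, X'(\sigma_1, \sigma'_1 | \sigma_2, \sigma'_2)\, X'(\sigma_2, \sigma'_2 | \sigma_3, \sigma'_3) \cdots X'(\sigma_n, \sigma'_n | \sigma_1, \sigma'_1). \tag{6.5.23} $$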

As usual, we find that, for the transfer matrices $V$ and $V'$ to commute, we 
need to postulate the existence of an $M$ matrix, such that 

$$ X(a, a' | b, b') = M(a, a')\, X'(a, a' | b, b')\, M(b, b')^{-1}. \tag{6.5.24} $$

Multiplying by $M$ from the right, we now find that the condition for commuting 
transfer matrices is 

$$ \sum_{c} w(b, d, c, a)\, w'(a, c, f, g)\, w''(c, d, e, f) = \sum_{c} w''(a, b, c, g)\, w'(b, d, e, c)\, w(c, e, f, g), \tag{6.5.25} $$

where the elements of the matrix $M(a, a')$ have been written as a third 
Boltzmann weight $w''$. 

This is now the Yang-Baxter relation for the IRF model. In Fig. 6.4, we have 
graphically displayed the structure of the Yang-Baxter relation, which differs 
only in form from the Yang-Baxter relation obtained from the vertex models 
in Eq. (6.5.11). (Because of the shape of this graph, this equation also goes by 
the name "star-triangle" relation.) 

Now that we have derived the Yang-Baxter relationship for both the vertex 
models and the IRF models, we have an alternative method of solving these 
models. Instead of trying to maximize the eigenvalues of the transfer matrices, 
which is how the Ising model was historically solved, we solve the Yang-Baxter 
relation directly. 

This second approach to solving statistical mechanical models is much 
more elegant than the brute force, hit-or-miss methods employed over the 
past decades. In fact, the method is so powerful that we can even see how
new infinite classes of models might be solved by looking for solutions of the
Yang-Baxter relation. 

The trick behind solving the Yang-Baxter relation is to reduce the Boltzmann 
weight function w(a , b, c , d) to a few independent parameters, and reexpress 
the Yang-Baxter relation in terms of this set. Then, we notice that these relations
are identical to the addition formulas found in ordinary trigonometry or
the theory of theta functions. The solution to the Yang-Baxter equation can be 
given in terms of known analytic functions satisfying these addition formulas. 
Once this analytic solution to the Yang-Baxter relation is found, we can insert 
this into the partition function and calculate the free energy. 

Example: Ising Model 

Let us illustrate this procedure for the Ising model. The partition function in
Eq. (6.5.12) can be written in terms of four independent functions $\omega_i$:

$$\begin{aligned}
\omega_1(u) &= w(2, 3, 2, 1; u) = w(2, 1, 2, 3; u), \\
\omega_2(u) &= w(2, 1, 2, 1; u) = w(2, 3, 2, 3; u), \\
\omega_3(u) &= w(1, 2, 3, 2; u) = w(3, 2, 1, 2; u), \\
\omega_4(u) &= w(1, 2, 1, 2; u) = w(3, 2, 3, 2; u).
\end{aligned} \tag{6.5.26}$$


Inserting this into Eq. (6.5.25), we find the Yang-Baxter equation simplifies 
and reduces to the following equations for the Boltzmann functions 

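$$\begin{aligned}
\omega_4(u)\,\omega_2(u+v)\,\omega_4(v) + \omega_3(u)\,\omega_1(u+v)\,\omega_3(v) &= \omega_2(v)\,\omega_4(u+v)\,\omega_2(u), \\
\omega_4(u)\,\omega_1(u+v)\,\omega_4(v) + \omega_3(u)\,\omega_2(u+v)\,\omega_3(v) &= \omega_1(v)\,\omega_4(u+v)\,\omega_1(u), \\
\omega_4(u)\,\omega_2(u+v)\,\omega_3(v) + \omega_3(u)\,\omega_1(u+v)\,\omega_4(v) &= \omega_2(v)\,\omega_3(u+v)\,\omega_1(u).
\end{aligned} \tag{6.5.27}$$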

The key step is to notice the similarity between these equations and the addition 
formulas found in the classical theory of theta functions. This is how we will 
find a solution to these reduced Yang-Baxter equations. 

The addition formulas for the theta functions are 

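$$\begin{aligned}
&\vartheta_1(u+x)\vartheta_1(u-x)\vartheta_1(v+y)\vartheta_1(v-y) - \vartheta_1(u+y)\vartheta_1(u-y)\vartheta_1(v+x)\vartheta_1(v-x) \\
&\qquad = \vartheta_1(u+v)\vartheta_1(u-v)\vartheta_1(x+y)\vartheta_1(x-y), \\
&\vartheta_4(u+x)\vartheta_4(u-x)\vartheta_4(v+y)\vartheta_4(v-y) - \vartheta_4(u+y)\vartheta_4(u-y)\vartheta_4(v+x)\vartheta_4(v-x) \\
&\qquad = -\vartheta_1(u+v)\vartheta_1(u-v)\vartheta_1(x+y)\vartheta_1(x-y), \\
&\vartheta_4(u+x)\vartheta_4(u-x)\vartheta_1(v+y)\vartheta_1(v-y) - \vartheta_4(u+y)\vartheta_4(u-y)\vartheta_1(v+x)\vartheta_1(v-x) \\
&\qquad = \vartheta_4(u+v)\vartheta_4(u-v)\vartheta_1(x+y)\vartheta_1(x-y).
\end{aligned} \tag{6.5.28}$$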
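As a numerical sanity check on the addition formulas (6.5.28), the following sketch evaluates the difference between the two sides of each formula at an arbitrary point. It assumes the standard Fourier-series definitions of the Jacobi theta functions with nome p; that normalization is an assumption, since this section does not spell it out, and the helper names `theta1`, `theta4`, and `addition_residuals` are purely illustrative.

```python
import math

def theta1(z, p, terms=30):
    """Jacobi theta_1(z, p) = 2 sum_{n>=0} (-1)^n p^((n+1/2)^2) sin((2n+1) z)."""
    return 2.0 * sum((-1) ** n * p ** ((n + 0.5) ** 2) * math.sin((2 * n + 1) * z)
                     for n in range(terms))

def theta4(z, p, terms=30):
    """Jacobi theta_4(z, p) = 1 + 2 sum_{n>=1} (-1)^n p^(n^2) cos(2 n z)."""
    return 1.0 + 2.0 * sum((-1) ** n * p ** (n * n) * math.cos(2 * n * z)
                           for n in range(1, terms))

def addition_residuals(u, v, x, y, p):
    """Left side minus right side for the three formulas, in the order listed."""
    t1 = lambda a, b: theta1(a + b, p) * theta1(a - b, p)  # theta_1(a+b) theta_1(a-b)
    t4 = lambda a, b: theta4(a + b, p) * theta4(a - b, p)  # theta_4(a+b) theta_4(a-b)
    r1 = t1(u, x) * t1(v, y) - t1(u, y) * t1(v, x) - t1(u, v) * t1(x, y)
    r2 = t4(u, x) * t4(v, y) - t4(u, y) * t4(v, x) + t1(u, v) * t1(x, y)
    r3 = t4(u, x) * t1(v, y) - t4(u, y) * t1(v, x) - t4(u, v) * t1(x, y)
    return r1, r2, r3

if __name__ == "__main__":
    # All three residuals should vanish up to floating-point rounding.
    print(addition_residuals(0.37, -0.82, 1.10, 0.25, p=0.3))
```

The point of the check is only that the three identities hold for generic arguments and generic nome, which is what allows them to be matched term by term against the reduced Yang-Baxter equations.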





By comparing the reduced Yang-Baxter relation and the addition formulas 
for theta functions, we can find the solution 


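$$\begin{aligned}
\omega_1(u) &= \frac{\vartheta_1(u + \lambda, p)}{\vartheta_1(\lambda, p)}, & \omega_2(u) &= \frac{\vartheta_1(\lambda - u, p)}{\vartheta_1(\lambda, p)}, \\
\omega_3(u) &= \epsilon\, \frac{\vartheta_1(u, p)}{\vartheta_1(2\lambda, p)}, & \omega_4(u) &= \frac{\vartheta_1(2\lambda - u, p)}{\vartheta_1(2\lambda, p)},
\end{aligned} \tag{6.5.29}$$

where $\epsilon = \pm 1$ and $\lambda = \pi/4$. Following in this fashion, we can use the Yang-
Baxter relations to find exact solutions for the various statistical mechanical
models.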
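The claim that the theta-function weights solve the reduced Yang-Baxter equations can be tested numerically. The sketch below is a minimal check under stated assumptions: the weights are taken as ω1(u) = ϑ1(u+λ)/ϑ1(λ), ω2(u) = ϑ1(λ−u)/ϑ1(λ), ω3(u) = ε ϑ1(u)/ϑ1(2λ), ω4(u) = ϑ1(2λ−u)/ϑ1(2λ) with λ = π/4, the three reduced equations are read as written in the code comments, and ϑ1 uses the standard Fourier series with nome p. The function names are illustrative, not part of the text.

```python
import math

LAMBDA = math.pi / 4  # the Ising value quoted with Eq. (6.5.29)

def theta1(z, p, terms=30):
    """Jacobi theta_1(z, p) = 2 sum_{n>=0} (-1)^n p^((n+1/2)^2) sin((2n+1) z)."""
    return 2.0 * sum((-1) ** n * p ** ((n + 0.5) ** 2) * math.sin((2 * n + 1) * z)
                     for n in range(terms))

def weights(u, p, eps=1.0):
    """Return (omega_1, omega_2, omega_3, omega_4) at spectral parameter u."""
    t = lambda z: theta1(z, p)
    w1 = t(u + LAMBDA) / t(LAMBDA)
    w2 = t(LAMBDA - u) / t(LAMBDA)
    w3 = eps * t(u) / t(2 * LAMBDA)
    w4 = t(2 * LAMBDA - u) / t(2 * LAMBDA)
    return w1, w2, w3, w4

def yang_baxter_residuals(u, v, p, eps=1.0):
    """Left side minus right side of the three reduced equations."""
    a1, a2, a3, a4 = weights(u, p, eps)          # argument u
    b1, b2, b3, b4 = weights(u + v, p, eps)      # argument u + v
    c1, c2, c3, c4 = weights(v, p, eps)          # argument v
    # w4(u) w2(u+v) w4(v) + w3(u) w1(u+v) w3(v) = w2(v) w4(u+v) w2(u)
    r1 = a4 * b2 * c4 + a3 * b1 * c3 - c2 * b4 * a2
    # w4(u) w1(u+v) w4(v) + w3(u) w2(u+v) w3(v) = w1(v) w4(u+v) w1(u)
    r2 = a4 * b1 * c4 + a3 * b2 * c3 - c1 * b4 * a1
    # w4(u) w2(u+v) w3(v) + w3(u) w1(u+v) w4(v) = w2(v) w3(u+v) w1(u)
    r3 = a4 * b2 * c3 + a3 * b1 * c4 - c2 * b3 * a1
    return r1, r2, r3

if __name__ == "__main__":
    for eps in (1.0, -1.0):
        print(eps, yang_baxter_residuals(0.23, 0.41, p=0.2, eps=eps))
```

In the limit p → 0 the theta functions reduce to sines and the three equations become ordinary trigonometric addition formulas, which is a convenient way to check the signs by hand.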


Example: Hard Hexagon Model 

For example, for the hard hexagon model, there are five independent Boltzmann 
weights 

$$\begin{aligned}
\omega_1 &= w(0, 0, 0, 0; u), \qquad \omega_2 = w(0, 1, 0, 0; u) = w(0, 0, 0, 1; u), \\
\omega_3 &= w(1, 0, 0, 0; u) = w(0, 0, 1, 0; u), \\
\omega_4 &= w(0, 1, 0, 1; u), \qquad \omega_5 = w(1, 0, 1, 0; u).
\end{aligned} \tag{6.5.30}$$

Inserting these weights into the Yang-Baxter relation, we find that the equations 
reduce to a set of five equations 

in, r nr / // in, in in 

CO\C0 2 CO l + CO3<jL) 4 C0 3 = (i>2 CO j CD 2 , + CO 5 CO 2 C0 3 = (0\C0 3 Q) 3 

co\(o 2 co 3 + co 2 co' 4 (o 3 — a) 4 co 3 co 2 , co^(o\a)^ 4- cos(o 2 (o 3 = co2(o f 5 co 2 

+ (Os(o' A co 5 = co 4 co' 5 co^. (6.5.31) 


Once again, by comparing these equations with the classical theta addition 
formulas, we find that the solution can be written as 


#i(3A. — u ) 

/.u - 

&i(X — u ) 

/.\a - 

_ #l(«) 

^i(3X> ’ 

w ’ 

V^iW(2A.)’ 

#i(4A. — w) 

(2X — u) 


»,<4» ' 

co 5 =-, 

*i(2A.) 

(6.5.32) 

where $\lambda = \pi/5$.




Example: Eight-Vertex Model 


Last, we can also compare the addition formulas with the Yang-Baxter relations 
coming from the eight-vertex model. We find the exact solution 


w(l , / + !,/,/ — 1;m) = u>(/, Z — !,/,/ + !; w) = 


it)(/ -hi,/,/ — 1, /; m) = w(l — 1, Z, Z + 1, /; w) ± 

#1 [(/ “h 1)A + cuq — w ] 


w(l, l ~h 1,/,/ ~h 1, ; w) = 


^1 (/A. + 6t>o) 


#i(* - w) 

V^(/-i)^(/ + i)^iW 
iKO *(*)’ 


(6.5.33) 





where 


$$\psi(a) = \vartheta_1(a\lambda + \omega_0) \tag{6.5.34}$$

and $\lambda$ and $\omega_0$ are arbitrary constants.


6.6 Solitons and the Yang-Baxter Equation 

Before leaving this chapter, let us briefly sketch another exactly solvable two- 
dimensional theory, that of solitons. They will become important for two 
reasons. First, the heart of their integrability condition is once again the Yang- 
Baxter relation and the commuting of the transfer matrices. Thus, the language 
of conformal field theory can be used to describe solitons. Second, as we will 
see in Chapter 13, the Korteweg-de Vries (KdV) soliton equations become 
important when we solve matrix models, which give us the first nonperturba- 
tive information concerning strings and two-dimensional gravity (albeit in the 
unphysical dimension D < 1). In particular, we will see that the reason why 
the matrix models are solvable in this domain is the existence of an infinite 
soliton hierarchy, called the KdV hierarchy. 

This points to one of the intriguing mysteries of two-dimensional physics, 
the wealth of exactly soluble but highly nonlinear field theories. This must, 
in some sense, have some common origin. Systems as vastly separated as ice 
crystals of hydrogen, Korteweg-de Vries or sine-Gordon soliton descriptions 
of water waves, and the string vacuum of the universe are strangely linked 
together by two-dimensional conformal systems. 

Solitons (for solitary waves) have two distinct qualities. They are two- 
dimensional solutions of nonlinear equations that: 

(1) are localized waves that propagate without changing their properties, such 
as shape, energy, or velocity; and 

(2) are stable against mutual collisions; in multiple soliton scattering, the 
solitons maintain their shape, although they are phase shifted. 

What is remarkable is that these models are exactly soluble in two dimensions,
that is, they possess an infinite number of conserved quantities $I_i$. By
Liouville’s theorem, the model is exactly soluble if there are an infinite number
of conserved quantities $I_i$ that are in involution (their Poisson brackets among
themselves are all zero).

In solving these nonlinear equations, ingenious methods, such as the inverse
scattering method, have been devised. However, over the decades, it has
become increasingly obvious that the reason this inverse scattering
method works so well is the Yang-Baxter equation. Let us list some
of the more well-known soluble models.





Korteweg-de Vries Equation 


The first and best-known integrable model is the Korteweg-de Vries equation,
formulated to explain the solitary water waves that J. Scott Russell observed along
the Edinburgh-Glasgow canal, which would travel long distances without
dispersing. The equation has the form

$$\frac{\partial u}{\partial t} - u \frac{\partial u}{\partial x} - \frac{\partial^3 u}{\partial x^3} = 0, \tag{6.6.1}$$

where $u(x, t)$ is the height of the water wave.


Sine-Gordon Equation 

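The sine-Gordon equation is given by

$$\frac{\partial^2 u}{\partial t^2} - \frac{\partial^2 u}{\partial x^2} = \sin u. \tag{6.6.2}$$

Its soliton solution can be written as

$$u = 4 \tan^{-1}\left[ c \exp\left( \frac{\pm (t + v x)}{\sqrt{1 - v^2}} \right) \right]. \tag{6.6.3}$$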
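The soliton formula can be verified directly. The sketch below assumes the equation in the form u_tt − u_xx = sin u and the solution u = 4 arctan[c exp(±(t + vx)/√(1 − v²))], and estimates the residual with central differences; `kink` and `sine_gordon_residual` are illustrative names, and the parameter values are arbitrary.

```python
import math

def kink(x, t, v=0.4, c=1.0, sign=1.0):
    """u = 4 arctan(c exp(sign * (t + v x) / sqrt(1 - v^2)))."""
    xi = (t + v * x) / math.sqrt(1.0 - v * v)
    return 4.0 * math.atan(c * math.exp(sign * xi))

def sine_gordon_residual(x, t, v=0.4, c=1.0, sign=1.0, h=1e-3):
    """Finite-difference estimate of u_tt - u_xx - sin(u) at (x, t)."""
    u = lambda xx, tt: kink(xx, tt, v, c, sign)
    u0 = u(x, t)
    u_tt = (u(x, t + h) - 2.0 * u0 + u(x, t - h)) / h ** 2
    u_xx = (u(x + h, t) - 2.0 * u0 + u(x - h, t)) / h ** 2
    return u_tt - u_xx - math.sin(u0)

if __name__ == "__main__":
    for sign in (1.0, -1.0):
        print(sign, sine_gordon_residual(0.3, -0.2, v=0.4, c=2.0, sign=sign))
```

Because the solution depends on x and t only through the combination (t + vx)/√(1 − v²), the check reduces to the ordinary differential equation u'' = sin u, and the constant c merely shifts the position of the kink.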


Nonlinear Schrödinger Equation


This generalizes the usual linear Schrödinger equation in two dimensions by
adding an explicit cubic term to the equations of motion

$$i \frac{\partial u}{\partial t} = -\frac{\partial^2 u}{\partial x^2} + 2\kappa (u^* u) u. \tag{6.6.4}$$

To analyze the solutions to these systems, we will use the method of Lax 
pairs and the inverse scattering method, perhaps the most powerful method 
devised to solve these models. Although the original equations themselves are 
quite nonlinear, the trick is to invent an auxiliary set of linear equations whose 
solution is well understood. 

Let $u(x, t)$ be a solution of one of the above equations. Then, let $\psi(x, t)$ be
a solution of the ordinary linear Schrödinger equation, moving in a potential
that is precisely $u(x, t)$. Usually, when solving the linear Schrödinger equation,
we begin with a potential and then solve for $\psi(x, t)$ moving in that potential.
However, the key observation is that this process also works backward: given
the scattering data for $\psi(x, t)$, we can reconstruct the potential $u(x, t)$, which
is a solution of the original nonlinear equation.

Therefore, we will start with the linear Schrödinger equation with a field
$\psi(x, t)$ moving in potential $u(x, t)$, governed by the equation

$$\frac{\partial^2 \psi}{\partial x^2} + u(x, t)\, \psi(x, t) = \lambda\, \psi(x, t), \tag{6.6.5}$$

where $\lambda$ is independent of the time and $u(x, t)$ is a solution of the original
nonlinear equation of motion.

The asymptotic form of $\psi(x, t)$ for discrete eigenvalues $\lambda_n = -\kappa_n^2$ is

$$\psi(x, t) = c_n(t)\, e^{\kappa_n x}, \quad x \to -\infty, \qquad c_n(t) = c_n(0)\, e^{-4\kappa_n^3 t}, \tag{6.6.6}$$

and for continuous eigenvalues $\lambda = k^2$, we also have

$$\psi(x, t) = \begin{cases} T(k, t) \exp(ikx + 4ik^3 t), & x \to \infty, \\ \exp(ikx + 4ik^3 t) + R(k, t) \exp(-ikx + 4ik^3 t), & x \to -\infty, \end{cases} \tag{6.6.7}$$

where

$$T(k, t) = T(k, 0), \qquad R(k, t) = R(k, 0) \exp(-8ik^3 t), \tag{6.6.8}$$

and $T(k, t)$ is the transmission coefficient and $R(k, t)$ is the reflection
coefficient.

Given the scattering data, it is a straightforward problem to reconstruct the
potential function $u(x, t)$, which in turn is a solution of the original nonlinear
equation that we seek to solve. A more systematic way of solving the inverse
scattering problem is to set up the equations in the Heisenberg picture.

We recall that in the Heisenberg picture, the operators of the theory are
functions of time. Once again, we will set up an auxiliary problem, except this
time we will introduce two new operators, called $L_m$ and $M_m$, which are the
Lax pairs. They are M x M matrices, which determine the evolution of the
wave function $\psi_m$ via the equations

$$\psi_{m+1} = L_m(\lambda)\, \psi_m, \qquad d\psi_m/dt = M_m \psi_m. \tag{6.6.9}$$

Let $\lambda$ be a constant in time. Now, differentiate the first equation with respect
to time and insert the second equation into the first, so that
$M_{m+1} L_m \psi_m = (dL_m/dt)\, \psi_m + L_m M_m \psi_m$. Then, it is easy to show

$$dL_m/dt = M_{m+1} L_m - L_m M_m, \tag{6.6.10}$$

which is an operator expression that acts on the wave function $\psi(x, t)$.

We now claim that each of the previous nonlinear equations can be recast 
in the above form, with suitable choices of the Lax pair. In fact, the model is 
completely integrable if a Lax pair can be found that is equivalent to the above 
consistency condition. For example, let us take the simplest example of the 
KdV equation. If we choose 


$$L(t) = D^2 + \tfrac{1}{6} u, \qquad M(t) = 4D^3 + \tfrac{1}{2}(Du + uD), \tag{6.6.11}$$

then the equation

$$dL/dt = [M, L] \tag{6.6.12}$$

is equivalent to the KdV equation.
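The equivalence between the Lax equation and the KdV equation can be checked with exact arithmetic. The sketch below assumes D = ∂/∂x, L = D² + u/6, and M = 4D³ + (Du + uD)/2, and represents u and a test function f as polynomials in x (coefficient lists, lowest power first). Under these assumptions the commutator [M, L] acts on f as multiplication by (u_xxx + u u_x)/6, so that dL/dt = [M, L] reads u_t = u u_x + u_xxx. All helper names are hypothetical.

```python
from fractions import Fraction

def trim(p):
    """Drop trailing zero coefficients, keeping at least one entry."""
    p = list(p)
    while len(p) > 1 and p[-1] == 0:
        p.pop()
    return p

def add(p, q):
    n = max(len(p), len(q))
    return trim([(p[i] if i < len(p) else 0) + (q[i] if i < len(q) else 0)
                 for i in range(n)])

def scale(c, p):
    return trim([c * a for a in p])

def mul(p, q):
    out = [Fraction(0)] * (len(p) + len(q) - 1)
    for i, a in enumerate(p):
        for j, b in enumerate(q):
            out[i + j] += a * b
    return trim(out)

def D(p, order=1):
    """Differentiate a polynomial `order` times."""
    for _ in range(order):
        p = [i * a for i, a in enumerate(p)][1:] or [Fraction(0)]
    return trim(p)

def L(f, u):
    """(D^2 + u/6) f"""
    return add(D(f, 2), scale(Fraction(1, 6), mul(u, f)))

def M(f, u):
    """(4 D^3 + (D u + u D)/2) f, where (D u) f means d/dx (u f)."""
    return add(scale(4, D(f, 3)),
               scale(Fraction(1, 2), add(D(mul(u, f)), mul(u, D(f)))))

def commutator(f, u):
    """[M, L] f = M(L f) - L(M f)"""
    return add(M(L(f, u), u), scale(-1, L(M(f, u), u)))

def expected_action(f, u):
    """((u_xxx + u u_x) / 6) f, the multiplication operator KdV predicts."""
    kdv_rhs = add(D(u, 3), mul(u, D(u)))
    return scale(Fraction(1, 6), mul(kdv_rhs, f))

if __name__ == "__main__":
    u = [Fraction(1), Fraction(2), Fraction(0), Fraction(1)]   # 1 + 2x + x^3
    f = [Fraction(0), Fraction(1), Fraction(3)]                # x + 3x^2
    print(commutator(f, u) == expected_action(f, u))
```

The important structural point is that all derivative terms acting on f cancel in the commutator, leaving a pure multiplication operator; that cancellation is what fixes the relative coefficients 4 and 1/2 in M.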

Let us now construct the transfer matrix by taking a product of N matrices 
and taking their trace 


$$T_N(\lambda) = \mathrm{Tr}\, t_N, \qquad t_N = [L_N(\lambda) L_{N-1}(\lambda) \cdots L_1(\lambda)]. \tag{6.6.13}$$

Differentiate this equation by t. Because of the Lax equation,
$dt_N/dt = M_{N+1} t_N - t_N M_1$, and for periodic boundary conditions
($M_{N+1} = M_1$) the trace of the right-hand side vanishes. It is then easy to see
that $T_N(\lambda)$ is a constant in time

$$dT_N(\lambda)/dt = 0. \tag{6.6.14}$$

Now assume, for the moment, that two transfer matrices with different
spectral parameters commute

$$[T_N(\lambda), T_N(\mu)] = 0. \tag{6.6.15}$$

If we power expand the transfer matrix $T_N(\lambda)$ as a function of the spectral
parameter $\lambda$, then we will have an infinite number of conserved quantities, $I_i$,
such that they are in involution

$$[I_i, I_j] = 0 \tag{6.6.16}$$

for all i and j. Then, by Liouville’s theorem, the model is solvable. For our
purposes, however, we recognize the commuting of the transfer matrix $T_N(\lambda)$
as a signal that there is a Yang-Baxter relation at work.

The point of this discussion is to reveal that the essential ingredient for 
solubility of two-dimensional models is the Yang-Baxter relationship, which 
is equivalent to commuting transfer matrices. The Yang-Baxter relation, in 
fact, is so powerful that we can write down analytic solutions to infinite classes 
of models based on its solutions. The key to understanding two-dimensional 
quantum systems, and in turn classes of conformal field theory, seems to lie in 
understanding better the meaning behind the Yang-Baxter relationship. 

Last, we mention that we will be meeting the KdV equations from an entirely 
new point of view when we encounter matrix models in Chapter 13. We will 
find that the KdV equations are the key to the exact nonperturbative solvability 
of the string theory in low dimensions. 


6.7 Summary 

One of the principal uses for conformal field theory, in addition to searching for
string vacuums, is to analyze two-dimensional statistical mechanical models 
at criticality, where the details of the models are washed out and universal 
characteristics remain that typify a conformal theory. What is interesting is that 
so many of these models are exactly solvable. The origin of this remarkable 
property is the Yang-Baxter relation. 





We begin with the partition function for a one- or two-dimensional discrete 
lattice 


$$Z = \sum_n \exp\left[ -\frac{E(n)}{kT} \right], \tag{6.7.1}$$

where $E(n)$ represents the energy of the nth state, k represents the Boltzmann
constant, and T represents the temperature. The object is to calculate an exact
expression for the free energy, defined as

$$F = -kT \ln Z. \tag{6.7.2}$$

Correlation functions between spins $\sigma_i$ and $\sigma_j$ at criticality exhibit scaling
behavior, which can be parametrized by critical exponents. Specifically, the
correlation function

$$g_{ij} = \langle \sigma_i \sigma_j \rangle - \langle \sigma_i \rangle \langle \sigma_j \rangle \tag{6.7.3}$$

will depend on the distance x separating the spins, and at large distances, it
will behave like some decreasing power of x multiplied by some exponential:

$$g_{ij} \sim x^{-\tau} e^{-x/\xi}, \tag{6.7.4}$$

where $\xi$ is called the correlation length. When the correlation length becomes
infinite, we have a phase transition.

For the Ising model, we can introduce the “energy operator” $\epsilon_n = \sigma_n \sigma_{n+1}$. At
criticality, when the theory becomes conformally invariant, we have the one-
to-one association between the familiar minimal primary fields of conformal
field theory and the fields $\sigma$ and $\epsilon$ of the Ising model:

$$\sigma \leftrightarrow \phi_{1/16,\,1/16}, \qquad \epsilon \leftrightarrow \phi_{1/2,\,1/2}. \tag{6.7.5}$$

What is surprising is that so many two-dimensional models are exactly solvable.
The simplest example is the one-dimensional Ising model, whose spins (which
take on values of ±1) are arranged on a line. Its partition function is

$$Z_N = \sum_{\sigma} \exp\left( K \sum_{j=1}^{N} \sigma_j \sigma_{j+1} + h \sum_{j=1}^{N} \sigma_j \right). \tag{6.7.6}$$

We can rewrite this in matrix form as

$$Z_N = \sum_{\sigma} V(\sigma_1, \sigma_2) V(\sigma_2, \sigma_3) \cdots V(\sigma_{N-1}, \sigma_N) V(\sigma_N, \sigma_1), \tag{6.7.7}$$

where we introduce the (2 x 2)-dimensional transfer matrix

$$V(\sigma, \sigma') = \exp\left[ K \sigma \sigma' + \frac{h}{2} (\sigma + \sigma') \right]. \tag{6.7.8}$$

The key is that we can rewrite the partition function as a matrix product over
the transfer matrix

$$Z_N = \mathrm{Tr}\, V^N. \tag{6.7.9}$$




This means that we can solve the entire system by diagonalizing the transfer 
matrix, as follows: 


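$$Z_N = \mathrm{Tr} \begin{pmatrix} \lambda_1 & 0 \\ 0 & \lambda_2 \end{pmatrix}^{N} = \lambda_1^N + \lambda_2^N. \tag{6.7.10}$$

Let $\lambda_1$ be the larger of the two eigenvalues, which will then dominate the
sum in the limit as $N \to \infty$. We then have

$$f(H, T) = -kT \lim_{N \to \infty} N^{-1} \ln Z_N = -kT \ln \lambda_1 = -kT \ln\left[ e^{K} \cosh h + \sqrt{e^{2K} \sinh^2 h + e^{-2K}} \right], \tag{6.7.11}$$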

so the system is exactly solvable. (Unfortunately, the system does not exhibit 
a phase transition at finite temperature.) 
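The transfer-matrix solution of the one-dimensional Ising model is easy to confirm on a small ring. The following sketch compares a brute-force sum over all 2^N spin configurations (with periodic boundary conditions, so that the spin after the last one is the first) against λ1^N + λ2^N, using the closed-form eigenvalues of the 2 x 2 matrix V(σ, σ') = exp[Kσσ' + h(σ + σ')/2]. The function names and the sample values of K, h, and N are illustrative.

```python
import itertools
import math

def brute_force_Z(N, K, h):
    """Sum exp(K sum_j s_j s_{j+1} + h sum_j s_j) over all 2^N periodic chains."""
    total = 0.0
    for spins in itertools.product((-1, 1), repeat=N):
        bonds = sum(spins[j] * spins[(j + 1) % N] for j in range(N))
        total += math.exp(K * bonds + h * sum(spins))
    return total

def transfer_eigenvalues(K, h):
    """Eigenvalues of [[e^(K+h), e^(-K)], [e^(-K), e^(K-h)]], larger one first."""
    base = math.exp(K) * math.cosh(h)
    root = math.sqrt(math.exp(2 * K) * math.sinh(h) ** 2 + math.exp(-2 * K))
    return base + root, base - root

def transfer_Z(N, K, h):
    """Z_N = Tr V^N = lambda_1^N + lambda_2^N."""
    lam1, lam2 = transfer_eigenvalues(K, h)
    return lam1 ** N + lam2 ** N

def free_energy_per_site_over_kT(K, h):
    """f / kT = -ln(lambda_1) in the limit N -> infinity."""
    return -math.log(transfer_eigenvalues(K, h)[0])

if __name__ == "__main__":
    N, K, h = 8, 0.7, 0.3
    print(brute_force_Z(N, K, h), transfer_Z(N, K, h))
    print(-math.log(transfer_Z(200, K, h)) / 200, free_energy_per_site_over_kT(K, h))
```

Since λ1 is an analytic function of K and h for all finite K, the free energy has no singularity, which is the statement that there is no phase transition at finite temperature.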

More complicated is the two-dimensional Ising model, which can be exactly 
solved only in the zero magnetic field limit. The two-dimensional Ising model, 
like its simpler one-dimensional cousin, can be expressed in terms of transfer 
matrices: 


$$V_{\sigma\sigma'} = \exp\left[ \sum_{j=1}^{n} (K \sigma_{j+1} \sigma'_j + L \sigma_j \sigma'_j) \right], \qquad W_{\sigma\sigma'} = \exp\left[ \sum_{j=1}^{n} (K \sigma_j \sigma'_j + L \sigma_j \sigma'_{j+1}) \right], \tag{6.7.12}$$

where W and V are now $2^n \times 2^n$ matrices. We can now perform the sum over
the two transfer matrices by summing vertically over the lattice

$$Z_N = \mathrm{Tr}\, (VW)^{m/2} = \sum_{i=1}^{2^n} \Lambda_i^m, \tag{6.7.13}$$

where the $\Lambda_i^2$ are the eigenvalues of VW.

As before, we now diagonalize the transfer matrices and then look for the
largest eigenvalue. (The details, however, are rather involved.) A careful
examination of its critical exponents at the phase transition shows that the model
becomes the well-known m = 3 minimal model discussed in Chapter 2.

After the original one-dimensional Ising model was proposed in 1925, a 
wide variety of exactly solvable models were studied. Some of these models 
can be solved exactly in the continuum limit for all values of the temperature. 
Some can only be solved exactly at the phase transition. However, all of these 
models exhibit universality at the critical temperature, so they can be compared
to conformal field theories. Since we have exhausted all classifications
of the simple bosonic conformal field theories, we should be able to group together
certain statistical mechanical models based on their critical exponents
and fusion rules according to conformal field theory. Thus, we have a simple 
classification scheme for statistical mechanical models at criticality. Let us just 
briefly sketch some of these models. 





The spherical model has the same partition function as the Ising model, 
except that the spins obey the constraint 

$$\sum_{j=1}^{N} \sigma_j^2 = N. \tag{6.7.14}$$

(Although this constraint seems unphysical, linking together spins no matter
how far apart they are, it can be shown to be a special limiting case of the
n-vector model.) An advantage of this model over the Ising model is that it can
be solved exactly in the presence of a magnetic field.

The ice-type, six-vertex model differs from the Ising model qualitatively,
because the energy is contained not at the sites but at the links or edges between 
the sites (i.e., the energy is concentrated in the chemical bonds between atoms, 
and hence, this model can describe systems with hydrogen, such as ice or 
dihydrogen phosphate crystals). 

Its partition function is given by 

$$Z = \sum \exp\left( -\frac{1}{kT} \sum_i n_i \epsilon_i \right), \tag{6.7.15}$$

where $\epsilon_i$ is the energy associated with each link, and $n_i$ is the number of times
the ith site is repeated in the lattice. Different values of $\epsilon_i$ and $n_i$ describe
different types of crystals. The model is exactly solvable, but it exhibits some
nonphysical properties, such as complete ordering even at nonzero temperatures.
Because of these nonphysical properties, the model was generalized to
the eight-vertex model, which places constraints on the 16 possible ways in
which arrows can be placed going into and out of each site.

The eight-vertex model, for various values of $\epsilon_i$ and $n_i$, is so general it can
model both ferromagnets and ferroelectrics, and it includes the six-
vertex and Ising models as special cases. Indeed, for a special case, we can
show

$$Z_{\text{Ising}} = 2 Z_{\text{eight-vertex}} \tag{6.7.16}$$

(which shows that a model with energy associated with sites, like the Ising 
model, can be rewritten as a model with energy associated with links, such as 
the vertex model). 

Also, Potts introduced two new models. The first is the $Z_k$ model, where the
spins can assume values in $Z_k$ instead of just ±1. Spins can now be represented
as vectors in this space, so the energy becomes

$$E = \sum_{i,j} \left\{ \sigma_{i,j} \cdot \sigma_{i,j+1} + \sigma_{i,j} \cdot \sigma_{i+1,j} \right\} \tag{6.7.17}$$

(obviously, this is the familiar Ising model for k = 2).

However, the Potts model is defined by letting the spin $\sigma_i$ at the ith lattice
site take on values from 1 to q. Two nearest neighbor spins interact via the
delta function, defined as

$$\delta(\sigma, \sigma') = \begin{cases} 1 & \text{if } \sigma = \sigma', \\ 0 & \text{if } \sigma \neq \sigma', \end{cases} \tag{6.7.18}$$

so that $\delta$ is 1 if the two spins are the same and 0 if they differ. This model can
be solved exactly. For the q = 2 case, we have the Ising model. For the case
of q = 3, we also have a minimal model.
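The statement that the q = 2 Potts model is the Ising model can be made explicit on a small ring. The sketch below assumes a Potts Boltzmann weight exp[J Σ δ(σ_j, σ_{j+1})] over nearest-neighbor bonds (the coupling J and this normalization are assumptions made for illustration). Writing the two Potts states as s = ±1 gives δ(σ, σ') = (1 + ss')/2, so the Potts partition function on a ring of N sites should equal exp(NJ/2) times the Ising partition function with coupling K = J/2.

```python
import itertools
import math

def potts_Z(N, J, q=2):
    """Sum exp(J * number of equal neighbor pairs) over q^N periodic chains."""
    total = 0.0
    for spins in itertools.product(range(1, q + 1), repeat=N):
        matches = sum(1 for j in range(N) if spins[j] == spins[(j + 1) % N])
        total += math.exp(J * matches)
    return total

def ising_Z(N, K):
    """Zero-field Ising ring: sum exp(K sum_j s_j s_{j+1}) over s_j = +-1."""
    total = 0.0
    for spins in itertools.product((-1, 1), repeat=N):
        bonds = sum(spins[j] * spins[(j + 1) % N] for j in range(N))
        total += math.exp(K * bonds)
    return total

if __name__ == "__main__":
    N, J = 6, 0.9
    # delta = (1 + s s') / 2  implies  Z_Potts(J) = exp(N J / 2) * Z_Ising(J / 2)
    print(potts_Z(N, J), math.exp(N * J / 2) * ising_Z(N, J / 2))
```

For q = 3 no such rewriting in terms of a single ±1 variable exists, which is why the three-state model is a genuinely different universality class.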

The well-known XYZ Heisenberg model has a Hamiltonian given by

$$H = \sum_j \left\{ J_x \sigma^x_j \sigma^x_{j+1} + J_y \sigma^y_j \sigma^y_{j+1} + J_z \sigma^z_j \sigma^z_{j+1} + \cdots \right\} \tag{6.7.19}$$

where the ellipses represent the interactions in the vertical direction.

If $J_x = J_y = J_z$, then this is the usual Heisenberg model.

If $J_x = J_y = 0$, then only $J_z$ survives, and hence, we obtain the usual Ising
model.

If $J_z = 0$, then we have the XY model.

If $J_x = J_y$, then we have the Heisenberg-Ising model.

It can be shown that the Hamiltonian, for any value of the J ’s, can be written 
as the logarithmic derivative of an eight-vertex transfer matrix. 

The Ashkin-Teller model is another generalization of the Ising model, except 
now we have different species of atoms. Its partition function is roughly the 
same as the Ising model, except we have different types of spinors to sum over. 
(The model is not solvable, but its properties at criticality are known.) 

The hard hexagon model is exactly solvable and represents a gas of hard 
(i.e., nonoverlapping) molecules. The partition function is 

$$Z = \sum_{n=0}^{N/3} z^n g(n, N), \tag{6.7.20}$$

where g(n, N) is the number of ways in which n particles can be placed in 
each of the various hexagons. There are N sites, and hence, at maximum, only 
N /3 sites can be occupied. 

Now, we turn our attention to the main problem, which is the origin of 
why these models are solvable. A close examination of the steps used to solve 
these models shows that the key ingredient is that partition functions can be 
expressed entirely in terms of transfer matrices and that these transfer matrices 
commute. The mathematical expression of commuting transfer matrices, in 
turn, is expressed by the Yang-Baxter relationship, one of the deepest results 
of two-dimensional statistical mechanics. 

The Yang-Baxter relation can be expressed graphically in two ways, depending
on whether we are studying vertex-type models or IRF (interaction
around a face) models.

Let us begin with a vertex model. Let $w(\mu_i, \alpha_i \,|\, \beta_i, \mu_{i+1})$ represent the
contribution to the sum from the ith site. Each Greek index, in turn, can have
values of ±1, such that there are only six possible orientations. Now, let us
introduce the transfer matrix

$$V_{\alpha,\beta} = \sum_{\mu_1} \cdots \sum_{\mu_n} w(\mu_1, \alpha_1 \,|\, \beta_1, \mu_2)\, w(\mu_2, \alpha_2 \,|\, \beta_2, \mu_3) \cdots w(\mu_n, \alpha_n \,|\, \beta_n, \mu_1). \tag{6.7.21}$$

The partition function can be represented totally in terms of transfer matrices. 
By demanding that transfer matrices commute, we have a nontrivial relation, 
the Yang-Baxter equation 

E + v)S» = E + «)S£(«). (6.7.22) 

afiy a/3y 

where the S’s are defined in terms of the w’s. Similarly, the IRF models can
also be solved via the Yang-Baxter equation, except that the topology of the
relation resembles a star and a triangle, hence the name “star-triangle relation.”

Last, we conjecture that perhaps all two-dimensional soluble systems have
the same origin in the Yang-Baxter relation. One example of this is soliton
theory (which we will meet again in Chapter 13). Solitons can be described by
Lax pairs $L_m$ and $M_m$ of M x M matrices, which determine the evolution of the
wave function $\psi_m$ via the equations

$$\psi_{m+1} = L_m(\lambda)\, \psi_m, \qquad d\psi_m/dt = M_m \psi_m. \tag{6.7.23}$$

Let $\lambda$ be a constant in time. Now, differentiate the first equation with respect
to time and insert the second equation into the first. Then, it is easy to show

$$dL_m/dt = M_{m+1} L_m - L_m M_m, \tag{6.7.24}$$

which is an operator expression that acts on the wave function $\psi(x, t)$.

We now claim that each of the previous nonlinear equations can be recast 
in the above form, with suitable choices of the Lax pair. In fact, the model is 
completely integrable if a Lax pair can be found that is equivalent to the above 
consistency condition. 

Let us now construct the transfer matrix by taking a product of N matrices $L_m$
and taking their trace

$$T_N(\lambda) = \mathrm{Tr}\, t_N, \qquad t_N = [L_N(\lambda) L_{N-1}(\lambda) \cdots L_1(\lambda)]. \tag{6.7.25}$$

Differentiate this equation by t. Because of the Lax equation, it is easy to see
that $T_N(\lambda)$ is a constant in time

$$dT_N(\lambda)/dt = 0. \tag{6.7.26}$$

Now assume, for the moment, that two transfer matrices with different
spectral parameters commute

$$[T_N(\lambda), T_N(\mu)] = 0. \tag{6.7.27}$$

If we power expand the transfer matrix $T_N(\lambda)$ as a function of the spectral
parameter $\lambda$, then we will have an infinite number of conserved quantities, $I_i$,
such that they are in involution

$$[I_i, I_j] = 0 \tag{6.7.28}$$

for all i and j. Then, by Liouville’s theorem, the model is exactly solvable.
Thus, we see once again the strong relationship between exactly solvable two- 
dimensional systems. 


References 


1. C. N. Yang, Phys. Rev. 85, 808 (1952); Phys. Rev. Lett. 19, 1312 (1967).

2. R. J. Baxter, Exactly Solved Models in Statistical Mechanics , Academic Press, 
San Diego, 1982; Ann. Phys. 70, 193 (1972). 

For reviews, see Refs. 3-6. 

3. E. H. Lieb and F. Y. Wu, in Phase Transitions and Critical Phenomena , C. Domb 
and M. S. Green, eds., Academic Press, San Diego (1972). 

4. M. N. Barber, in Phase Transitions and Critical Phenomena , C. Domb and J. L. 
Lebowitz, eds., Academic Press, San Diego (1983). 

5. H. W. Diehl, in Phase Transitions and Critical Phenomena , vol. 10, C. Domb and 
J. L. Lebowitz, eds., Academic Press, San Diego (1986). 

6. B. M. McCoy and T. T. Wu, The Two-Dimensional Ising Model, Harvard University
Press, Cambridge, MA (1973).

7. E. Ising, Z. Physik. 31, 253 (1925). 

8. L. Onsager, Phys. Rev. 65, 117 (1944). 



CHAPTER 7 


Toward a Classification of 
Conformal Field Theories 


7.1 Feigin-Fuchs Free Fields 

In order to make some sense out of the jungle of conformal field theories that 
have been discovered from string theory, physicists have tried to classify these 
vacuums using various techniques, with varying degrees of success. At present, 
no comprehensive classification scheme exists that gives us insight into the 
structure of these vacuums. In fact, it is still largely a mystery why conformal 
field theories behave as they do. There has been some progress in understanding 
conformal field theories with finite numbers of primary fields, but there is 
almost no real understanding of conformal field theories with infinite numbers 
of primary fields. If a convenient and powerful classification scheme could 
be devised, then it may be possible to see nontrivial relationships between 
different conformal field theories, which in turn may help us to understand 
which, if any, of these conformal field theories have a physical application. 

Although a satisfactory classification scheme does not yet exist, in the last 
few years much progress has been made toward developing different schemes 
that can partially catalog the multitude of conformal field theories. Let us list 
some of the major formalisms that have been proposed, mentioning their strong 
and weak points. 


(1) Coset Construction [1]: The GKO coset construction, reviewed in Chapter
2, was one of the earliest to be discovered and is still one of the most powerful
techniques for categorizing conformal field theories. The minimal
conformal field theories can be easily constructed via this method. In fact, 
all known rational conformal field theories can be constructed using the 
coset construction. 






However, one drawback to this construction is that it reduces the problem 
of finding representations of the Virasoro algebra to an equally difficult 
problem, finding representations of the Kac-Moody algebras. In particular, 
the method has somewhat limited usefulness because, in order to construct 
the correlation functions for G/H , one must know them for G and H. 
Although this procedure provides a remarkably versatile method by which 
to construct conformal field theories, it is sometimes a rather clumsy way 
in which to actually calculate the characters, primary fields, etc., of the 
model. More important, however, it gives us no deeper understanding into 
the reason why the multitude of conformal field theories exists, nor does 
it help us understand the relationships between these theories. 

(2) Feigin-Fuchs [2-9]: The Feigin-Fuchs free field method, in contrast to the
coset method, gives us information about computing correlation functions 
by reducing them to a series of line integrals over the complex plane. By 
adding fields at infinity, it gives the ability to reduce complicated conformal 
field theory correlation functions to correlation functions of free fields. 
Its power is that it gives us a very practical way in which to calculate 
with a conformal field theory. In fact, it can be shown that all GKO coset 
constructions can be derived via Feigin-Fuchs free fields. Its disadvantage 
is that, like the GKO construction, it gives us no deeper understanding of 
the relationships between conformal field theories. 

(3) Landau-Ginzburg and Catastrophe Theory [10-16]: The Landau-Ginzburg
method gives us a new way of looking at the relationship between 
conformal field theories. The Landau-Ginzburg potential, coupled with 
renormalization group methods, gives us a way in which certain conformal 
field theories may flow into each other via renormalization group flows. 
The Zamolodchikov c theorem, especially, gives us a powerful way in 
which to see how certain conformal field theories can flow into other ones. 
For the N = 2 superconformal field theories, catastrophe theory may be 
applied to these potentials. Because the mathematicians have already made 
great strides in the classification of catastrophe theory, perhaps one can use 
this classification scheme to classify the Landau-Ginzburg potentials of 
the N = 2 theory. 

Although this formalism is quite beautiful, it is not as general as the other 
methods. Many conformal field theories cannot be written in terms of 
Landau-Ginzburg potentials, and they lie outside the classification scheme 
of catastrophe theory. 

(4) Knots and Chern-Simons Theory [17]: Perhaps the most original approach
to the classification problem is Witten’s use of knot
theory. Because a three-dimensional Chern-Simons gauge theory is purely
topological (i.e., it is generally covariant without any metric tensor), its 
correlation functions are also purely topological. The correlation functions 
over Wilson lines give us invariant knot polynomials, generalizing the knot 
polynomials found independently by the mathematicians. 





If we quantize the system and take a time slice, one dimension is lost, and 
the theory becomes a two-dimensional conformal field theory. If we apply 
the Dirac constraints directly onto the Hilbert space, then the physical 
space is equivalent to the conformal blocks found in Chapter 2. The three- 
dimensional action, in the Coulomb gauge, becomes a version of the two- 
dimensional WZW model. 

Like the coset and free field construction, knot theory can also describe all 
known rational conformal field theories. The drawback with this approach 
is that the beautiful geometry behind knot theory does not, at the moment, 
give us any insight into the classification of conformal field theory. Like 
the other schemes, it is still obscure how this approach can reveal to us the 
relationships between conformal field theories. 


In this chapter, we will review the free field and the Landau-Ginzburg approaches.
(We will save our discussion of knot theory until Chapter 8, where
we will use knot theory to unify many of the features of conformal field theory 
and statistical mechanics.) 

The Feigin-Fuchs method begins with the observation that the N-point
correlation functions of the minimal model, which in general are difficult to 
compute, have a representation entirely in terms of free boson fields. All 
correlation functions, as well as all structure constants, can be explicitly 
calculated. 

We start by making some elementary observations about the N-point
function of N interacting tachyons, which were discussed in Chapter 1:

$$\langle V_{\alpha_1}(z_1) V_{\alpha_2}(z_2) \cdots V_{\alpha_N}(z_N) \rangle = \prod_{i<j}^{N} (z_i - z_j)^{\alpha_i \alpha_j}, \tag{7.1.1}$$

which vanishes unless $\sum_i \alpha_i = 0$. In general, if we wish to take the correlation
function of several vertex functions of the same type $V_\alpha$, then, since the $\alpha$ are
all positive and can never sum to zero, we find that the correlation function
is trivially zero. Feigin-Fuchs, however, discovered a trick by which general
correlation functions can be constructed, even when the $\alpha_i$ do not sum to zero.

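Let us recall the discussion of the energy-momentum tensor and vertex operators given in Eqs. (3.1.28)–(3.1.33). Let us choose:

$$T(z) = -\tfrac{1}{2} : \partial_z \phi \, \partial_z \phi : + \, i\alpha_0 \, \partial_z^2 \phi, \tag{7.1.2}$$

where we have set $\epsilon = -1$ and $Q = -2i\alpha_0$. Then, the central term is given by $c = 1 - 12\alpha_0^2$, and the conformal weight of the vertex function $: e^{i\alpha\phi} :$ is equal to $\tfrac{1}{2}\alpha(\alpha - 2\alpha_0)$, that is,

$$T(w) : e^{i\alpha\phi(z)} : \; \sim \; \frac{\alpha^2 - 2\alpha\alpha_0}{2(w - z)^2} : e^{i\alpha\phi(z)} : + \frac{1}{w - z} \, \partial_z : e^{i\alpha\phi(z)} : . \tag{7.1.3}$$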



7.1 Feigin-Fuchs Free Fields 199 


Now, consider the operator [2-5]:

$$Q = \oint dz \, J(z), \qquad J(z) = : e^{i\alpha\phi(z)} : . \tag{7.1.4}$$

If we choose $\alpha$ such that the integrand $J(z)$ has weight 1, then we have

$$\alpha^2 - 2\alpha\alpha_0 = 2, \tag{7.1.5}$$

which has two solutions for $\alpha$:

$$\alpha_\pm = \alpha_0 \pm \sqrt{\alpha_0^2 + 2}. \tag{7.1.6}$$
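The weight-1 condition can be checked numerically. The following is a minimal sketch, not part of the original construction: the function names and the sample value of the background charge are illustrative assumptions. It verifies that both roots give a vertex of weight 1, and that they satisfy $\alpha_+ + \alpha_- = 2\alpha_0$ and $\alpha_+\alpha_- = -2$.

```python
import math

def screening_momenta(alpha0):
    """Return (alpha_plus, alpha_minus), the two roots of a^2 - 2*a*alpha0 = 2."""
    root = math.sqrt(alpha0 ** 2 + 2.0)
    return alpha0 + root, alpha0 - root

def vertex_weight(alpha, alpha0):
    """Conformal weight alpha*(alpha - 2*alpha0)/2 of the vertex :exp(i*alpha*phi):."""
    return 0.5 * alpha * (alpha - 2.0 * alpha0)

alpha0 = 0.3  # arbitrary illustrative value of the background charge (an assumption)
a_plus, a_minus = screening_momenta(alpha0)

# Both screening currents should have weight 1.
print(vertex_weight(a_plus, alpha0), vertex_weight(a_minus, alpha0))
# Sum and product of the roots of the quadratic.
print(a_plus + a_minus, a_plus * a_minus)
```

The sum rule $\alpha_+ + \alpha_- = 2\alpha_0$ is what allows the combination in Eq. (7.1.9) to be written in terms of $\alpha_\pm$ alone.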

The whole point of the Feigin-Fuchs construction is that $Q$ is conformally invariant (and hence may be inserted into a correlation function without affecting its conformal properties) but carries a nontrivial “momentum” $\alpha$, which can be adjusted so that the total “momentum” of an $N$-point correlation function vanishes. For example, the correlator

$$\langle V_\alpha V_\alpha V_\alpha V_{2\alpha_0 - \alpha} \rangle \tag{7.1.7}$$

is usually equal to zero because the momenta do not sum to zero.

However, we will alter this correlator in several ways. First, we will insert a new vertex into the correlator, located at $z = \infty$ with momentum $-2\alpha_0$. This will represent the “screening charge.” It will also partially cancel the momentum due to the last vertex function. Second, we can insert as many $Q_\pm$ with momentum $\alpha_\pm$ into the correlator as we want, so let us insert $n - 1$ operators $Q_-$ and $m - 1$ operators $Q_+$. Third, set all $\alpha_i$ equal to $\alpha$.

To have a nonzero matrix element, we must have the sum of all momenta, coming from both $V_\alpha$ and $Q_\pm$, equal $2\alpha_0$. Depending on $n$ and $m$, this fixes the value of $\alpha$ to be $\alpha_{n,m}$ via

$$(3 - 1)\alpha_{n,m} + (n - 1)\alpha_- + (m - 1)\alpha_+ + 2\alpha_0 - 2\alpha_0 = 0. \tag{7.1.8}$$

Solving for $\alpha_{n,m}$, we find that the conformal weight $\Delta_{n,m}$ of the vertex $V_{\alpha_{n,m}}$ must satisfy

$$2\Delta_{n,m} = \alpha_{n,m}^2 - 2\alpha_{n,m}\alpha_0 = \tfrac{1}{4}\left[(\alpha_- n + \alpha_+ m)^2 - (\alpha_+ + \alpha_-)^2\right]. \tag{7.1.9}$$

This is precisely the form of the Kac formula [see Eq. (2.5.7)]. The correspondence becomes complete if we set $\alpha_0^2 = [2m(m + 1)]^{-1}$ and $p = m$ and $q = n$.
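The agreement with the Kac formula can be sketched numerically. The code below is an illustration under stated assumptions: it takes the Kac formula in the standard form $h_{p,q} = \{[p(M+1) - qM]^2 - 1\}/[4M(M+1)]$ (assumed to match Eq. (2.5.7), which is not reproduced here), and it uses the hypothetical name `model` for the minimal-model label $M$ so that it is not confused with the index $m$ of $\alpha_{n,m}$.

```python
import math

def screening_momenta(alpha0):
    """Roots of alpha^2 - 2*alpha*alpha0 = 2, as in Eq. (7.1.6)."""
    root = math.sqrt(alpha0 ** 2 + 2.0)
    return alpha0 + root, alpha0 - root

def alpha_nm(n, m, alpha0):
    """Momentum fixed by Eq. (7.1.8): 2*alpha + (n-1)*alpha_- + (m-1)*alpha_+ = 0."""
    a_plus, a_minus = screening_momenta(alpha0)
    return 0.5 * ((1 - n) * a_minus + (1 - m) * a_plus)

def free_field_weight(n, m, alpha0):
    """Weight alpha*(alpha - 2*alpha0)/2 evaluated at alpha = alpha_{n,m}."""
    a = alpha_nm(n, m, alpha0)
    return 0.5 * a * (a - 2.0 * alpha0)

def kac_weight(p, q, model):
    """Assumed standard Kac formula for the unitary minimal model labelled by `model`."""
    return ((p * (model + 1) - q * model) ** 2 - 1) / (4.0 * model * (model + 1))

model = 3  # illustrative choice: the Ising model, c = 1/2
alpha0 = 1.0 / math.sqrt(2.0 * model * (model + 1))

# Compare the free-field weight with the Kac weight, using p = m and q = n.
for n in range(1, model):
    for m in range(1, model + 1):
        print((n, m), free_field_weight(n, m, alpha0), kac_weight(m, n, model))
```

For the Ising choice, each printed pair of weights should agree to rounding error.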

This is a rather unexpected, but fortunate, result. If we set the conformal 
weight of V a to be the conformal weight found in the minimal model, then we 
can obtain a nonzero correlation function by inserting a certain number of Q ’s 
into Eq. (7.1.7), which do not change the conformal structure of the correlation 
function but do change the momentum-conservation equation so that nonzero 
correlation functions are found. 

This means that we have found a representation of the correlation functions of the minimal model in terms of free boson fields. Since the transformation properties of these free boson vertex functions agree precisely with the transformation properties of the minimal model’s fields, we now have a convenient way in which to calculate the $N$-point functions and structure constants of the minimal model. For example, we can find an explicit expression for the four-point function over minimal fields $\phi_{n,m}$ if we insert $n - 1$ currents $J_-$ and $m - 1$ currents $J_+$ (each of weight 1), as follows:

$$\begin{aligned} \langle \phi_{n,m}(z_1)\phi_{n,m}(z_2)\phi_{n,m}(z_3)\phi_{n,m}(z_4) \rangle = {} & \oint_{C_1} du_1 \cdots \oint_{C_{n-1}} du_{n-1} \oint_{S_1} dv_1 \cdots \oint_{S_{m-1}} dv_{m-1} \\ & \times \langle V_{\alpha_{n,m}}(z_1) \cdots V_{\alpha_{n,m}}(z_4) \, J_-(u_1) \cdots J_-(u_{n-1}) \\ & \times J_+(v_1) \cdots J_+(v_{m-1}) \rangle, \end{aligned} \tag{7.1.10}$$

where the contour integrals are over circles $\{C_1, \ldots, S_{m-1}\}$, which enclose the points $z_i$ so that they cannot be shrunk down to a point.

We have now replaced an abstract field from the minimal model $\phi_{n,m}$ with
a specific representation given by free fields whose matrix elements are all 
known. We can do this because the left- and right-hand sides of the previous 
equation have the same conformal properties. The final contraction over the 
vertices and currents is now trivial, since all fields are written in terms of free 
bosons. Thus, we have reduced a potentially difficult problem, the calculation 
of N -point functions over minimal fields, to a much simpler, almost trivial one: 
performing line integrals over complex-valued vertices constructed from free 
fields. 


Example: Four-Point Function 

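For example, let us use this deceptively simple method to calculate the explicit value of a correlation function of minimal fields, such that one of the fields is equal to $\phi_{1,2}$ (we now rescale $\alpha \to \sqrt{2}\,\alpha$ to agree with the literature):

$$\begin{aligned} \langle \phi_{n_1,m_1}(0)\, \phi_{1,2}(z)\, \phi_{n_3,m_3}(1)\, \phi_{n_4,m_4}(\infty) \rangle &= \oint dt \, \langle V_{\alpha_1}(0) V_{\alpha_2}(z) V_{\alpha_3}(1) V_{\alpha_4}(\infty) J_+(t) \rangle \\ &= z^{2\alpha_1\alpha_2} (1 - z)^{2\alpha_2\alpha_3} \oint dt \, t^a (t - 1)^b (t - z)^c, \end{aligned} \tag{7.1.11}$$

where $a = 2\alpha_1\alpha_+$, $b = 2\alpha_3\alpha_+$, $c = 2\alpha_2\alpha_+$, $\alpha_2 = \alpha_{1,2} = -\tfrac{1}{2}\alpha_+$, and $\alpha_4 = 2\alpha_0 - \alpha_1 - \alpha_2 - \alpha_3 - \alpha_+$.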

Although the final answer is unique, there is some arbitrariness in defining 
the line integrals (which is eliminated once we fix the monodromy properties 
of the integral). An incorrect choice of the line integral, for example, could 
lead to a vanishing result. 

There are two line integrals in this calculation, corresponding to the two linearly independent solutions to the hypergeometric equation. It is straightforward to write these line integrals, in turn, as hypergeometric functions. Let us define the following:

$$\begin{aligned} I_1(a, b, c; z) &= \int_1^\infty dv \, v^a (v - 1)^b (v - z)^c = \frac{\Gamma(-a - b - c - 1)\,\Gamma(b + 1)}{\Gamma(-a - c)} \, F(-c, -a - b - c - 1, -a - c; z), \\ I_2(a, b, c; z) &= \int_0^z dv \, v^a (1 - v)^b (z - v)^c = z^{1 + a + c} \, \frac{\Gamma(a + 1)\,\Gamma(c + 1)}{\Gamma(a + c + 2)} \, F(-b, a + 1, a + c + 2; z), \end{aligned} \tag{7.1.12}$$

where $F$ is the standard hypergeometric function. The final result for the correlation function is a function of both $z$ and $\bar z$ and is constructed out of $I_1$ and $I_2$. Thus, the correlation function must be a function of

$$G(z, \bar z) = \sum_{i,j} X_{ij} \, I_i(z) \, \overline{I_j(z)}, \tag{7.1.13}$$

where the $X_{ij}$ can be determined by making changes in the contour integrations. The final answer is [3-5]:


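$$\langle \phi_{n_1,m_1}(z_1)\phi_{n_2,m_2}(z_2)\phi_{n_3,m_3}(z_3)\phi_{n_4,m_4}(z_4) \rangle = \frac{|z_{13}|^{\delta_{13}} |z_{24}|^{\delta_{24}}}{|z_{12}|^{\delta_{12}} |z_{23}|^{\delta_{23}} |z_{34}|^{\delta_{34}} |z_{14}|^{\delta_{14}}} \, G(z, \bar z), \tag{7.1.14}$$

where

$$G(\eta) = \sin\pi(a + b + c)\,\sin\pi(b)\,|I_1(a, b, c; \eta)|^2 + \sin\pi(a)\,\sin\pi(c)\,|I_2(a, b, c; \eta)|^2, \tag{7.1.15}$$

and $\eta = z_{12}z_{34}/z_{13}z_{24}$ and

$$\begin{aligned} \delta_{13} &= 2[\Delta(\alpha_1 + \alpha_3 + \alpha_+) - \Delta_1 - \Delta_3 + 2\alpha_+\alpha_2], \\ \delta_{24} &= 2[\Delta(\alpha_2 + \alpha_4) - \Delta_2 - \Delta_4 + 2\alpha_+\alpha_2], \\ \delta_{12} &= -2[\Delta(\alpha_1 + \alpha_2) - \Delta_1 - \Delta_2], \\ \delta_{23} &= -2[\Delta(\alpha_2 + \alpha_3) - \Delta_2 - \Delta_3], \\ \delta_{34} &= -2[\Delta(\alpha_3 + \alpha_4 + \alpha_+) - \Delta_3 - \Delta_4], \\ \delta_{14} &= -2[\Delta(\alpha_1 + \alpha_4 + \alpha_+) - \Delta_1 - \Delta_4], \end{aligned} \tag{7.1.16}$$

where $\Delta$ are the conformal weights.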
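The closed form for the second line integral, $\int_0^z dv\, v^a (1 - v)^b (z - v)^c$, in terms of the hypergeometric function can be checked numerically. The sketch below is only an illustration: the sample exponents and the value of $z$ are arbitrary assumptions chosen so that the integrand is smooth on a real interval $0 < z < 1$, whereas in the actual correlation function the exponents are fixed by the momenta and the integral is defined by analytic continuation.

```python
import math

def hyp2f1(a, b, c, z, terms=200):
    """Gauss hypergeometric series F(a, b, c; z), valid for |z| < 1."""
    total, term = 0.0, 1.0
    for k in range(terms):
        total += term
        term *= (a + k) * (b + k) / ((c + k) * (k + 1)) * z
    return total

def i2_closed_form(a, b, c, z):
    """z^(1+a+c) * Gamma(a+1) Gamma(c+1) / Gamma(a+c+2) * F(-b, a+1, a+c+2; z)."""
    pref = z ** (1 + a + c) * math.gamma(a + 1) * math.gamma(c + 1) / math.gamma(a + c + 2)
    return pref * hyp2f1(-b, a + 1, a + c + 2, z)

def i2_numeric(a, b, c, z, steps=2000):
    """Composite Simpson rule for the integral over [0, z]; `steps` must be even."""
    h = z / steps

    def f(v):
        return v ** a * (1 - v) ** b * (z - v) ** c

    total = f(0.0) + f(z)
    for i in range(1, steps):
        total += (4 if i % 2 else 2) * f(i * h)
    return total * h / 3

# Illustrative exponents only (assumptions, not values fixed by a minimal model).
a, b, c, z = 1.0, 0.5, 2.0, 0.4
print(i2_closed_form(a, b, c, z), i2_numeric(a, b, c, z))
```

The same substitution $v = zs$ that produces the prefactor $z^{1+a+c}$ also turns the integral into Euler's integral representation of $F$, which is why the two numbers agree.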


Example: General Case 


We can now present the general case, which appears formidable but is actually a straightforward application of the ideas presented previously. First, we start with the correlation function of a product of a series of $\phi_{n,m}$. Then, we replace each minimal field with a vertex function

$$\phi_{n,m} \to V_{\alpha_{n,m}}. \tag{7.1.17}$$

To prevent the correlation function from vanishing, we have to insert the requisite number of charges $Q_\pm$ (whose integrands have weight 1) within the correlation function, such that the total “momentum” still vanishes. The contraction over the free boson field can now be trivially performed, and we are left with a series of line integrals. The last step is to write the final correlation function as

$$G(z, \bar z) = \sum_{i,j} X_{ij} \, I_i(z) \, \overline{I_j(z)}, \tag{7.1.18}$$

where $I_i$ are various line integrals, and then use the monodromy properties of the correlation function to fix $X_{ij}$.

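This procedure can be performed for any $N$-point function, but let us present only the final result for the four-point function with four totally arbitrary minimal fields. As before, the four-point function can be written as a function of a set of contour integrals $I^{(nm)}_{lk}$ multiplied by normalization factors [3-5]:

$$\langle \phi_{n_1,m_1}(0)\, \phi_{n_2,m_2}(z)\, \phi_{n_3,m_3}(1)\, \phi_{n_4,m_4}(\infty) \rangle = \sum_{l,k} X^{(nm)}_{lk} \left| I^{(nm)}_{lk}(z) \right|^2. \tag{7.1.19}$$

These contour integrals, in turn, can be written in terms of the following factors:

$$I^{(nm)}_{lk}(z) = N^{(nm)}_{lk} \, F^{(nm)}_{lk}(z), \tag{7.1.20}$$

where

$$N^{(nm)}_{lk} = J_{n-l,m-k}[-a - b - c - 2\rho(m - 2) + 2(n - 2), b; \rho] \; J_{l-1,k-1}(a, c; \rho) \tag{7.1.21}$$

and

$$\begin{aligned} J_{nm}(\alpha, \beta; \rho) = {} & \rho^{2nm} \prod_{i=1}^{n} \frac{\Gamma(i\rho' - m)}{\Gamma(\rho')} \prod_{i=1}^{m} \frac{\Gamma(i\rho)}{\Gamma(\rho)} \\ & \times \prod_{i=0}^{n-1} \frac{\Gamma(1 - m + \alpha' + i\rho')\,\Gamma(1 - m + \beta' + i\rho')}{\Gamma[2 - m + \alpha' + \beta' + (n - 1 + i)\rho']} \\ & \times \prod_{i=0}^{m-1} \frac{\Gamma(1 + \alpha + i\rho)\,\Gamma(1 + \beta + i\rho)}{\Gamma[2 - 2n + \alpha + \beta + (m - 1 + i)\rho]}, \end{aligned} \tag{7.1.22}$$

and the normalization factors $X$ are given by

$$X^{(nm)}_{lk}(a, b, c; \rho) = X^{(n)}_{l}(a', b', c'; \rho') \, X^{(m)}_{k}(a, b, c; \rho), \tag{7.1.23}$$

where

$$\begin{aligned} X^{(m)}_{k}(a, b, c; \rho) = {} & \prod_{i=1}^{k-1} s(i\rho) \prod_{i=1}^{m-k} s(i\rho) \prod_{i=0}^{k-2} \frac{s(1 + a + i\rho)\, s(1 + c + i\rho)}{s[2 + a + c + (k - 2 + i)\rho]} \\ & \times \prod_{i=0}^{m-k-1} \frac{s(1 + b + i\rho)\, s[-1 - a - b - c - 2\rho(m - 2) + i\rho]}{s[-a - c - 2\rho(m - 2) + (m - k - 1 + i)\rho]}, \end{aligned} \tag{7.1.24}$$

where $s(a) = \sin \pi a$ and

$$\begin{aligned} F^{(nm)}_{lk}(z) &= f^{(nm)}_{lk}(z)\, z^{q}, \\ q &= (l - 1)[1 + a' + c' + \rho'(l - 2)] + (k - 1)[1 + a + c + \rho(k - 2)] - 2(l - 1)(k - 1), \end{aligned} \tag{7.1.25}$$

where $f^{(nm)}_{lk}(z)$ is regular at $z = 0$ and $f^{(nm)}_{lk}(0) = 1$. Also: $\rho = \alpha_+^2$, $a' = -\rho^{-1}a$, $b' = -\rho^{-1}b$, $c' = -\rho^{-1}c$.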


7.2 Free Field Realizations of Coset Theories 

So far, we have only used the free field method to analyze simple models, like 
the minimal model. However, the free field method is much more powerful than 
that. Now, we wish to analyze generalized free field constructions in order to 
realize Kac-Moody algebras, coset constructions, and N = 2 superconformal 
theories [8, 9]. Thus, the free field construction is one of the most general 
schemes proposed. 


Free Field Construction of Kac-Moody Algebras 

First, let us analyze the free field Kac-Moody representation by introducing three sets of free fields $\beta_{-\alpha}$, $\gamma_\beta$, $\phi^i$, where $\alpha$ and $\beta$ represent the positive roots of some Lie algebra and $i$ is the index for the Cartan subalgebra. Our goal is to construct the Kac-Moody generators out of these free fields. We postulate the following operator product expansions:

$$\beta_{-\alpha}(z)\gamma_\beta(w) \sim \frac{\delta_{\alpha\beta}}{z - w}, \qquad \phi^i(z)\phi^j(w) \sim -\delta^{ij} \ln(z - w). \tag{7.2.1}$$

First, the generators of the Cartan subalgebra can be written as

$$H^i(z) = -i\alpha_+ \, \partial\phi^i + \sum_{\alpha \in G} \alpha^i \beta_{-\alpha}\gamma_\alpha(z), \tag{7.2.2}$$

where the sum runs over positive roots and where $\alpha_+ = \sqrt{k + g}$, with $g$ the second-order Casimir of the group $G$, that is, $g = n$ for $A_n$, $g = n - 2$ for $SO(n)$ ($n > 5$), and $g = n + 1$ for $C_n$. Likewise, the currents for negative roots can be written as

$$E_{-\alpha}(z) = \beta_{-\alpha}(z) + \sum_{\rho - \sigma = -\alpha} N_{\rho\sigma} \, \gamma_\rho(z)\beta_{-\sigma}(z), \tag{7.2.3}$$

where the $N$ matrices can always be chosen so that the algebra formed by the $E$’s and $H$’s agrees with the usual definition of the Lie algebra, as in Eq. (3.2.6).



Last, the energy-momentum tensor can be written as

$$T(z) = \sum_{\alpha \in G} \beta_{-\alpha}\partial\gamma_\alpha - \frac{1}{2} \sum_{i=1}^{\text{rank}\,G} [\partial\phi^i(z)]^2 - \frac{i}{\alpha_+} \sum_{i=1}^{\text{rank}\,G} \rho^i \, \partial^2\phi^i(z), \tag{7.2.4}$$

where $\rho^i$ is half the sum of the positive roots. (Notice the last term in the expression for the energy-momentum tensor. Because it is linear in the fields, it shows the presence of the screening charges that are typically found in the Feigin-Fuchs construction.)

It is now a simple matter to calculate the operator product expansion of the various operators. By multiplying two energy-momentum tensors, we can calculate the central charge of the algebra, which yields [8, 9]:

$$c_G = \dim G - 12\rho^2/\alpha_+^2. \tag{7.2.5}$$

If we use the “strange formula” of Freudenthal and de Vries,

$$\rho^2 = \tfrac{1}{12}\, g \dim G, \tag{7.2.6}$$

we find the correct central charge of the Kac-Moody algebra:

$$c_G = \frac{k \dim G}{k + g}. \tag{7.2.7}$$

Equations (7.2.2), (7.2.3), and (7.2.4) then define the complete generators of 
the Kac-Moody and Virasoro algebras in terms of Feigin-Fuchs free fields. 
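The central-charge computation above can be illustrated for $SU(N)$. This is a sketch under explicit assumptions that go beyond the text: the positive roots are taken as $e_i - e_j$ ($i < j$) in $\mathbb{R}^N$ with length squared 2, the value $g = N$ is used for $SU(N)$, and the helper names are hypothetical. It checks the strange formula and that the free-field central charge reproduces $k \dim G/(k + g)$.

```python
from fractions import Fraction
from itertools import combinations

def rho_squared_su(N):
    """|rho|^2 for SU(N), with rho = half the sum of positive roots e_i - e_j (i < j)."""
    rho = [Fraction(0)] * N
    for i, j in combinations(range(N), 2):
        rho[i] += Fraction(1, 2)
        rho[j] -= Fraction(1, 2)
    return sum(x * x for x in rho)

def central_charge_free_field(N, k):
    """c = dim G - 12 rho^2 / alpha_+^2 with alpha_+^2 = k + g, assuming g = N for SU(N)."""
    dim_g, g = N * N - 1, N
    return dim_g - 12 * rho_squared_su(N) / (k + g)

def central_charge_sugawara(N, k):
    """c = k dim G / (k + g) for SU(N) at level k."""
    return Fraction(k * (N * N - 1), k + N)

for N in (2, 3, 4):
    # Strange formula: rho^2 = g dim G / 12.
    print(N, rho_squared_su(N), Fraction(N * (N * N - 1), 12))
    print(N, central_charge_free_field(N, 1), central_charge_sugawara(N, 1))
```

Exact rational arithmetic is used so that the two expressions for the central charge can be compared without rounding.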


Free Field Coset Construction 

Next, we will use the free field construction to give us the GKO coset construction. As before, we begin with the same set of fields and the same $T_G$. However, there is a small complication in constructing the current $J'^a$ for the subgroup $H$. We demand that the current have the operator product expansion

$$T_G(z) J'^a(w) \sim \frac{J'^a(w)}{(z - w)^2} + \frac{\partial J'^a(w)}{z - w}. \tag{7.2.8}$$

We demand that $J'^a$ also have the same operator product expansion with respect to $T_H$. Thus, the difference $T_{G/H} = T_G - T_H$ has the following operator product expansion:

$$T_{G/H}(z) J'^a(w) \sim 0. \tag{7.2.9}$$

The trick is to find the representation of $J'^a$ in terms of free fields. If we naively take the construction used previously for Kac-Moody operators, we find that free fields will not work.

We will, therefore, have to modify some of our operators in order to construct $J'^a$. We will construct the generators of the subgroup $H$ out of modified fields denoted by a prime, identifying the fields $\beta'_{-\alpha}$ and $\gamma'_\alpha$ with those of the full group $G$. To calculate $\phi'$, we equate the generators $H^i$ and $H'^i$. Equating the two, we find all the terms are the same except

$$-i\sqrt{k + h} \, \partial\phi'^i(z) = -i\sqrt{k + g} \, \partial\phi^i(z) + \sum_{\alpha \in G/H} \alpha^i \beta_{-\alpha}\gamma_\alpha. \tag{7.2.10}$$

Then, we easily find the expression for $J'^a$, as well as [8, 9]:

$$\begin{aligned} T_{G/H}(z) = {} & \sum_{\alpha \in G/H} \beta_{-\alpha}(z)\partial\gamma_\alpha(z) - \tfrac{1}{2}[\partial\phi(z)]^2 - \frac{i}{\sqrt{k + g}} \, \rho_G \cdot \partial^2\phi(z) \\ & - \left( -\tfrac{1}{2}[\partial\phi'(z)]^2 - \frac{i}{\sqrt{k + h}} \, \rho_H \cdot \partial^2\phi'(z) \right). \end{aligned} \tag{7.2.11}$$


Free Fields and Supercoset Models 

Next, we can use the free field construction to give us the superconformal theory as well. The generalization to the superconformal case is straightforward. We simply double all the fields by including a Grassmann variable, so $Z = (z, \theta)$. The operator product expansion of the free fields now generalizes to

$$B_{-\alpha}(Z) C_\beta(Z') \sim \frac{\delta_{\alpha\beta}(\theta - \theta')}{Z - Z'}, \qquad \Phi^i(Z)\Phi^j(Z') \sim -\delta^{ij} \ln(Z - Z'). \tag{7.2.12}$$

The generators of the Cartan subalgebra become

$$H^i(Z) = -i\sqrt{k} \, D\Phi^i(Z) + \sum_{\alpha \in G} \alpha^i B_{-\alpha}(Z) C_\alpha(Z). \tag{7.2.13}$$

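In this way, we find that the energy-momentum tensor is given by

$$T_G(Z) = \frac{1}{2} \sum_{\alpha \in G} [B_{-\alpha}\partial C_\alpha(Z) + DB_{-\alpha} DC_\alpha(Z)] - \frac{1}{2} D\Phi(Z) \cdot D^2\Phi(Z) - \frac{i}{\sqrt{k}} \, \rho_G \cdot D^3\Phi(Z). \tag{7.2.14}$$

As expected, we find that the central charge is given by [8, 9]:

$$c_G = \tfrac{3}{2} \dim G - \frac{12\rho_G^2}{k} = \left( \tfrac{3}{2} - \tfrac{g}{k} \right) \dim G. \tag{7.2.15}$$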

As before, we find that the naive construction of the Kac-Moody current fails for free fields. Once again, we construct the generators of the subgroup $H$ by identifying the fields $B'_{-\alpha}$ and $C'_\alpha$ with those of the group $G$, and the fields $\Phi'$ are also chosen by setting $H^i$ equal to $H'^i$, giving us

$$D\Phi'^i(Z) = D\Phi^i(Z) + \frac{i}{\sqrt{k}} \sum_{\alpha \in G/H} \alpha^i B_{-\alpha} C_\alpha(Z). \tag{7.2.16}$$



The energy-momentum tensor of the coset can now be represented by

$$\begin{aligned} T_{G/H}(Z) = {} & \frac{1}{2} \sum_{\alpha \in G/H} [B_{-\alpha}(Z)\partial C_\alpha(Z) + DB_{-\alpha}(Z) DC_\alpha(Z)] \\ & - \frac{1}{2} D\Phi(Z) \cdot D^2\Phi(Z) - \frac{i}{\sqrt{k}} \, \rho_G \cdot D^3\Phi(Z) + \frac{1}{2} D\Phi'(Z) \cdot D^2\Phi'(Z). \end{aligned} \tag{7.2.17}$$


Free Fields and N = 2 Superconformal Algebra 

Last, we show that the N = 2 superconformal field theories can also be written 
via free fields. The problem, as we saw in the previous chapter, is to find a scalar 
current J(Z) that will generalize the N = 1 algebra into an N = 2 algebra. 

In terms of free fields, the current is 


■nz)= £ 

aeG/H 


B- a DC a 


+ j(p c ■ a)D(B- a C a ) + -j= [B- a C a (a ■ D$>) + a ■ D 2 <1>] j 


+ £ A a , p [2D(B_ a C a ) + B. a C a B^C^\, (7.2.18) 

a,pe.G/H 


where $A_{\alpha,\beta}$ is an antisymmetric matrix that does not contribute to the energy-momentum tensor. This field, in turn, has the correct operator product expansion [8, 9]:


J(Z)J(Z') ~ 
T(Z)7(Z') ~ 


3(Z - Z') 

( 0-00 
(Z - Z') 2 




Z - Z' 


HZ') + 


l 


2(Z - Z) 2 


(7.2.19) 

D'J{.Z’) + y^ jd'J(Z’). 


In this way, we can construct free field representations for the N = 2 theories 
of Gepner, Kazama, and Suzuki. 


7.3 Landau-Ginzburg Potentials 

The Landau-Ginzburg method [10] approaches conformal field theory from a different point of view, using renormalization group arguments with an initially nonconformally invariant theory. We start with a scalar theory with the following interaction term:

$$L \sim g \int d^2x \, \Phi^{2(p-1)} \tag{7.3.1}$$

for fixed $p$. We notice immediately that the theory is not conformally invariant by dimensional arguments. However, it is possible to calculate the $\beta$ function of the theory and find where it vanishes, that is, its fixed points. When the $\beta$ function vanishes, the theory becomes conformally invariant.

In this way, it is possible to find the relationship between the fixed points of the Landau-Ginzburg action, where the theory becomes conformally invariant, and known conformal field theories. In particular, we will compare the composite operators for fixed $p$ that one can construct from the above interaction with the operators appearing in the minimal series, and we shall argue that there is a correspondence between them. We find that the theory defined by the potential $\Phi^{2(p-1)}$ at criticality corresponds to the familiar unitary minimal model with $c = 1 - 6/p(p + 1)$.

To show the relation, we will argue that $:\Phi^k:$ has the same conformal expansion as one of the minimal fields $\phi_{k+1,k+1}$, and hence we can establish a one-to-one correspondence between Landau-Ginzburg composite operators and minimal primary fields.
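To make the proposed dictionary concrete, the central charge and the weights of the diagonal fields $\phi_{k+1,k+1}$ can be tabulated. The sketch below assumes the standard Kac formula $h_{r,s} = \{[r(p+1) - sp]^2 - 1\}/[4p(p+1)]$ for the unitary series (the text refers to Eq. (2.5.7) for this); the function names are illustrative.

```python
from fractions import Fraction

def minimal_c(p):
    """Central charge c = 1 - 6/(p(p+1)) of the unitary minimal model."""
    return 1 - Fraction(6, p * (p + 1))

def diagonal_weight(k, p):
    """Kac weight of phi_{k+1,k+1}; on the diagonal r = s the formula reduces to k(k+2)/(4p(p+1))."""
    return Fraction((k + 1) ** 2 - 1, 4 * p * (p + 1))

for p in (3, 4, 5):
    # k = 0, ..., p - 2 labels the composite operators :Phi^k: on the diagonal.
    weights = [diagonal_weight(k, p) for k in range(p - 1)]
    print(p, minimal_c(p), weights)
```

For $p = 3$ (the potential $\Phi^4$) this gives $c = 1/2$ and the weight $1/16$ for the field matched to $\Phi$, which is the Ising spin field.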

At the fixed point, one finds a series of composite operators that are equal to powers of the scalar field $\Phi^n$ for $n = 1, 2, \ldots, 2p - 4$, as well as derivatives of these fields. (For the sake of argument, we will assume that we have averaged over all two-dimensional directions so that derivative terms will be dropped.) Fields with powers higher than $2p - 4$ are discarded. (This is because the equations of motion for the theory

$$\partial_{\bar z}\partial_z \Phi \sim \Phi^{2p-3} \tag{7.3.2}$$

show that $\Phi^{2p-3}$ must have dimension greater than 2. However, according to renormalization group theory, the addition of operators with dimension greater than 2 does not change the point to which the renormalization group flows.)

These composite fields, of course, have no meaning until we define what 
they mean by proper normal ordering. However, for higher powers of the field 
the definition of normal ordering is ambiguous (because we must subtract 
divergent terms that are now operators, not just ordinary numbers). 

To provide a self-consistent definition of normal ordering for higher-order operators, let us first reexamine the operator product expansion for two fields

$$\Phi(z)\Phi(0) - \langle \Phi(z)\Phi(0) \rangle \sim |z|^{d_2 - 2d_1} \Phi^2(0) + \cdots, \tag{7.3.3}$$

which serves to define the composite field $:\Phi^2: \, = \Phi^2$ (where $d_i$ is the anomalous dimension of the $i$th field). We can use the above equation to successively define what we mean by normal ordering for higher powers.



Thus, assuming that the $k$th composite field is well defined, we can make sense out of expressions like $\Phi^{k+1}$ via

$$:\Phi^{k+1}:(0) = \lim_{z \to 0} |z|^{d_1 + d_k - d_{k+1}} \left[ :\Phi(z): \Phi^k(0) - \sum_{l=1}^{k/2} A_l \, |z|^{d_{k-2l+1} - d_1 - d_k} :\Phi^{k-2l+1}:(0) \right], \tag{7.3.4}$$

where the coefficients $A_l$ are chosen so that the series is well defined.

Now that the operator product expansion for all composite fields is well defined, let us compare this with the operator product expansion found for minimal models, for example,

$$\phi_{2,2}\, \phi_{n,m} \sim \sum_{k,l} X_{k,l} \, [\phi_{n+k,m+l} + \cdots]. \tag{7.3.5}$$

If we make the correspondence $\Phi \leftrightarrow \phi_{2,2}$ and compare the two sets of operator product expansions, then we find that we can make the correspondence between the two sets of fields. In the previous chapters, we computed the value of the structure constants, so it is now a simple matter to compare the two operator product expansions for $:\Phi^k:$ and for the minimal primary fields $\phi_{n,m}$. We find that we can make the correspondence [10]:

$$\begin{aligned} :\Phi^k: \; &\leftrightarrow \; \phi_{k+1,k+1}, & k &= 0, 1, \ldots, p - 2, \\ :\Phi^k: \; &\leftrightarrow \; \phi_{k-p+2,k-p+1}, & k &= p - 1, p, p + 1, \ldots, 2p - 4. \end{aligned} \tag{7.3.6}$$

Now, let us analyze the correspondence between the superconformal minimal series and a superfield Landau-Ginzburg theory at criticality. Let us start with the Landau-Ginzburg action

$$L = \int d^2z \, d^2\theta \, [\tfrac{1}{2} D\Phi \bar D\Phi + g\Phi^p], \tag{7.3.7}$$

where we introduce the following derivatives:

$$D = \partial_\theta - \theta\partial_z, \qquad \bar D = \partial_{\bar\theta} - \bar\theta\partial_{\bar z}, \tag{7.3.8}$$

and

$$\Phi = \phi + \theta\psi + \bar\theta\bar\psi + \theta\bar\theta\chi. \tag{7.3.9}$$

By contrast, the superconformal minimal series is defined by the series [see Eq. (2.7.10)]:

$$c = \frac{3}{2} - \frac{12}{p(p + 2)}, \qquad p = 2, 3, \ldots. \tag{7.3.10}$$



The primary fields $\phi_{n,m}$ obey

$$\phi_{n,m} = \phi_{p+2-n,p-m}, \qquad n = 1, 2, \ldots, p + 1, \quad m = 1, 2, \ldots, p - 1, \tag{7.3.11}$$

for $m + n$ even. This field has dimension

$$h_{nm} = \frac{[np - m(p + 2)]^2 - 4}{8p(p + 2)}. \tag{7.3.12}$$

Then, by once again making the correspondence $\Phi \leftrightarrow \phi_{2,2}$ and by checking the one-to-one correspondence between the operator product expansion of $\Phi^k$ and $\phi_{n,m}$, we can make the correspondence

$$:\Phi^k: \; \leftrightarrow \; \phi_{k+1,k+1}, \qquad k = 0, 1, 2, \ldots, p - 2. \tag{7.3.13}$$


The same correspondence can be established for the $N = 2$ superconformal series. The Landau-Ginzburg action for this theory is given by

$$L = \int d^2z \, d^4\theta \, D(\Phi, \bar\Phi) + g \int d^2z \, d^2\theta \, F(\Phi), \tag{7.3.14}$$

where the first term is called the $D$ term and is integrated over all four values of $\theta$, while the chiral $F$ term is integrated over only two of them. We choose $F = \Phi^n$.

Let us now compare this with the superconformal minimal series, which is given by

$$c = 3 - (6/n). \tag{7.3.15}$$

The fields are given by $\phi_{jm}$, which have conformal weights $h_{jm}$ and $U(1)$ charges $q_{jm}$ given by

$$h_{jm} = \frac{j(j + 2) - m^2}{4n}, \qquad q_{jm} = \frac{m}{n}. \tag{7.3.16}$$

Finally, we make the crucial identification of

$$\Phi^p \leftrightarrow \phi_{p,p}. \tag{7.3.17}$$

7.4 N = 2 Chiral Rings 

The most physically interesting case is the renormalization flow of the Landau-Ginzburg potentials with $N = 2$ superconformal symmetry. As we emphasized earlier, the only consistent theories of interacting superstrings have at least $N = 1$ supersymmetry once we constrain the one-loop amplitudes to be modular invariant. Thus, $N = 1$ space-time supersymmetry (or $N = 2$ superconformal symmetry) seems to be the minimum symmetry required in model building. (If $N = 2$ space-time supersymmetry survived after compactification, then left and right multiplets would appear in the same supersymmetric representation, which is phenomenologically undesirable.)



Several new features emerge when we discuss renormalization flows and Landau-Ginzburg potentials for $N = 2$ superconformal models. First, there is an interesting rotation one can perform on the generators of the $N = 2$ algebra, which turns the NS sector into the R sector and vice versa. Let us define an operator $U_\theta$ that has the following properties:

$$\begin{aligned} U_\theta^{-1} L_n U_\theta &= L_n + \theta J_n + \frac{c}{6}\theta^2 \delta_{n,0}, \\ U_\theta^{-1} J_n U_\theta &= J_n + \frac{c}{3}\theta \, \delta_{n,0}, \\ U_\theta^{-1} G_r^+ U_\theta &= G_{r+\theta}^+, \\ U_\theta^{-1} G_r^- U_\theta &= G_{r-\theta}^-. \end{aligned} \tag{7.4.1}$$

It is easy to check that the deformed generators still satisfy the same commutation relations as the original algebra. Thus, the operator $U_\theta$ maps the original Hilbert space into a rotated Hilbert space parametrized by $\theta$.

Under this rotation, the $U(1)$ charge and dimension of a state shift by the following amount:

$$q \to q + (c\theta/3), \qquad h \to h + \theta q + (c\theta^2/6). \tag{7.4.2}$$
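The shifts of $h$ and $q$ under this rotation are simple enough to tabulate. The following sketch applies them with exact rational arithmetic; the function name and the sample state are illustrative assumptions. With the sign conventions of the shift formulas above, a state obeying $h = q/2$ is carried by $\theta = -\tfrac{1}{2}$ to a state with $h = c/24$, since $h - q/2 + c/24 = c/24$.

```python
from fractions import Fraction

def spectral_flow(h, q, c, theta):
    """Return the shifted pair (h, q): h -> h + theta*q + c*theta^2/6, q -> q + c*theta/3."""
    return h + theta * q + c * theta ** 2 / 6, q + c * theta / 3

# Illustrative input (an assumption): c = 1 and a state with h = q/2.
c = Fraction(1)
h, q = Fraction(1, 6), Fraction(1, 3)

flowed_h, flowed_q = spectral_flow(h, q, c, Fraction(-1, 2))
print(flowed_h, flowed_q)

# Flowing by theta and then by -theta should return the original state.
print(spectral_flow(flowed_h, flowed_q, c, Fraction(1, 2)))
```

The round trip in the last line is a quick consistency check that the two shift rules compose as a one-parameter family.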


If $\theta$ is half-integral, then the rotation maps the integer (half-integer)-moded $G_r$ operators into half-integer (integer)-moded operators. Thus, we have the remarkable fact that, for $\theta \in \mathbb{Z} + \tfrac{1}{2}$, the NS algebra rotates into an R algebra and vice versa. This deformation of the algebra is called the spectral flow connecting the NS and R algebras [18]. To show that this spectral flow is not a fluke, an explicit representation of $U_\theta$ can be written if we introduce a scalar field $\phi$, which bosonizes the $J$ current:

$$J(z) = \partial\phi(z), \qquad U_\theta = e^{i\theta\phi}. \tag{7.4.3}$$

Next, we wish to construct representations of this $N = 2$ algebra, in order to construct the Landau-Ginzburg potentials. We make a few definitions. A left chiral NS field is one that satisfies

$$G^+_{-1/2} |\phi\rangle = 0. \tag{7.4.4}$$

This notation comes from the theory of supersymmetry, where a chiral superfield $\Phi(x, \theta)$ is one that satisfies $\bar D\Phi(x, \theta) = 0$. (Since $Q$ and $\bar D$ anticommute, placing this restriction on $\Phi$ does not affect the fact that it is still a representation of supersymmetry.)

Second, we define a primary field for the $N = 2$ theory as one that satisfies both

$$G^-_{n+1/2} |\phi\rangle = G^+_{n+1/2} |\phi\rangle = 0, \qquad n \geq 0, \tag{7.4.5}$$



in analogy with the usual definition of primary fields for bosonic fields. A primary chiral field is one which satisfies both conditions.

Let us take the anticommutator of these conditions:

$$\{G^-_{1/2}, G^+_{-1/2}\} |\phi\rangle = (2L_0 - J_0) |\phi\rangle = 0. \tag{7.4.6}$$

Therefore, a primary chiral field satisfies

$$h = q/2. \tag{7.4.7}$$

(If the condition $h = -q/2$ is satisfied, then we call it an antichiral field.) Similarly, if we take the anticommutator

$$\{G^-_{3/2}, G^+_{-3/2}\} = 2L_0 - 3J_0 + 2c/3, \tag{7.4.8}$$

then we have

$$h \leq c/6 \tag{7.4.9}$$

for any primary chiral field.
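The step from the second anticommutator to the bound on $h$ can be spelled out as follows. This is a sketch that assumes a unitary theory, so that $(G^+_{-3/2})^\dagger = G^-_{3/2}$ and all norms are nonnegative, and a normalized state; these assumptions are not stated explicitly in the text above.

```latex
% Sketch: assumes unitarity, (G^+_{-3/2})^\dagger = G^-_{3/2}, and <phi|phi> = 1.
\begin{align*}
0 &\le \bigl\| G^+_{-3/2} |\phi\rangle \bigr\|^2 + \bigl\| G^-_{3/2} |\phi\rangle \bigr\|^2
   = \langle \phi | \{ G^-_{3/2}, G^+_{-3/2} \} | \phi \rangle \\
  &= 2h - 3q + \tfrac{2c}{3}
   = -4h + \tfrac{2c}{3} \qquad (\text{using } q = 2h), \\
  &\Longrightarrow \quad h \le \tfrac{c}{6}.
\end{align*}
```

For a primary state the second norm vanishes by the primary condition, so the inequality comes entirely from the norm of $G^+_{-3/2}|\phi\rangle$.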

This has two very interesting consequences. First, it shows that there are only a finite number of primary chiral operators, which is unexpected. (This is because the dimension of each primary chiral operator is less than or equal to $c/6$, but the spectrum of $L_0$ is discrete, which can only be satisfied if we have a finite number of primary chiral operators.) Second, it shows that the algebra formed by taking operator product expansions of products of primary chiral fields produces a finite chiral ring $R_{\text{chiral}}$ of operators [11-16].

If we take the operator product expansion of two primary chiral fields $\phi_1$ and $\phi_2$, then we will produce a composite operator $\phi_{12}$ with the following dimension:

$$h_{12} \geq \tfrac{1}{2}(q_1 + q_2) = h_1 + h_2. \tag{7.4.10}$$

The product of primary chiral fields also produces primary chiral fields. Since there are only a finite number of them, we obtain a finite chiral ring $R_{\text{chiral}}$ of such operators whose products form a closed system.

Since many of the properties of an $N = 2$ superconformal field theory are determined once $R_{\text{chiral}}$ is fixed, our goal in the next section is to find some way in which to mathematically categorize the various possible $R_{\text{chiral}}$. This is where catastrophe theory enters our discussion of string theory.


7.5 N = 2 Landau-Ginzburg and Catastrophe Theory 

We now make contact between the $N = 2$ superconformal chiral rings and the Landau-Ginzburg formalism, which we began earlier. Previously, in Eq. (7.3.14), we constructed the most general superpotential involving the chiral superfields $\Phi_i$ and $\bar\Phi_i$. The first term contained an integration over all four $\theta$’s, and is called the $D$ term, while the second contained an integration over only two $\theta$’s, meaning that $F(\Phi)$ is a chiral superfield.



Let us introduce a new $F$ term, called $W$. Let us scale the superfields $x_i$ contained within $W$ according to $x_i \to \lambda^{n_i} x_i$. Then, we define $W$ to have the following scaling property:

$$W(\lambda^{n_i} x_i) = \lambda^d W(x_i). \tag{7.5.1}$$

If $W$ has $U(1)$ charge $(1, 1)$, then this means that $x_i$ must have charge $q_i = n_i/d$. We see that each superfield scales differently according to $n_i$, but that the overall function $W$ scales by the same amount, regardless of how it depends on the various $x_i$.

Our next task is to construct the ring formed by taking all products of the superfields $x_i$, modulo terms that contain factors of $\partial_j W(x_i)$. This ring has a finite number of terms, and can be written symbolically as

$$R_{LG} = \frac{\mathbb{C}[x_i]}{[\partial_j W]}. \tag{7.5.2}$$

(This means that whenever factors of the derivatives of $W$ appear in the monomials formed by $x_i$, we set them to zero.) The interesting conclusion that we will draw is that, for a wide variety of models, the two rings are the same [11-16]:

$$R_{\text{chiral}} = R_{LG}. \tag{7.5.3}$$

This is a powerful result that will significantly help us in the task of categorizing the possible $N = 2$ superconformal field theories via Landau-Ginzburg potentials.

Second, we can further the identification of superconformal theories by calculating their central charge. This will help to identify the various possible conformal field theories. Let us rescale the two-dimensional metric on the world sheet by an overall factor $\lambda$. In this case, the partition function $Z$ for a conformal field theory defined on a sphere also gets rescaled. The effect of this rescaling has already been computed using functional methods [11, 19]:

$$g_{ab} \to \lambda g_{ab}, \qquad Z \to \exp\left( \frac{c}{48\pi} \ln\lambda \int R \right) Z = \lambda^{c/6} Z, \tag{7.5.4}$$

where $R$ is the curvature on the world sheet. The last integral can be evaluated since the integral of the curvature tensor yields $8\pi$ for the sphere.

The previous result was independent of the specific model we are analyzing. Now, let us take a specific model and perform this rescaling. We have to take the product of several different contributions.

First, we have the rescaling of the functional measure. The measure gets rescaled by $\lambda^{c/6}$, so we must calculate the $c$ for each superfield. Each boson contributes $c = 1$, and each fermion $c = \tfrac{1}{2}$. Because a superfield has a complex boson and a complex fermion, the value of $c$ is 3, so the measure scales as $\lambda^{1/2}$.



Next, we have the rescaling of the $W$ term, because the fields rescale as

$$\Phi_i \to \lambda^{d_i} \Phi_i, \tag{7.5.5}$$

where $d_i$ is the $U(1)$ charge of the field.

Now, let us calculate the contribution to the functional measure due to rescaling, using the fact that the potential is quasi-homogeneous. The contributions are

$$\begin{aligned} \text{bosons:} \quad & \lambda^{-2d_i \text{Tr}(1)}, & \text{Tr}(1) &= \frac{1}{24\pi} \int R = \frac{1}{3}, \\ \text{fermions:} \quad & \lambda^{+2d_i \text{Tr}(1)}, & \text{Tr}(1) &= -\frac{1}{48\pi} \int R = -\frac{1}{6}, \end{aligned} \tag{7.5.6}$$

so that the Jacobian contributes a total factor of $\lambda^{-d_i}$.

Putting all factors together, we find that the partition function scales as

$$Z \to \lambda^a Z, \qquad a \equiv \sum_i \left( \tfrac{1}{2} - d_i \right). \tag{7.5.7}$$

Since this scale factor must equal $\lambda^{c/6}$, we have

$$\frac{c}{6} = \sum_i \left( \tfrac{1}{2} - d_i \right), \tag{7.5.8}$$

so $c$ is simply defined via the $U(1)$ charges $d_i$ of the various independent superfields within the potential.
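The resulting rule, $c = 6\sum_i (\tfrac{1}{2} - d_i)$, is easy to evaluate for a given list of charges. The sketch below is illustrative: the function name is hypothetical, and the sample charges (for example $d = 1/n$ for a single field with potential $x^n$) follow from the scaling property of $W$ rather than from anything computed here.

```python
from fractions import Fraction

def central_charge(charges):
    """c = 6 * sum_i (1/2 - d_i), given the U(1) charges d_i of the superfields."""
    return 6 * sum(Fraction(1, 2) - Fraction(d) for d in charges)

# One superfield with W = x^n has charge 1/n, so c = 3 - 6/n.
for n in (3, 4, 5):
    print(n, central_charge([Fraction(1, n)]))

# Charges simply add for several independent superfields.
print(central_charge([Fraction(1, 3), Fraction(1, 3)]))
```

For a single field this reproduces $c = 3 - 6q$, the form used in the worked example that follows in the text.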


Example: Free Boson on a Circle 


To illustrate these ideas, let us consider several examples. First, let us consider the simplest possible $N = 2$ superconformal model, the theory of a free boson $\phi$ (with $c = 1$) defined on a circle with fixed radius. One finds that an explicit representation of the $N = 2$ algebra is given by

$$\begin{aligned} G^\pm &= e^{\pm i\sqrt{3}\phi_L}, & \bar G^\pm &= e^{\pm i\sqrt{3}\phi_R}, \\ J(z) &= (i/\sqrt{3})\, \partial\phi, & \bar J(\bar z) &= -(i/\sqrt{3})\, \bar\partial\phi, \end{aligned} \tag{7.5.9}$$

subject to the condition that the allowed winding (momentum) modes are of the form

$$\exp[i(n_L\phi_L - n_R\phi_R)/\sqrt{12}] \tag{7.5.10}$$

with $n_L - n_R = 0 \bmod 6$ (before a GSO projection).

The only primary chiral states are the vacuum and the state that has $h_L = q_L/2 = h_R = q_R/2 = \tfrac{1}{6}$, which we denote by $X$. Since the product of primaries is either another primary or zero, we find that $X^2$ is not a primary and hence must be zero. The chiral ring of this superconformal model is simple:

$$R_{\text{chiral}} = \{1, X\}, \qquad X \cdot X = 0. \tag{7.5.11}$$



214 7. Toward a Classification of Conformal Field Theories 


Now, compare this chiral ring with the ring formed by starting with the
Landau-Ginzburg potential

$$W = x^3,$$ (7.5.12)

where x is a chiral superfield (not the X of the previous discussion). The
Landau-Ginzburg ring is formed by taking all possible products of 1 and x,
modulo all possible derivatives of W, that is, modulo $x^2$. But this leaves only
two elements in the ring, 1 and x; so

$$R_{\rm LG} = \{1, x\}, \qquad x \cdot x = 0.$$ (7.5.13)

Comparing Eqs. (7.5.11) and (7.5.13), we find that we are back to the same
ring structure as the chiral ring, that is, $R_{\rm chiral} = R_{\rm LG}$.

The final link between these two rings is their central charge. We know that
c = 1 for the chiral ring. If we scale by $x \to \lambda^{n_1} x$, then, from Eq. (7.5.1),

$$(\lambda^{n_1} x)^3 = \lambda^{d}(x^3),$$ (7.5.14)

so that $3n_1 = d$, or that $q_1 = n_1/d = \frac{1}{3}$. Since the central charge c in Eq.
(7.5.8) equals 3 - 6q, we find that c = 1, as expected.
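The arithmetic of Eq. (7.5.8) can be checked mechanically. The following is a minimal sketch, assuming the central charge is the sum of 3 - 6q over the fields, with q the U(1) charge n/d of each field; the function name `central_charge` is purely illustrative.

```python
from fractions import Fraction

def central_charge(charges):
    """Return c = sum_i (3 - 6 q_i) for the U(1) charges q_i = n_i / d."""
    return sum(3 - 6 * Fraction(q) for q in charges)

# W = x^3: a single field with charge q = 1/3
print(central_charge([Fraction(1, 3)]))
```

Exact rational arithmetic is used so that values such as c = 1 are reproduced without rounding.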

Example: Catastrophe Theory 

Fortunately, it is now possible to use the mathematical theory of catastrophes
(singularity theory) [16] in order to classify the various types of supersymmetric
Landau-Ginzburg potentials. Catastrophe theory, like superconformal
field theory, is interested in the behavior of functions such as W under a
rescaling. Let us introduce a few simple definitions from catastrophe theory. Let the
dimension of the ring $R_{\rm LG}$ equal $\mu$, which is called the criticality type. For
example, $\mu = 2$ in the previous discussion.

The question that we will address is: if we add a deformation $\delta W$ to W, will
the deformation change the criticality type? We will therefore introduce the
“modality” of a singularity, that is, the number of parameters in $\delta W$ that one
can add to W without changing $\mu$ and that cannot be eliminated by a coordinate
transformation.

It can be shown that the list of potentials with zero modality can be arranged
according to an A-D-E classification, that is, there is a one-to-one
correspondence between a zero modality potential and a simply laced Lie group. If N is the
Coxeter number of the Lie group, then it can be shown that the zero modality
potentials have the central charge

$$c = 3 - \frac{6}{N}.$$ (7.5.15)





This means that the zero modality potentials can be represented as

$$\begin{array}{lll}
A_k: & x^{k+1}, & c = 3 - \frac{6}{k+1} \quad (k \geq 1), \\
D_k: & x^{k-1} + xy^2, & c = 3 - \frac{6}{2(k-1)} \quad (k \geq 2), \\
E_6: & x^3 + y^4, & c = 3 - \frac{6}{12}, \\
E_7: & x^3 + xy^3, & c = 3 - \frac{6}{18}, \\
E_8: & x^3 + y^5, & c = 3 - \frac{6}{30}.
\end{array}$$ (7.5.16)


Let us analyze some of these examples in more detail.

Example: A_k

Using the definition of a quasi-homogeneous function, we can calculate the
U(1) charge for each of the variables in the potential, and then we can calculate
the central charge. For the first example, $A_k$, the charge of x can be calculated
using the quasi-homogeneous equation Eq. (7.5.1):

$$W(\lambda^{n_x} x) = \lambda^{(k+1)n_x} x^{k+1} = \lambda^{d} W(x),$$ (7.5.17)

which gives us the U(1) charge

$$\frac{n_x}{d} = \frac{1}{k+1}.$$ (7.5.18)

We can then plug this expression for the charge into the equation for the central
charge, Eq. (7.5.8), giving us

$$\frac{c}{3} = 1 - \frac{2}{k+1},$$ (7.5.19)

which is the expression in Eq. (7.5.16).

Next, we can calculate the ring associated with this potential by taking all
possible monomials ($x^n$) modulo the derivative

$$dW/dx = (k+1)x^{k}.$$ (7.5.20)

This, in turn, means that $x^k \sim 0$, so the elements in the ring stop at $x^{k-1}$:

$$R_{\rm LG} = \{1, x, x^2, \ldots, x^{k-1}\}.$$ (7.5.21)

The criticality type is $\mu = k$. The modality is also zero. [This is because
we cannot add $\delta W$ to W without changing $\mu$. If we add $x^m$ (m < k + 1),
then, since the derivatives of W are changed, we find that $\mu$ also changes. If
we add $x^m$ (m > k + 1), then this can be absorbed by a general coordinate
transformation.]






Example: D_k

For the second example, $D_k$, we calculate the charges for x and y by solving
the quasi-homogeneous equations

$$\lambda^{n n_x} = \lambda^{d}, \qquad \lambda^{n_x + 2n_y} = \lambda^{d},$$ (7.5.22)

where n = k - 1. This gives us $n_x = d/n$ and $n_y = d(n-1)/2n$. Now, let us
insert these values for the U(1) charges into the equation for the central charge
in Eq. (7.5.8), and we find

$$\frac{c}{3} = 2 - \frac{2}{n} - \frac{n-1}{n},$$ (7.5.23)

giving us the expression in Eq. (7.5.16).

Now, let us calculate the Landau-Ginzburg ring for this potential, which is
constructed out of all possible monomials $x^i y^j$ modulo the derivatives of W,
that is, we set equal to zero the following:

$$\partial W/\partial x = n x^{n-1} + y^2 \sim 0, \qquad \partial W/\partial y = 2xy \sim 0.$$ (7.5.24)

It is now easy to show that the complete set of monomials is equal to

$$R_{\rm LG} = \{1, x, x^2, \ldots, x^{n-1}, y\}.$$ (7.5.25)

Then we have $\mu = n + 1$, and the modality is zero.


Example: E_6

The third example, $E_6$, can also be analyzed the same way. The rescaling of
W gives us

$$\lambda^{3n_x} x^3 + \lambda^{4n_y} y^4 = \lambda^{d}(x^3 + y^4),$$ (7.5.26)

which gives us $n_x = d/3$ and $n_y = d/4$. Examining the central charge with
these values for the charges, we find the value in Eq. (7.5.16):

$$c = \left(3 - \tfrac{6}{3}\right) + \left(3 - \tfrac{6}{4}\right) = 3 - \tfrac{6}{12}.$$ (7.5.27)

Now, let us calculate the derivatives of the potential

$$\partial W/\partial x = 3x^2 \sim 0, \qquad \partial W/\partial y = 4y^3 \sim 0.$$ (7.5.28)

The elements of the ring are, therefore,

$$R_{\rm LG} = \{1, x, y, xy, y^2, xy^2\}.$$ (7.5.29)

The ring has six elements, so the criticality type is $\mu = 6$, and the modality is zero.

Example: E_8

The last example, $E_8$, has charges for x and y given by $\frac{1}{3}$ and $\frac{1}{5}$, respectively.
The value of the central charge is given by $(3 - \frac{6}{3}) + (3 - \frac{6}{5}) = 3 - \frac{6}{30}$, which
agrees with the value in Eq. (7.5.16). Since the derivatives of the potential are
proportional to $x^2$ and $y^4$, the elements of the ring must be given by

$$R_{\rm LG} = \{1, x, y, xy, y^2, xy^2, y^3, xy^3\}.$$ (7.5.30)

Then, $\mu = 8$ and the modality is zero.
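For potentials of the Fermat form $x^p + y^q$ (such as $E_6$ and $E_8$), the derivatives are pure powers, so the ring can be listed by brute force. The sketch below assumes exactly this form; the helper name `fermat_ring_basis` is hypothetical, and each pair (a, b) stands for the monomial $x^a y^b$.

```python
from itertools import product

def fermat_ring_basis(p, q):
    """Monomial basis of C[x, y] / (x^(p-1), y^(q-1)) for W = x^p + y^q.

    Each entry (a, b) represents the monomial x^a y^b.
    """
    return [(a, b) for a, b in product(range(p - 1), range(q - 1))]

e6 = fermat_ring_basis(3, 4)   # W = x^3 + y^4
e8 = fermat_ring_basis(3, 5)   # W = x^3 + y^5
print(len(e6), len(e8))        # criticality types mu for E_6 and E_8
```

The number of basis elements is (p - 1)(q - 1), which gives the criticality type directly for these two cases.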

Armed with this new, powerful formulation, let us now reinvestigate the
miraculous relationship between N = 2 superconformal field theories and
Calabi-Yau manifolds which we studied in Chapter 5. Each formulation is
based on a different series of assumptions and mathematics, yet both seem to
be equivalent. We end this section by presenting a heuristic argument which
attempts to explain the deeper, underlying reason why tensoring N = 2
superconformal field theories yields Calabi-Yau manifolds [20].

Let us begin by studying N = 2 superconformal field theory by considering
a Landau-Ginzburg superpotential given by the F term $W(\Phi) = \Phi^{P+2}$. We
know that, at criticality, this simple superpotential yields an N = 2 minimal
theory of level P with central charge given by Eq. (7.3.15): c = 3P/(P + 2). In
this language, describing the tensor product of several N = 2 superconformal
field theories is rather simple: we just add several superpotentials together

$$W(\Phi_i) = \Phi_1^{P_1+2} + \cdots + \Phi_n^{P_n+2}.$$ (7.5.31)

For example, the $(3^5)$ model discussed earlier, formed by tensoring five
copies of the P = 3 discrete series, corresponds to a superpotential with
$W(\Phi) = \sum_{i=1}^{5}\Phi_i^5$. Our goal is to show the relationship between this
superpotential and the Calabi-Yau manifold given by $Y_{4;5}$, which is given by $CP^4$
constrained by $z_1^5 + z_2^5 + z_3^5 + z_4^5 + z_5^5 = 0$.

To reveal the relationship between the tensoring of N = 2 minimal models
and Calabi-Yau manifolds, let us analyze the path integral defined over this
superpotential given by

$$\int D\Phi_1 \cdots D\Phi_N \exp\left[i\int d^2z\, d^2\theta\, (\Phi_1^{l_1} + \cdots + \Phi_N^{l_N})\right],$$ (7.5.32)

where $l_i$ are integers.

Let us now define the variables

$$\xi_1 = \Phi_1^{l_1}, \qquad \xi_i = \Phi_i/\Phi_1^{l_1/l_i}.$$ (7.5.33)

By factoring out $\xi_1$, the original path integral can be written as

$$\int D\xi_1 \cdots D\xi_N\, \Omega \exp\left[i\int d^2z\, d^2\theta\, \xi_1(1 + \xi_2^{l_2} + \cdots + \xi_N^{l_N})\right],$$ (7.5.34)

where $\Omega$ is the Jacobian for this coordinate transformation from $\Phi_i$ to $\xi_i$. It
is easy to show that this Jacobian is proportional to $\Phi_1^{j}$, where
$j = 1 - l_1 + l_1\left(\sum_{i=2}^{N}(1/l_i)\right)$. Therefore, the Jacobian drops out if we set j = 0, or

$$\sum_{i=1}^{N}\frac{1}{l_i} = 1.$$ (7.5.35)

If this condition is met, then $\Omega = 1$ and the integration over $\xi_1$ can be
performed, yielding the following delta function:

$$\delta(1 + \xi_2^{l_2} + \cdots + \xi_N^{l_N}).$$ (7.5.36)

This complex constraint is identical to the constraint found in what is called
weighted $CP^{N-1}$ manifolds. The manifold described by this delta function is
identical to the manifold arising from the constraint

$$\sum_{i=1}^{N} z_i^{l_i} = 0.$$ (7.5.37)

Assuming that $z_1 \neq 0$, we see that we can divide by $z_1^{l_1}$ and
arrive at the same condition as the delta function. We can make the identification

$$\xi_i^{l_i} = z_i^{l_i}/z_1^{l_1}.$$

This weighted $CP^{N-1}$ manifold is defined as a complex space where we
identify the point $[z_1, z_2, \ldots, z_N]$ with another point given by

$$[z_1, z_2, \ldots, z_N] = [\lambda^{k_1}z_1, \ldots, \lambda^{k_N}z_N]$$ (7.5.38)

for some complex $\lambda$. We call this manifold $WCP^{N-1}_{k_1,\ldots,k_N}$. If we apply this
transformation on the constraint defined by the delta function, we see that the constraint
remains invariant. (Notice that if all the integers $k_i$ are identical, then we have
an ordinary $CP^{N-1}$ manifold.)

To make the identification more precise, let d be the least common multiple
of the integers $l_i$. Then the superpotential $\sum_{i=1}^{N}\Phi_i^{l_i}$, via the delta function,
corresponds to the space

$$WCP^{N-1}_{d/l_1, \ldots, d/l_N}.$$ (7.5.39)

Lastly, we remark that the condition $\Omega = 1$ also has a counterpart from the
point of view of complex z space. It turns out that this condition is identical to
the vanishing of the first Chern class $c_1$ for the manifold. Since a Calabi-Yau
manifold is a complex Kahler manifold with vanishing first Chern class, we
have now shown that the superpotential $\sum_i \Phi_i^{l_i}$ corresponds to a Calabi-Yau
manifold if the Jacobian $\Omega = 1$.

In summary, the integration over the superpotential has become the defining
relation for the weighted $CP^{N-1}$ manifold, and the condition that the Jacobian
equal 1 has become the condition for the vanishing of the first Chern class, yielding a
Calabi-Yau manifold.





(We caution, however, that there are some loose ends in this heuristic
derivation. For example, we assumed that we could ignore the D term appearing in
the super path integral. This is a reasonable, though not rigorous, assumption,
because we expect the D term to contribute only small perturbations to the
theory.)

As a check on our results, let us investigate the c = 9 theories found by
Gepner. Since $P_i = l_i - 2$, we can write the central charge corresponding to
tensoring superconformal field theories as

$$c = \sum_{i=1}^{5} 3(l_i - 2)/l_i = 9.$$ (7.5.41)

This, in turn, can be reduced back to $\sum_{i=1}^{5}(1/l_i) = 1$, which is precisely the
condition for the Jacobian being equal to 1. Once again, this method reveals
the origin of the relationship between the two different formalisms for the case
c = 9.

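In this fashion, it is now easy to write the Calabi-Yau manifold which
corresponds to the tensoring of various superconformal field theories [20]:

$$\begin{array}{ll}
(3^5) & z_1^5 + z_2^5 + z_3^5 + z_4^5 + z_5^5 = 0 \in CP^4, \\
(4^4\,1) & z_1^6 + z_2^6 + z_3^6 + z_4^6 + z_5^3 = 0 \in WCP^4_{1,1,1,1,2}, \\
(6^4) & z_1^8 + z_2^8 + z_3^8 + z_4^8 + z_5^2 = 0 \in WCP^4_{1,1,1,1,4}, \\
(8^3\,3) & z_1^{10} + z_2^{10} + z_3^{10} + z_4^5 + z_5^2 = 0 \in WCP^4_{1,1,1,2,5}, \\
(7^3\,1^2) & z_1^9 + z_2^9 + z_3^9 + z_4^3 + z_5^3 = 0 \in WCP^4_{1,1,1,3,3}, \\
(10^2\,2^2\,1) & z_1^{12} + z_2^{12} + z_3^4 + z_4^4 + z_5^3 = 0 \in WCP^4_{1,1,3,3,4}, \\
(5^2\,1^2\,19) & z_1^{21} + z_2^7 + z_3^7 + z_4^3 + z_5^3 = 0 \in WCP^4_{1,3,3,7,7}, \\
(16^3\,1) & z_1^{18} + z_2^{18} + z_3^{18} + z_4^3 + z_5^2 = 0 \in WCP^4_{1,1,1,6,9}.
\end{array}$$ (7.5.42)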
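The two conditions used above, c = 9 and the Jacobian condition, can be checked for each listed model. This is a sketch under stated assumptions: each level P contributes c = 3P/(P + 2) and an exponent l = P + 2, and models with only four factors are padded with a trivial P = 0 factor (a quadratic field with c = 0) so that five fields appear. The padding and the dictionary `MODELS` are illustrative choices, not taken from the text.

```python
from fractions import Fraction

# Levels P_i of the tensored N = 2 minimal models (labels as in the list above).
MODELS = {
    "3^5": [3] * 5,
    "4^4 1": [4] * 4 + [1],
    "6^4": [6] * 4,
    "8^3 3": [8] * 3 + [3],
    "7^3 1^2": [7] * 3 + [1] * 2,
    "10^2 2^2 1": [10] * 2 + [2] * 2 + [1],
    "5^2 1^2 19": [5] * 2 + [1] * 2 + [19],
    "16^3 1": [16] * 3 + [1],
}

def central_charge(levels):
    """Sum of c = 3P / (P + 2) over the tensored minimal models."""
    return sum(Fraction(3 * p, p + 2) for p in levels)

def inverse_exponent_sum(levels, n_fields=5):
    """Sum of 1 / l_i with l_i = P_i + 2, padding with P = 0 factors (assumption)."""
    padded = list(levels) + [0] * (n_fields - len(levels))
    return sum(Fraction(1, p + 2) for p in padded)

for name, levels in MODELS.items():
    print(name, central_charge(levels), inverse_exponent_sum(levels))
```

For every entry the central charge comes out as 9 and the sum of inverse exponents as 1, which is the Jacobian condition of Eq. (7.5.35) with N = 5.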


We should also mention that the class of potentials that we have been
considering, of the type $\Phi^{P+2}$, only corresponds to one type of manifold. Earlier, we
saw that there is an A-D-E classification of zero modality singular functions.
The $A_P$ singular functions are of the form $z^{P+1}$ which we have considered so
far.

For the $D_{P+2}$ series, we need to use monomials of the form $z^{P+1} + zw^2$,
while monomials of the form $z^3 + w^4$, $z^3 + zw^3$, and $z^3 + w^5$ are in the
$E_6$, $E_7$, and $E_8$ series, respectively. In this way, it is also straightforward to
construct the correspondence between tensoring superconformal field theories
and Calabi-Yau manifolds for the D-E series, as well.






7.6 Zamolodchikov’s c Theorem 

One major defect in the previous presentation has been the fact that our
discussion has focused on systems that were exactly conformally invariant at
criticality. From the string point of view, this means that we have been
studying vacuums that are on shell. However, this does not tell us which vacuums
the theory prefers and how it tunnels between possible vacuums. Some insight
can be gained by studying conformal theories that are allowed to go off
criticality. For example, in solid-state physics, two-dimensional systems, such as
the Ising model, are exactly solvable both at criticality as well as off criticality.
Thus, we should reexamine our approach to conformal systems by analyzing
our equations off criticality.

The c theorem [10] provides a powerful way in which to analyze systems 
off criticality. In short, the c theorem states that there is a function C with two 
properties. First, at criticality, the function C reduces to the usual central term c 
for some conformal field theory. Second, the value of C along renormalization 
group flows decreases. 

The proof of this theorem is quite general and deceptively simple. It is 
based on analyzing the full energy-momentum tensor T ab for systems that are 
not critical. 

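For example, $T_{ab}$ can be broken down into four pieces:

(1) the antisymmetric piece $T_{[ab]}$;

(2) the trace, given by $\Theta$; and

(3) the symmetric parts $T = T_{zz}$ and $\bar{T} = T_{\bar{z}\bar{z}}$.

The energy-momentum is conserved, which means $\partial^a T_{ab} = 0$, or

$$\partial_{\bar{z}} T + \tfrac{1}{4}\partial_z\Theta = 0, \qquad \partial_z\bar{T} + \tfrac{1}{4}\partial_{\bar{z}}\Theta = 0.$$ (7.6.1)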

The antisymmetric piece can be set equal to zero if the system is rotationally
invariant, which we will assume. Normally, for conformally invariant systems,
we also set the trace $\Theta$ equal to zero. This, in turn, means that T ($\bar{T}$) is a function
of z ($\bar{z}$). However, we will now keep the trace arbitrary for a noncritical system.

Because T, $\Theta$, and $\bar{T}$ have conformal spins equal to 2, 0, -2, we can write
the new operator product expansion as

$$\langle T(z,\bar{z})T(0,0)\rangle = F(z\bar{z})/z^4,$$
$$\langle T(z,\bar{z})\Theta(0,0)\rangle = G(z\bar{z})/z^3\bar{z},$$ (7.6.2)
$$\langle \Theta(z,\bar{z})\Theta(0,0)\rangle = H(z\bar{z})/z^2\bar{z}^2.$$

Now, let us take the correlation function between the equation of motion and
T(0, 0) or $\Theta$(0, 0), that is,

$$\langle(\partial_{\bar{z}}T + \tfrac{1}{4}\partial_z\Theta)\,T(0,0)\rangle = 0.$$ (7.6.3)





We then find two equations

$$\dot{F} + \tfrac{1}{4}(\dot{G} - 3G) = 0,$$
$$\dot{G} - G + \tfrac{1}{4}(\dot{H} - 2H) = 0,$$ (7.6.4)

where we have defined $\dot{F} = z\bar{z}F'(z\bar{z})$.

Now, we can define the function C, which reduces to the central term c at
criticality

$$C = 2F - G - \tfrac{3}{8}H,$$ (7.6.5)

which obeys the equation

$$\dot{C} = -\tfrac{3}{4}H.$$ (7.6.6)

By reflection positivity, we know that H > 0, so C is a decreasing function of
$R = \sqrt{z\bar{z}}$.

In a theory with coupling constants $\{g_i\}$, we can write a renormalization
group equation for $C(R, \{g_i\})$ as follows:

$$\left(R\frac{\partial}{\partial R} + \sum_i \beta_i(g)\frac{\partial}{\partial g_i}\right)C(\{g_i\}, R) = 0.$$ (7.6.7)

Notice that, at a fixed point where $\beta_i = 0$, we have G = H = 0 and $F = \frac{1}{2}c$,
so that C = c at the fixed point.
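The step from the two conservation equations to the flow equation for C is pure linear algebra and can be checked with exact arithmetic. The sketch below assumes the standard coefficients, namely that the two equations read $\dot{F} + \frac{1}{4}(\dot{G} - 3G) = 0$ and $\dot{G} - G + \frac{1}{4}(\dot{H} - 2H) = 0$, and that $C = 2F - G - \frac{3}{8}H$; the function name `c_dot` is illustrative.

```python
from fractions import Fraction
import random

def c_dot(G, H, H_dot):
    """Return dC/dt for C = 2F - G - (3/8)H, given G, H and dH/dt.

    dG/dt and dF/dt are eliminated using the two conservation equations.
    """
    G_dot = G - Fraction(1, 4) * (H_dot - 2 * H)   # second equation solved for dG/dt
    F_dot = -Fraction(1, 4) * (G_dot - 3 * G)      # first equation solved for dF/dt
    return 2 * F_dot - G_dot - Fraction(3, 8) * H_dot

random.seed(0)
for _ in range(5):
    G, H, H_dot = (Fraction(random.randint(-9, 9), random.randint(1, 9)) for _ in range(3))
    # The result depends only on H and equals -(3/4) H.
    print(c_dot(G, H, H_dot) == -Fraction(3, 4) * H)
```

The dependence on G and on the derivative of H cancels identically, which is why the sign of the flow of C is controlled by H alone.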

In summary, we have now shown that, if renormalization flows connect
different conformal field theories, then C decreases along the flows and that
C = c at criticality.


7.7 A-D-E Classification of c = 1 Theories 

So far, we have reviewed the major methods that have been devised which can
give us, for c < 1, a complete classification of the unitary representations of the
conformal group, including its modular invariants, in terms of finite numbers
of primary fields. For c > 1, there are an infinite number of primary fields and
much less is known about their representations.

Questions remain, however, about the case c = 1. Will it behave more like
c < 1 and give us exactly solvable representations, or will it behave more like
c > 1 and be, at least with present methods, intractable?

The answer is rather unexpected. We find that at c = 1 we can find the
complete set of unitary representations, as in the c < 1 case [21].

A careful analysis shows that there are only three classes of solutions,
corresponding to a boson propagating on:

(1) a torus of radius r;

(2) an orbifold parametrized by radius r; and



(3) three discrete orbifold spaces defined on $SU(2)/\Gamma_i$, where the $\Gamma_i$ are discrete
subgroups of SU(2).

The first two solutions represent continuous classes of solutions which can 
be parametrized by r, where 0 < r < oo, while the third solution is discrete. 

To understand how to construct these c = 1 representations, we begin with 
the usual action for a spin -0 boson 

S-(7.7.1) 

If we compactify this on a circle $S^1$ with x being identified with $x + 2\pi r$
and trace over $q^{L_0 - 1/24}\bar{q}^{\bar{L}_0 - 1/24}$, we find the partition function Z(r) which we
calculated earlier in Eq. (4.2.17), which obeys the duality condition


Z(r) = Z(l/r). (7.7.2) 


Notice that we have a continuous set of solutions defined on $S^1$ indexed by
the radius r.

We obtain the second set of continuous solutions when the boson propagates
on an orbifold. In particular, let us divide out by the discrete symmetry
$x \to -x$, so the boson propagates on $S^1/Z_2$ instead of the circle $S^1$. This alteration
leaves c invariant, but changes the boundary condition for the trace operation.
In general, if we perform the functional integral over an orbifold where we
have divided out by the action of a discrete group G with elements $g_i$, then we
must sum over all possible boundary conditions in the $\sigma_1$ and $\sigma_2$ directions,
with $g_i$ acting on both sides of the parallelogram. Since the discrete group G
has two elements, the identity and the parity operator, we must therefore sum
over four possible boundary conditions.

Evaluating the trace over these new boundary conditions, we find 


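$$Z_{\rm orb}(r) = \frac{1}{2}\left[Z(r) + \frac{|\theta_2\theta_3|}{\eta\bar{\eta}} + \frac{|\theta_2\theta_4|}{\eta\bar{\eta}} + \frac{|\theta_3\theta_4|}{\eta\bar{\eta}}\right]$$
$$= \frac{1}{2}[Z(r) + 2Z_2 - Z_1],$$ (7.7.3)

where we define

$$Z_n = Z(1/(n\sqrt{2})) = Z(n/\sqrt{2})$$ (7.7.4)

and where $\theta_i = \theta_i(0, \tau)$.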

Then we have the desired result for the partition function over the Z 2 orbifold. 
By explicit calculation, one can show that both partition functions are modular 
invariant. 

We have now constructed modular invariants for two continuous classes 
of representations. Before discussing the third solution, we note that within 
these two continuous classes given by the torus and the orbifold, there are 
interesting special values for r which yield some insight into the structure of 
these solutions. 





For example, we can reexpress the above results in the language of bosons
propagating on the orbifold $SU(2)/\Gamma$, where $\Gamma$ represents the various discrete
subgroups of SU(2). This means that the point $g \in SU(2)$ is equivalent to the
point $hgh^{-1}$ if $h \in \Gamma$.

If we choose $h = \exp(2\pi i J_3/n)$, then the generators $J_3$ and $J_\pm$ of SU(2) must
be identified according to the following:

$$hJ_3h^{-1} = J_3, \qquad hJ_\pm h^{-1} = e^{\pm 2\pi i/n}J_\pm.$$ (7.7.5)

This simply means that the point x is to be identified with the point
$x + 2\pi/(n\sqrt{2})$. In other words, the boson propagates on a circle $S^1$ with discrete
radius $1/(n\sqrt{2})$. This just selects out special radii for the circle.

The element h generates the discrete subgroup of SU(2) called the binary
cyclic group $\mathcal{C}_{2n}$. This is a group of order 2n, whose projection $C_n = \mathcal{C}_{2n}/Z_2 \subset SO(3)$
is the group which describes rotations about an axis of n-fold symmetry.
Thus, there is a one-to-one correspondence between the special radii we have
found for the torus and the elements of this particular discrete subgroup of
SU(2).

We can also establish

$$Z[SU(2)/\mathcal{C}_{2n}] = Z_n.$$ (7.7.6)

Similarly, we can also choose another discrete subgroup $\Gamma$ to be generated
by the element $\tilde{h} = \exp(i\pi J_1)$. The group action yields

$$\tilde{h}J_3\tilde{h}^{-1} = -J_3, \qquad \tilde{h}J_\pm\tilde{h}^{-1} = J_\mp.$$ (7.7.7)

This identification, in turn, can be shown to correspond to identifying
$x \to -x$, as before, for the $S^1/Z_2$ orbifold. If we combine the action of h and $\tilde{h}$, then
we can describe propagation on the orbifold $S^1/Z_2$ with the orbifold radius
being $r = 1/(n\sqrt{2})$.

These special radii can be placed in one-to-one correspondence with the
elements of the binary dihedral group $\mathcal{D}_n$, which is a discrete subgroup of
SU(2) of order 4n generated by h and $\tilde{h}$. Their projections
$D_n = \mathcal{D}_n/Z_2 \subset SO(3)$ have n axes of two-fold symmetry perpendicular to an n-fold
axis. For example, the group $D_4$ corresponds to the 8 element symmetry group
of the square.
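The orders 2n and 4n quoted for the binary cyclic and binary dihedral groups can be checked by generating the groups numerically. The sketch below assumes the spin-1/2 representation $J_a = \sigma_a/2$, so that $\exp(2\pi i J_3/n)$ is the diagonal matrix with entries $e^{\pm i\pi/n}$ and $\exp(i\pi J_1) = i\sigma_1$; the helper names are hypothetical, and matrices are compared after rounding, which is adequate for small n.

```python
import cmath

def matmul(a, b):
    """Product of two 2x2 complex matrices stored as nested tuples."""
    return (
        (a[0][0] * b[0][0] + a[0][1] * b[1][0], a[0][0] * b[0][1] + a[0][1] * b[1][1]),
        (a[1][0] * b[0][0] + a[1][1] * b[1][0], a[1][0] * b[0][1] + a[1][1] * b[1][1]),
    )

def key(m, digits=9):
    """Rounded, hashable fingerprint of a matrix (adding 0.0 normalizes -0.0)."""
    return tuple((round(z.real, digits) + 0.0, round(z.imag, digits) + 0.0)
                 for row in m for z in row)

def group_order(generators):
    """Order of the finite matrix group generated by the given matrices."""
    identity = ((1 + 0j, 0j), (0j, 1 + 0j))
    seen = {key(identity)}
    frontier = [identity]
    while frontier:
        g = frontier.pop()
        for h in generators:
            prod = matmul(g, h)
            k = key(prod)
            if k not in seen:
                seen.add(k)
                frontier.append(prod)
    return len(seen)

def cyclic_generator(n):
    """exp(2 pi i J_3 / n) in the spin-1/2 representation (assumed J_3 = sigma_3 / 2)."""
    u = cmath.exp(1j * cmath.pi / n)
    return ((u, 0j), (0j, u.conjugate()))

# exp(i pi J_1) = i sigma_1 in the same representation.
H_TILDE = ((0j, 1j), (1j, 0j))

for n in (2, 3, 4):
    print(n, group_order([cyclic_generator(n)]), group_order([cyclic_generator(n), H_TILDE]))
```

For each n the first count is 2n and the second is 4n, matching the orders stated in the text.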

We can also establish

$$Z[SU(2)/\mathcal{D}_n] = \tfrac{1}{2}(Z_n + 2Z_2 - Z_1) = Z_{\rm orb}(1/(n\sqrt{2})).$$ (7.7.8)

This concludes our discussion of the first two continuous classes of solutions
and their special points. Now, we describe the third class of solutions to the
c = 1 theory, which is given by three discrete solutions. In addition to the
$\mathcal{C}_n$ and $\mathcal{D}_n$ discrete subgroups of SU(2), there are also the binary tetrahedral,
octahedral, and icosahedral groups, labeled by $\mathcal{T}$, $\mathcal{O}$, and $\mathcal{I}$ of order 24, 48,
120, which are related to the symmetry groups of the regular polyhedra found
in ordinary solid geometry.





We can calculate the modular invariants associated with each of these three
discrete subgroups by breaking them down further into their various elements
and expressing them as rotations about various axes of the regular polyhedra.
For example, the mutually commuting elements of the discrete subgroup $\mathcal{T}$ lie
in four $C_3$'s (acting about axes through the centers of the various faces) and
one $D_2$ (of rotations about axes through the centers of the opposite edges).

We find, therefore, that a modular invariant combination for $SU(2)/\mathcal{T}$ is
given by

$$Z[SU(2)/\mathcal{T}] = \tfrac{1}{12}\{4(3Z_3 - Z_1) + 4Z[SU(2)/\mathcal{D}_2]\}$$
$$= \tfrac{1}{2}(2Z_3 + Z_2 - Z_1).$$ (7.7.9)

For the octahedral group $\mathcal{O}$, its generators lie in three $C_4$'s (acting about axes
through the centers of opposite faces of a cube), four $C_3$'s (acting about axes
through antipodal vertices), one $D_2$ (containing the elements of three $C_4$'s), and
three $D_2$'s (each of which contains one of these elements and two others which
are associated with orthogonal axes through the centers of opposite edges).

We find, therefore,

$$Z[SU(2)/\mathcal{O}] = \tfrac{1}{24}\{3(4Z_4 - 2Z_2) + 4(3Z_3 - Z_1)$$
$$+ 4Z[SU(2)/\mathcal{D}_2] + 3(4Z[SU(2)/\mathcal{D}_2] - 2Z_2)\}$$
$$= \tfrac{1}{2}(Z_4 + Z_3 + Z_2 - Z_1).$$ (7.7.10)

Finally, the icosahedral group's elements lie in six $C_5$'s (acting about axes
through opposite faces of a dodecahedron), ten $C_3$'s (acting about axes through
antipodal vertices), and five $D_2$'s (comprised of the rotations about axes through
the centers of opposite edges). We have, therefore,

$$Z[SU(2)/\mathcal{I}] = \tfrac{1}{60}\{6(5Z_5 - Z_1) + 10(3Z_3 - Z_1)$$
$$+ 5(4Z[SU(2)/\mathcal{D}_2] - Z_1) + Z_1\}$$
$$= \tfrac{1}{2}(Z_5 + Z_3 + Z_2 - Z_1).$$ (7.7.11)
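The reductions of these three combinations are linear algebra in the $Z_n$ and can be checked with exact fractions. The sketch below assumes the prefactors 1/12, 1/24 and 1/60 (the orders of the rotation groups) and a factor 6 multiplying the $C_5$ term, together with $Z[SU(2)/\mathcal{D}_2] = \frac{1}{2}(3Z_2 - Z_1)$ from Eq. (7.7.8) at n = 2. A combination is stored as a dictionary mapping n to the coefficient of $Z_n$; the helper names are illustrative.

```python
from fractions import Fraction

def lin(*pairs):
    """Build sum_k coeff_k * Z_k from (coeff, k) pairs as a dict {k: coeff}."""
    out = {}
    for coeff, k in pairs:
        out[k] = out.get(k, Fraction(0)) + Fraction(coeff)
    return {k: v for k, v in out.items() if v != 0}

def scale(s, combo):
    """Multiply a combination by the scalar s."""
    return {k: Fraction(s) * v for k, v in combo.items()}

def add(*combos):
    """Add several combinations, dropping vanishing coefficients."""
    out = {}
    for c in combos:
        for k, v in c.items():
            out[k] = out.get(k, Fraction(0)) + v
    return {k: v for k, v in out.items() if v != 0}

Z_D2 = scale(Fraction(1, 2), lin((3, 2), (-1, 1)))   # (1/2)(Z_2 + 2 Z_2 - Z_1)

Z_T = scale(Fraction(1, 12), add(scale(4, lin((3, 3), (-1, 1))), scale(4, Z_D2)))
Z_O = scale(Fraction(1, 24), add(
    scale(3, lin((4, 4), (-2, 2))),
    scale(4, lin((3, 3), (-1, 1))),
    scale(4, Z_D2),
    scale(3, add(scale(4, Z_D2), lin((-2, 2)))),
))
Z_I = scale(Fraction(1, 60), add(
    scale(6, lin((5, 5), (-1, 1))),
    scale(10, lin((3, 3), (-1, 1))),
    scale(5, add(scale(4, Z_D2), lin((-1, 1)))),
    lin((1, 1)),
))
print(Z_T, Z_O, Z_I)
```

Each result reproduces the compact form on the last line of the corresponding equation.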

We now remark on a curious mathematical fact. When we analyzed the
modular invariants of $SU(2)_k$, we noticed that they could be placed in one-to-one
correspondence with the A-D-E classification of simply laced groups (which
have simple root vectors of the same length). We now show that the special
solutions of the c = 1 theory for the two continuous classes and the three
discrete solutions can also be placed in a one-to-one correspondence with the
A-D-E classification. This is because there is a one-to-one correspondence
between the simply laced groups and the finite subgroups $\Gamma$ of SU(2). Since
each of the discrete solutions that we have found can be described by
propagation on $SU(2)/\Gamma$, we now have a one-to-one correspondence between the
simply laced groups and the complete discrete modular invariant solutions of
the c = 1 series.





In particular, we find that the special values for the two continuous series
can be identified as

$$SU(n) = A_{n-1} \leftrightarrow \mathcal{C}_n, \qquad SO(2n) = D_n \leftrightarrow \mathcal{D}_{n-2},$$ (7.7.12)

while the three discrete solutions can be identified as

$$E_6 \leftrightarrow \mathcal{T}, \qquad E_7 \leftrightarrow \mathcal{O}, \qquad E_8 \leftrightarrow \mathcal{I}.$$ (7.7.13)

Lastly, it can be proven that the three classes of solutions that we have
found for the c = 1 theory are, in fact, the only solutions [22]. Thus, we
have now achieved a rather interesting result, the complete classification of the
representations of the c = 1 theory.


7.8 Summary 

At present, there is no generally accepted classification scheme that reveals the 
deep relationship between various conformal field theories. Several methods 
have been devised that can catalog vast numbers of conformal field theories, 
especially the rational conformal field theories, although none of these methods 
give us much insight into the relationship between the theories or how to select 
out the true vacuum of the string. The various methods are summarized here. 

(1) The GKO coset method is still one of the most powerful methods by which 
all known rational conformal field theories may be represented. However, 
one has to know the representations of G and its subgroup H in order to 
determine the representations of the coset G/H. 

(2) The Feigin-Fuchs method uses a set of almost trivial free fields in which 
to generate the primary fields of a wide variety of models. It can include 
the Kac-Moody algebras, cosets, and N = 2 superconformal field theory. 

(3) The Landau-Ginzburg method and catastrophe theory is not as powerful 
as the others, since some conformal field theories cannot be represented 
in this fashion, but it is the most physical and may eventually explain how 
one conformal field theory may “flow” into another. 

(4) The Chern-Simons knot theory method can also represent all known
conformal field theories. It will be presented in the next chapter.

The Feigin-Fuchs free field method begins with an energy-momentum
tensor with a linear term

$$T(z) = -\tfrac{1}{2}:\partial_z\phi\,\partial_z\phi: + i\alpha_0\partial_z^2\phi.$$ (7.8.1)

Then the central term can be calculated to be

$$c = 1 - 12\alpha_0^2$$ (7.8.2)

and the conformal weight of the vertex function $:e^{i\alpha\phi}:$ is equal to $\frac{1}{2}(\alpha^2 - 2\alpha\alpha_0)$.





Now, consider the operator

$$Q = \oint dz\, J(z), \qquad J(z) = :e^{i\alpha\phi(z)}:.$$ (7.8.3)

If we choose $\alpha$ such that the integrand has weight 1, then we have

$$\alpha^2 - 2\alpha\alpha_0 = 2,$$ (7.8.4)

which has two solutions for $\alpha$:

$$\alpha_\pm = \alpha_0 \pm \sqrt{\alpha_0^2 + 2}.$$ (7.8.5)

The trick behind the Feigin-Fuchs method is that, because $Q_\pm$ is
conformally invariant, we can insert as many of them into a correlation function
as required until the sums of the momenta add up to zero. For example, the
correlator

$$\langle V_\alpha V_\alpha V_\alpha V_{2\alpha_0 - \alpha}\rangle$$ (7.8.6)

is nonzero if we put momentum $-2\alpha_0$ at infinity and insert n - 1 $Q_-$'s and
m - 1 $Q_+$'s. Then, the sum of the momenta must be zero, so

$$2\alpha = 2\alpha_{n,m} = (1 - n)\alpha_- + (1 - m)\alpha_+.$$ (7.8.7)

The conformal weight of the vertex $V_\alpha$ must therefore be

$$2\Delta_{nm} = \alpha_{nm}^2 - 2\alpha_{nm}\alpha_0 = \tfrac{1}{4}[(\alpha_- n + \alpha_+ m)^2 - (\alpha_+ + \alpha_-)^2].$$ (7.8.8)

But, this is just the conformal weight of a minimal field. Since the correlator
of vertex functions is trivial to calculate, we have now reduced the problem
of finding the correlation function of the minimal model to evaluating line
integrals (arising from Q). For example,

$$\langle\phi_{n_1,m_1}(0)\,\phi_{n_2,m_2}(z)\,\phi_{n_3,m_3}(1)\,\phi_{n_4,m_4}(\infty)\rangle$$
$$= \oint dt\,\langle V_{\alpha_1}(0)V_{\alpha_2}(z)V_{\alpha_3}(1)V_{\alpha_4}(\infty)J_+(t)\rangle$$
$$= z^{2\alpha_1\alpha_2}(1 - z)^{2\alpha_2\alpha_3}\oint dt\, t^{a}(t - 1)^{b}(t - z)^{c},$$ (7.8.9)

where $a = 2\alpha_1\alpha_+$, $b = 2\alpha_3\alpha_+$, $c = 2\alpha_2\alpha_+$, $\alpha_2 = -\frac{1}{2}\alpha_+$, and
$\alpha_4 = 2\alpha_0 - \alpha_1 - \alpha_2 - \alpha_3 - \alpha_+$.
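The claim that the Coulomb-gas weight of Eq. (7.8.8) reproduces the weights of a minimal field can be checked numerically. This is a sketch under stated assumptions: the unitary minimal model with c = 1 - 6/p(p + 1) is obtained by taking $\alpha_+ = \sqrt{2(p+1)/p}$ and $\alpha_- = -\sqrt{2p/(p+1)}$ (so that $\alpha_+\alpha_- = -2$ and $\alpha_+ + \alpha_- = 2\alpha_0$), the weight of $:e^{i\alpha\phi}:$ is taken as $\frac{1}{2}(\alpha^2 - 2\alpha\alpha_0)$, and the comparison is with the standard Kac formula. The function names are illustrative.

```python
import math

def screening_charges(p):
    """Return (alpha_plus, alpha_minus) for c = 1 - 6 / (p (p + 1)) (assumed parametrization)."""
    return math.sqrt(2 * (p + 1) / p), -math.sqrt(2 * p / (p + 1))

def coulomb_gas_weight(n, m, p):
    """Weight (alpha^2 - 2 alpha alpha_0) / 2 with 2 alpha = (1 - n) alpha_- + (1 - m) alpha_+."""
    ap, am = screening_charges(p)
    alpha0 = (ap + am) / 2
    alpha = ((1 - n) * am + (1 - m) * ap) / 2
    return (alpha ** 2 - 2 * alpha * alpha0) / 2

def kac_weight(n, m, p):
    """Kac formula for the unitary minimal model labeled by p."""
    return ((m * (p + 1) - n * p) ** 2 - 1) / (4 * p * (p + 1))

p = 3  # Ising model, c = 1/2
ap, am = screening_charges(p)
alpha0 = (ap + am) / 2
print(1 - 12 * alpha0 ** 2)                                      # central charge
print(coulomb_gas_weight(1, 2, p), coulomb_gas_weight(2, 2, p))  # two Ising weights
```

For p = 3 the two printed weights are close to 1/2 and 1/16, the familiar energy and spin operators of the Ising model.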

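The Feigin-Fuchs method can be generalized by introducing more free fields
$\beta_\alpha$, $\gamma_\alpha$, and $\phi^i$ with an operator product expansion

$$\beta_\alpha(z)\gamma_\beta(w) \sim \frac{\delta_{\alpha\beta}}{z - w}, \qquad \phi^i(z)\phi^j(w) \sim -\delta^{ij}\ln(z - w).$$ (7.8.10)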





With these free fields, one can represent the generators of a Kac-Moody 
algebra. For example, the generators of the Cartan subalgebra can be written as 

H‘(z) = —ia+ 3 <j)‘ + ‘P- a y a (z), (7.8.11) 

«€G 

where the sum runs over positive roots and where $\alpha_+ = \sqrt{k + g}$, and the other
generators can be written as

E_ a (z) = (i- a (z)+ N paYp {z)p- a {z). (7.8.12) 

p—a——a 

Also, the energy-momentum tensor can be written as 

| rank G • rank G 

T(z) = 5>-*3 y a - - £ [3 <t>\z)f - 3 V(z), ( 7 - 8 - 13 ) 

aeG Z i =1 i=l 


where p l is half the sum of the positive roots. 

Furthermore, we can write the free representation of the coset energy- 
momentum tensor 

Tg/h(z) = d y- \\. d P(z)Y - vPJ PG 3 2 0(z) 

ceeG/H 

- [-jimz)] 2 -jkhPH 3<^'(z)} • (7.8.14) 

In addition, this method easily generalizes to the N = 1 and N = 2 
superconformal cases. 

Next, we study the Landau-Ginzburg approach, which begins with the 
observation that a theory with the potential 

$$L \sim g\int d^2x\, \Phi^{2(p-1)}$$ (7.8.15)

is not usually conformally invariant except at criticality. At this point, however,
it must equal one of the classes of conformal field theories that has been
proposed. Specifically, the potential $\Phi^{2(p-1)}$ at criticality corresponds to the
unitary minimal model with c = 1 - 6/p(p + 1).

To see the relationship between a Landau-Ginzburg potential at criticality 
and a standard conformal field theory, take the fusion relations for a minimal 
model 

~ J2 X ^[<l>n + k, m+l + • ' • ] (7.8.16) 

k,l 

and compare them to the operator product expansion of the Landau-Ginzburg
theory. By examining these relations, we can show the correspondence between
conformal field theories and Landau-Ginzburg models

$$:\Phi^k: \;\leftrightarrow\; \phi_{k+1,k+1}, \qquad k = 0, 1, \ldots, p - 2,$$
$$:\Phi^k: \;\leftrightarrow\; \phi_{k-p+2,k-p+3}, \qquad k = p - 1, p, p + 1, \ldots, 2p - 4.$$ (7.8.17)





Similarly, we can analyze the N = 2 theory and find the relationship 

* = 0,l,2,...,p-2. (7.8.18) 

The situation for the N = 2 theories, however, is more complicated. First,
we have the fact that, by defining a $U_\theta$ operator, one can smoothly transform
the generators from NS to R:

$$U_\theta^{-1}L_nU_\theta = L_n + \theta J_n + \frac{c}{6}\theta^2\delta_{n,0},$$
$$U_\theta^{-1}J_nU_\theta = J_n + \frac{c}{3}\theta\delta_{n,0},$$
$$U_\theta^{-1}G_r^{+}U_\theta = G_{r+\theta}^{+},$$
$$U_\theta^{-1}G_r^{-}U_\theta = G_{r-\theta}^{-},$$ (7.8.19)

where the U(1) charge and dimension change as

$$q \to q + (c\theta/3),$$
$$h \to h + \theta q + (c\theta^2/6),$$ (7.8.20)

where $\theta$ will determine the spectral flow of the theory.
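The flow of the charge and dimension can be tried out on simple states. This is a minimal sketch, assuming the shifts $q \to q + c\theta/3$ and $h \to h + \theta q + c\theta^2/6$ quoted above; the function name `spectral_flow` is illustrative.

```python
from fractions import Fraction

def spectral_flow(h, q, c, theta):
    """Return the flowed (h, q) under h -> h + theta q + c theta^2 / 6, q -> q + c theta / 3."""
    h, q, c, theta = (Fraction(v) for v in (h, q, c, theta))
    return h + theta * q + c * theta ** 2 / 6, q + c * theta / 3

# NS vacuum of the c = 1 model flowed by theta = 1/2
print(spectral_flow(0, 0, 1, Fraction(1, 2)))
# Chiral primary X (h = 1/6, q = 1/3) of the c = 1 model flowed by theta = -1/2
print(spectral_flow(Fraction(1, 6), Fraction(1, 3), 1, Fraction(-1, 2)))
```

In both cases the flowed dimension is c/24 = 1/24, the value expected for a ground state of the R sector.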

We define a primary field as one that satisfies

$$G^{-}_{n+1/2}|\phi\rangle = G^{+}_{n+1/2}|\phi\rangle = 0$$ (7.8.21)

in analogy with the usual definition of primary fields for bosonic fields. What
is interesting is that the algebra formed by these chiral primaries forms a closed
ring. The strategy is then to find the equivalence between this chiral ring $R_{\rm chiral}$
and the algebra formed by a Landau-Ginzburg potential.

Given a Landau-Ginzburg potential, we can form the ring that is created by 
taking all possible monomials of the fields, divided by all possible derivatives 
of the potential, that is, 


n x- 

Rlg = (7.8.22) 

G [djW] v ' 

The object is to establish the relationship 

^chiral = ^LG* (7.8.23) 


For example, the simplest representation of the chiral ring is in terms of a free
boson defined on a circle. We can represent the N = 2 generators as

$$G^{\pm} = e^{\pm i\sqrt{3}\phi_L}, \qquad \bar{G}^{\pm} = e^{\pm i\sqrt{3}\phi_R},$$
$$J(z) = (i/\sqrt{3})\,\partial\phi, \qquad \bar{J}(\bar{z}) = -(i/\sqrt{3})\,\bar{\partial}\phi.$$ (7.8.24)

The only primary chiral states are the vacuum and the state that has
$h_L = q_L/2 = h_R = q_R/2 = \frac{1}{6}$, which we denote by X. The chiral ring of this
superconformal model is simple

$$R_{\rm chiral} = \{1, X\}, \qquad X \cdot X = 0.$$ (7.8.25)





Now, compare this chiral ring with the Landau-Ginzburg potential

$$W = x^3,$$ (7.8.26)

where x is a chiral superfield. The Landau-Ginzburg ring is formed by taking
all possible products of 1 and x, modulo all possible derivatives of W, that is,
modulo $x^2$. But, this leaves only two elements in the ring, 1 and x; so,

$$R_{\rm LG} = \{1, x\}, \qquad x \cdot x = 0.$$ (7.8.27)

Notice we are back to the same ring structure as the chiral ring, $R_{\rm chiral} = R_{\rm LG}$.
A more systematic search for the equivalences between $R_{\rm chiral}$ and $R_{\rm LG}$
involves analyzing the scaling property of the potential

$$W(\lambda^{n_i}x_i) = \lambda^{d}W(x_i).$$ (7.8.28)


However, this equation is also the defining equation for a class of catastrophes
that has been studied by mathematicians. Thus, systematic study of
catastrophes (which have been cataloged) may give us insight into the classification
of these types of superconformal field theories.

Last, we analyze the possibility that various conformal field theories may 
flow into others via the renormalization group equations. This may give us a 
clue as to the nature of the true string vacuum. When the transition is made 
between one conformal field theory and another, the theory goes off criticality,
so it is important to reanalyze the energy-momentum tensor when scale
invariance is violated, so that, besides $T$, it has a nonzero trace, denoted by $\Theta$. With a
nonzero trace, the conservation of the energy-momentum tensor becomes

$$\partial_{\bar z} T + \tfrac{1}{4}\,\partial_z \Theta = 0. \tag{7.8.29}$$

Now, let us take the correlation function between the equation of motion
and $T(0, 0)$ or $\Theta(0, 0)$, that is,

$$\left\langle \left(\partial_{\bar z} T + \tfrac{1}{4}\,\partial_z \Theta\right) T(0,0) \right\rangle = 0. \tag{7.8.30}$$

We then find two equations

$$\dot F + \tfrac{1}{4}(\dot G - 3G) = 0, \qquad \dot G - G + \tfrac{1}{4}(\dot H - 2H) = 0, \tag{7.8.31}$$

where we have defined $\dot F = z\bar z\, F'(z\bar z)$.
Now, we can define the function $C$, which reduces to the central term $c$ at
criticality

$$C = 2F - G - \tfrac{3}{8} H, \tag{7.8.32}$$

which obeys the equation

$$\dot C = -\tfrac{3}{4} H. \tag{7.8.33}$$

By reflection positivity, we know that $H \geq 0$, so that $C$ is a nonincreasing
function of $R = \sqrt{z\bar z}$.
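The step from the two conservation equations to the flow equation for $C$ is short enough to spell out. The following is a sketch that assumes the coefficients $\tfrac14$, $\tfrac38$, and $\tfrac34$ of Zamolodchikov's original derivation [10]; those fractions are partly illegible in this printing, so they should be read as an assumption.

```latex
% Start from the two equations (7.8.31), solved for \dot F and \dot G:
\dot F = -\tfrac14 \dot G + \tfrac34 G ,
\qquad
\dot G = G - \tfrac14 \dot H + \tfrac12 H .

% Differentiate C = 2F - G - (3/8) H and substitute \dot F:
\dot C = 2\dot F - \dot G - \tfrac38 \dot H
       = -\tfrac32 \dot G + \tfrac32 G - \tfrac38 \dot H .

% Substitute \dot G; the G and \dot H terms cancel:
\dot C = -\tfrac32\left(G - \tfrac14 \dot H + \tfrac12 H\right)
         + \tfrac32 G - \tfrac38 \dot H
       = -\tfrac34 H .
```

Only $H$ survives, which is why positivity of $H$ alone fixes the sign of $\dot C$.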





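In a theory with coupling constants $\{g_i\}$, we can write a renormalization
group equation for $C(R, \{g_i\})$ as follows:

$$\left( R\,\frac{\partial}{\partial R} + \sum_i \beta_i(g)\,\frac{\partial}{\partial g_i} \right) C(\{g_i\}, R) = 0. \tag{7.8.34}$$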


In summary, we have defined a new function $C$ that generalizes the concept of a
central charge to off-critical systems and equals the usual $c$ when we are
sitting at criticality. Then the c theorem tells us that $C$ must decrease
monotonically along the path connecting two conformal field theories. Thus, 
via the c theorem, we have established an important link between different 
conformal field theories, that is, the renormalization group flows connecting 
them have decreasing central charge. 


References 


1. P. Goddard, A. Kent, and D. Olive, Comm. Math. Phys. 103, 105 (1986). 

2. B. L. Feigin and D. B. Fuchs, unpublished. 

3. Vl. S. Dotsenko and V. A. Fateev, Nucl. Phys. B240 [FS12], 312 (1984).

4. Vl. S. Dotsenko and V. A. Fateev, Nucl. Phys. B251 [FS13], 691 (1985).

5. Vl. S. Dotsenko, Lectures on Conformal Field Theory, Advances in Studies in
Pure Mathematics, vol. 16 (1988). 

6. M. A. Bershadsky, V. G. Knizhnik, and M. G. Teitelman, Phys. Lett. 151B, 31 
(1984). 

7. J. Bagger, D. Nemeschansky, and J. Zuber, Phys. Lett. 216B, 320 (1989). 

8. N. Ohta and H. Suzuki, Nucl. Phys. B332, 146 (1990). 

9. M. Kuwahara, N. Ohta, and H. Suzuki, Phys. Lett. 235B, 57 (1989). 

10. A. B. Zamolodchikov, JETP Lett. 43, 731 (1986); Soviet J. Nucl. Phys. 46, 1090 
(1987); Soviet J. Nucl. Phys. 44, 529 (1987). 

11. C. Vafa, Symposium on Fields, Strings, and Quantum Gravity, Beijing, 1989.

12. C. Vafa and N. P. Warner, Phys. Lett. 218B, 51 (1989). 

13. C. Vafa, Mod. Phys. Lett. A4, 1169, 1615 (1989). 

14. W. Lerche, C. Vafa, N. P. Warner, Nucl. Phys. B324, 427 (1989). 

15. E. Martinec, “Criticality, Catastrophe, and Compactifications,” in Physics and 
Mathematics of Strings, World Scientific, Singapore (1990). 

16. V. I. Arnold, Singularity Theory, London Math. Lec. Notes Series, vol. 53,
Cambridge University Press, London (1981); V. I. Arnold, S. M. Gusein-Zade, and
A. N. Varchenko, Singularities of Differentiable Maps, vol. 1, Birkhäuser, Basel
(1985).

17. E. Witten, Comm. Math. Phys. 121, 351 (1989). 

18. A. Schwimmer and N. Seiberg, Phys. Lett. 184B, 191 (1987). 

19. O. Alvarez, Nucl. Phys. B216, 125 (1983). 

20. B. R. Greene, C. Vafa, and N. P. Warner, Nucl. Phys. B324, 317 (1989).

21. P. Ginsparg, Nucl. Phys. B295, 153 (1988). 

22. E. B. Kiritsis, Phys. Lett. 217B, 427 (1988). 



CHAPTER 8 


Knot Theory and 
Quantum Groups 


8.1 Chern-Simons Approach to
Conformal Field Theory

In the previous chapters, we analyzed the various schemes that have been 
proposed to catalog large numbers of conformal field theories, especially the 
rational ones with a finite number of primary fields. In this chapter, we will 
explore the most ambitious one, which is the use of Chern-Simons gauge theory
[1] to classify conformal field theories. In the process, we will uncover a deep 
but unexpected relationship between conformal field theories and knot theory. 
Surprisingly, we will be able to use quantum field theory to generate new knot 
polynomials and analytic expressions for them. Knot theory, in turn, will be 
a tool by which we study conformal field theories and statistical mechanics, 
giving us a topological meaning to the Yang-Baxter relation. 

In contrast to the previous approaches to classifying conformal field theories, 
which were all completely defined in two dimensions, our starting point will 
be the Yang-Mills theory formulated as a pure Chern-Simons term in three
dimensions. Our philosophy will be that the “miracles” that occur in conformal
field theory are by-products of simpler structures that exist in three dimensions.
Viewed from the perspective of “flatland,” many puzzling relationships may
have no obvious origin. However, once we “leave flatland,” as Atiyah and 
Witten have suggested, the origin of these relations can be seen transparently. 
The basic premise, therefore, is to see gauge symmetry and general covariance 
as the origin of the fortunate “accidents” appearing in conformal field theory. 

Our starting point will be the action [1]: 

$$L = \frac{k}{8\pi} \int \epsilon^{ijk}\, \mathrm{Tr}\left[ A_i(\partial_j A_k - \partial_k A_j) + \tfrac{2}{3} A_i [A_j, A_k] \right]. \tag{8.1.1}$$


There are several unusual features to this action. First, the integrand is a total 
derivative, that is, it is a topological term that can be written as the integral of 
a derivative. With the usual boundary conditions at infinity, the Chern-Simons
action is zero. However, if the fields do not vanish rapidly enough at infinity, one
can show that the action does not vanish. The action is invariant under gauge
transformations that are continuously connected to the identity, but it is not
invariant under gauge transformations that have nonzero “winding numbers.” In fact,
under such a gauge transformation with integer winding number $m$, the action
changes by a constant multiple of $m$:

$$L \to L + \text{constant} \cdot m. \tag{8.1.2}$$

The second unusual feature of this action is that it is generally covariant 
without the presence of a metric tensor. For decades, physicists have viewed 
this symmetry as a result of integrating over all possible metric tensors in the 
functional integral. 

However, because the constant tensor $\epsilon^{ijk}$ transforms as a true density under
coordinate transformations, we see that the Chern-Simons action is actually a
generally covariant object, even if it lacks any metric tensor. There is no need
to insert the factor $\sqrt{g}$, built from the determinant of the metric tensor, into the action to form
scalar densities. This leads to the unusual conclusion that the observables of 
the theory must be independent of the parametrization, that is, they must be 
topological objects, with finite degrees of freedom. 

Topological field theories of this type are very strange when viewed from 
the point of view of ordinary quantum field theory. For example, even a point- 
particle field theory, the simplest possible quantum field theory, has an infinite 
number of degrees of freedom. These topological theories, because they only 
have a finite number of degrees of freedom, describe topological objects and 
are not physical in the strict sense of the word. In fact, we will see that the space 
of observables consists of pure numbers, for example, topological invariants 
associated with knots. 

The observables are gauge invariant and independent of the background 
metric (i.e., they are topological) and consist of Wilson loops 

$$W_R(C) = \mathrm{Tr}_R\, P \exp\left( i \oint_C A_i\, dx^i \right), \tag{8.1.3}$$

where we take the path-ordered product $P$ of exponentials around a closed loop
or knot $C$ and trace over the representation $R$ of the Lie group. The object that
we wish to study is the functional average of these Wilson loops defined over
knots $C_i$ (which are closed) or a series of “links,” which intertwine several
closed knots

$$\left\langle \prod_{i=1}^{n} W_{R_i}(C_i) \right\rangle = \int DA\, \exp(iL) \prod_{i=1}^{n} W_{R_i}(C_i). \tag{8.1.4}$$

This object must be a topological object, that is, the correlation function
remains the same when the background metric is changed. The topological invariants
of knot theory are called “knot invariants,” so we now have an analytic way in
which to generate knot invariants via quantum field theory. (However, it should
be emphasized that the string itself moves in 26-dimensional space and does
not form knots. In fact, in four and higher dimensions, it can be shown that
knots formed by the string can be untied or unraveled. Knots, consisting of
one-dimensional lines, only exist in three dimensions. To create higher-dimensional
knots, one has to twist planes and solids.)

The next step is to quantize the theory. There are two ways in which to 
perform quantization: 

(1) One may impose the constraints first and then quantize the theory in the 
reduced system, as in Coulomb gauge quantization. This has the advantage 
of working directly with the reduced space, which will turn out to be finite 
dimensional. If we work in the Coulomb gauge, we define the fields along 
a slice in three-dimensional space. Along the two-dimensional slice, we 
can, in turn, rewrite the coordinates in terms of z and z coordinates, that 
is, in terms of holomorphic and antiholomorphic coordinates. 

(2) One may alternatively quantize the system first and then impose the
constraint (Gauss’s law) on the Fock space, as in Gupta-Bleuler quantization.
In this way, we work with the full infinite-dimensional space spanned by
$A_i^a$, and only at the end do we see the finite-dimensional space emerge.

Let us analyze the first case. As in ordinary gauge theory, the constraint is the 
coefficient of the Lagrange multiplier $A_0$ in the action. In our case, however, the
constraint is qualitatively different from that found in ordinary gauge theory

$$\epsilon^{ij} F_{ij} = 0. \tag{8.1.5}$$

Gauss’s law tells us that the space $M$ spanned by the connections $A_i^a$ must
be reduced to the space of flat (i.e., zero curvature) connections, modulo gauge 
transformations. 

Naively, having zero curvature means that the connection is a pure gauge 
field, and hence, the theory is empty. In our case, the meaning of this strange 
constraint is that an infinite-dimensional system, labeled by $A_i^a$, has been
reduced to a system with only a finite number of degrees of freedom. The 
constraint destroys an infinite number of degrees of freedom, but it leaves a 
finite-dimensional system intact, spanned by topological objects. 

The space of flat connections modulo gauge transformation has already been 
classified by mathematicians, and it is the space of conformal blocks [1-3]. 
Thus, conformal field theory enters at the level of the Hilbert space of the 
theory after quantization. 

Flat connections, in turn, are characterized by their holonomies (i.e., Wilson 
loops) around closed paths. These paths, unlike the situation in ordinary gauge 
theory, are topologically defined; if one distorts them continuously, then the 
holonomy remains the same. 

That the Wilson loops are topologically defined can be seen by pinching off a
small circular deformation in a path-ordered product. The change in holonomy
around this small loop will be proportional to the curvature in that loop. But,
since the curvature is zero, there is no change if we make a small deformation
in the path. Thus, these paths are defined topologically, unlike the usual case
in ordinary gauge theory. They depend only on the homology cycles on the
Riemann surface, that is, the $a$ and $b$ cycles. The dimension of the conformal
blocks on this genus $g$ Riemann surface for a Lie group $G$ is finite and is given
by

$$2(g - 1) \dim G; \qquad g > 1. \tag{8.1.6}$$
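As a quick numerical illustration of Eq. (8.1.6), the sketch below evaluates the count $2(g-1)\dim G$ for the groups $SU(N)$, using $\dim SU(N) = N^2 - 1$. The function names are hypothetical, and the restriction to $SU(N)$ is an assumption made only to keep the example concrete.

```python
def dim_su(n):
    """Dimension of the Lie group SU(n)."""
    return n * n - 1


def flat_connection_count(genus, dim_g):
    """Evaluate 2 (g - 1) dim G from Eq. (8.1.6); the formula holds only for g > 1."""
    if genus <= 1:
        raise ValueError("Eq. (8.1.6) applies only for genus g > 1")
    return 2 * (genus - 1) * dim_g


for genus, n in [(2, 2), (2, 3), (3, 3)]:
    print(f"g = {genus}, SU({n}):", flat_connection_count(genus, dim_su(n)))
```

The point of the exercise is that the answer is a finite integer for every genus $g > 1$, even though the space of all connections is infinite dimensional.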

The situation differs slightly if we quantize in the presence of a knot or
Wilson loop defined in this space. When one slices a knot by taking equal-time
slices, one sees charges distributed along the surface, so the constraint equation
becomes

$$\frac{k}{8\pi}\, \epsilon^{ij} F^a_{ij}(x) = \sum_{s=1}^{r} \delta^2(x - P_s)\, T^a_{(s)}, \tag{8.1.7}$$

where the sources are located at points $P_s$ and $T^a_{(s)}$ are the generators in the
representation carried by the $s$th source. Then, the physical space of the
theory describes the conformal blocks for an $r$-point function for fields defined
in various representations of the algebra.

Let us analyze this quantization more carefully. Before gauge fixing, let us 
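split the three-dimensional space into $Y = \Sigma \times R$, a Riemann surface $\Sigma$ times
the time direction $R$. Let us use the language of forms for convenience. Then,
the exterior derivative $d$ and the gauge field $A$ split up as

$$d = dt\, \frac{\partial}{\partial t} + \tilde d, \qquad A = A_0 + \tilde A. \tag{8.1.8}$$

With this splitting the action becomes

$$S = -\frac{k}{4\pi} \int_Y \mathrm{Tr}\left( \tilde A\, \frac{\partial}{\partial t} \tilde A\, dt \right) + \frac{k}{2\pi} \int_Y \mathrm{Tr}\, A_0 \left( \tilde d \tilde A + \tilde A^2 \right). \tag{8.1.9}$$

Since the last term fixes the curvature to be zero, we can solve this to give

$$\tilde A = -(\tilde d U)\, U^{-1}, \tag{8.1.10}$$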


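which has zero curvature. Assuming $\Sigma$ is bounded, we can plug this into the
original action and find

$$S = k S_{WZW} = \frac{k}{4\pi} \int_{\partial Y} \mathrm{Tr}\left( U^{-1} \partial_\phi U\; U^{-1} \partial_t U \right) d\phi\, dt + \frac{k}{12\pi} \int_Y \mathrm{Tr}\left( U^{-1} dU \right)^3, \tag{8.1.11}$$

where $\phi$ is an angular variable defined around the perimeter of $\Sigma$.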

We see that the Chern-Simons action has become a version of the WZW
action, which we know is conformally invariant. Here, we see how the full
Kac-Moody algebra emerges from the Chern-Simons theory, as well as conformal
field theory. Likewise, we can define the theory on cosets and retrieve all the 
results of the GKO coset construction. 





Another way of viewing these results is to quantize first and then apply the 
constraints on the wave function later. In this scheme, $A_i^a$ still has an infinite
number of degrees of freedom, which become finite only when one applies the
constraints. The quantization is carried out by postulating the commutation
relations

$$[A^a_z(z), A^b_{\bar z}(w)] = \frac{4\pi}{k}\, \delta^{ab}\, \delta^2(z - w), \tag{8.1.12}$$

where we have converted the coordinates on $\Sigma$ into holomorphic and
antiholomorphic coordinates. 

We now postulate the existence of a wave functional $\Psi(A_z)$, where the field
$A_z$ has an infinite number of degrees of freedom. We wish to impose Gauss’s
law constraint directly onto this wave functional, thereby reducing the number
of degrees of freedom of the system. This wave functional transforms under a
gauge transformation labeled by $g$ as

$$[U(g)\Psi](A_z) = e^{ikS(g, A_z)}\, \Psi(A_z^g), \tag{8.1.13}$$

where $S$ is the WZW action. If we apply the constraint onto $\Psi(A_z)$, it means that
the wave function must be gauge invariant. Thus, the physical wave function
is given by [2]:

$$\Psi_{\text{phy}}(A_z) = \int Dg\; e^{ikS(g, A_z)}\, \Psi_0(A_z^g). \tag{8.1.14}$$

Our task is to show that $\Psi_{\text{phy}}$ depends only on a finite number of degrees of
freedom because it is invariant under the gauge constraint, while the original
$\Psi_0$ depends on an infinite number of degrees of freedom.

In contrast to ordinary gauge theory, in the Chern-Simons theory, we have
the freedom of gauging $A_z$ to a constant $a$ in the Cartan subalgebra, that is,

$$A_z = g a g^{-1} - \partial_z g\, g^{-1}, \tag{8.1.15}$$

so $A_z$ can be replaced by a constant $a$. Thus, the wave function’s infinite degrees
of freedom have been reduced. However, the remaining $\Psi_{\text{phy}}$ still has a finite
number of degrees of freedom. For example, let $\Sigma$ equal the torus $T^2$. Then $\Psi_{\text{phy}}$
is a function of the modular parameter $\tau$ and $a$. We can expand the wave
function in a complete set of states which have the correct boundary conditions
for $T^2$, which are the characters $\chi_\lambda(\tau, a)$. This is equivalent to writing $\Psi_{\text{phy}}$ as
a trace over the torus and then inserting a complete set of operators that belong
to a Verma module into the trace. We get

$$\Psi_{\text{phy}}(\tau, a) = \sum_\lambda A_\lambda\, \chi_\lambda(\tau, a). \tag{8.1.16}$$

Thus, the wave function, defined on a torus, can be expanded in the space of
characters defined on the torus, which in turn can be written as a Weyl-Kac
character formula.





Likewise, the same reasoning can be applied when $\Sigma$ equals a disk. Again,
we see that a gauge transformation on $\Psi$ can reduce the infinite degrees of
freedom contained within $A_z$, leaving only a function depending on the parameters
of the disk and the value of the constant $a$. We then expand the remaining wave
function in terms of a complete set of functions
defined on the disk, that is, conformal blocks. In this way, conformal blocks
enter the theory using the wave functional method.

In summary, we find a large set of correspondences between the Chern-Simons
theory and conformal field theory that allow us to classify the latter
via the gauge group $G$ and the coupling constant of the Chern-Simons theory.
Specifically, we found that: 

(1) the space of flat connections modulo gauge transformations is equivalent 
to the space of conformal blocks; 

(2) the space of states of the wave function defined on a torus is equivalent to 
the space of Weyl-Kac character functions; 

(3) when we choose the gauge group G with a subgroup H , we can construct 
the apparatus of the GKO coset method; and 

(4) a system with an infinite number of degrees of freedom is reduced, after 
quantization, to a space with a finite number of degrees of freedom, related 
to the invariant holonomies one can construct on $\Sigma$ with genus $g$.

A systematic study shows that all known rational conformal field theories 
can be constructed in this way. This is remarkable, because we have only 
assumed general covariance and gauge invariance, that is, we have derived the 
results of conformal field theory using much simpler structures. 


8.2 Elementary Knot Theory 

The next step is to calculate the correlation functions of the theory, which 
are matrix elements of Wilson loops. To understand how knot invariants arise 
from these correlation functions, it will be useful to review the developments 
in knot theory [4, 5]. Historically, knot theory has, over the decades, been a 
stagnant area of mathematics. The central problem of knot theory, the complete 
classification of all possible knots, has consistently eluded mathematicians. 
The problem is to construct certain expressions, called knot invariants , that 
can be placed in one-to-one correspondence with topologically distinct knots. 

The problem is an exceedingly difficult one because knots, even if they 
have only a small number of loops, become quickly snarled and entangled, 
making it difficult to decide which ones are really distinct and which ones can 
be deformed into each other without cutting the knot. 

Let us make a few simple definitions. We say a knot is
a single strand or line that is closed, that is, if we cut the knot, it unravels into a
single strand. Then, we define the unknot as a knot which can be topologically
deformed into a circle, without cutting. Also, define a link as a series of knots
that are intertwined and cannot be separated. If we cut $N$ distinct strands that
form a link, then it reduces to $N$ independent single strands.

The first major topological invariant in knot theory was found by Gauss; it 
is called the linking number and is defined as 

$$\Phi(C_a, C_b) = \frac{1}{4\pi} \oint_{C_a} dx^i \oint_{C_b} dy^j\; \epsilon_{ijk}\, \frac{(x - y)^k}{|x - y|^3}, \tag{8.2.1}$$

where $dx^i$ and $dy^j$ are defined along the knots $C_a$ and $C_b$.

The Gauss linking number is an analytic expression defined on a link that 
is topologically invariant. By performing the integration around the strands, 
one can, in principle, determine the degree to which several closed loops are 
linked together. Notice that the linking number remains invariant even if one 
smoothly deforms the contour, as long as one does not move contours past 
each other or cut them. Thus, this expression must be a topological invariant. 
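The Gauss integral can be evaluated numerically to see the invariance at work. The sketch below is a minimal illustration, assuming the standard normalization $1/4\pi$ and replacing each loop by a closed polygon; the helper names (`circle`, `segments`, `gauss_linking`) and the two test configurations are our own choices, not taken from the text. Since $\epsilon_{ijk}\,dx^i dy^j (x-y)^k = (dx \times dy)\cdot(x-y)$, the double integral becomes a double sum over polygon edges.

```python
import math


def circle(center, u, v, n):
    """Sample n points of the unit circle centered at `center` in the plane of u and v."""
    pts = []
    for k in range(n):
        a = 2.0 * math.pi * k / n
        pts.append(tuple(c + math.cos(a) * ui + math.sin(a) * vi
                         for c, ui, vi in zip(center, u, v)))
    return pts


def segments(points):
    """Return (midpoint, displacement) for each edge of the closed polygon."""
    n = len(points)
    segs = []
    for k in range(n):
        p, q = points[k], points[(k + 1) % n]
        mid = tuple((a + b) / 2.0 for a, b in zip(p, q))
        step = tuple(b - a for a, b in zip(p, q))
        segs.append((mid, step))
    return segs


def gauss_linking(curve_a, curve_b):
    """Midpoint-rule estimate of (1/4 pi) oint oint (dx x dy).(x - y) / |x - y|^3."""
    total = 0.0
    segs_b = segments(curve_b)
    for x, dx in segments(curve_a):
        for y, dy in segs_b:
            r = (x[0] - y[0], x[1] - y[1], x[2] - y[2])
            cross = (dx[1] * dy[2] - dx[2] * dy[1],
                     dx[2] * dy[0] - dx[0] * dy[2],
                     dx[0] * dy[1] - dx[1] * dy[0])
            dist = math.sqrt(r[0] ** 2 + r[1] ** 2 + r[2] ** 2)
            total += (cross[0] * r[0] + cross[1] * r[1] + cross[2] * r[2]) / dist ** 3
    return total / (4.0 * math.pi)


ring = circle((0, 0, 0), (1, 0, 0), (0, 1, 0), 200)
hopf_partner = circle((1, 0, 0), (1, 0, 0), (0, 0, 1), 200)  # threads through `ring`
far_partner = circle((3, 0, 0), (1, 0, 0), (0, 0, 1), 200)   # does not thread it
linked = gauss_linking(ring, hopf_partner)
unlinked = gauss_linking(ring, far_partner)
print(round(linked, 3), round(unlinked, 3))
```

The first pair forms a Hopf link and gives a value close to $\pm 1$ (the sign depends on the chosen orientations); the second pair is unlinked and gives a value close to zero.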

In the nineteenth century, Tait and Little began the arduous task of
classifying topologically distinct knots. In 1970, Conway pushed the
classification to 10 double points [6].

One fundamental advance was made in 1928 by Alexander [7], who showed
that it was possible to associate a polynomial, called the Alexander polynomial,
with each knot. If two knots had different polynomials, then they were
topologically distinct. This made it possible to take two complicated knots,
calculate their Alexander polynomials, and, whenever the polynomials differed,
rapidly conclude that the knots were topologically distinct.

For example, the circle is a trivial knot and has an Alexander polynomial 
given by the number 1. The clover leaf knot, shown in Fig. 8.1(a), has the 
Alexander polynomial given by 

$$\text{clover leaf:} \quad \Delta = t^2 - t + 1. \tag{8.2.2}$$

Another example is the figure-eight knot, shown in Fig. 8.1(b), with an
Alexander polynomial given by

$$\text{figure-eight:} \quad \Delta = t^2 - 3t + 1. \tag{8.2.3}$$

A more complicated knot is called the Stevedore’s knot, shown in Fig. 8.1(c),
with an Alexander polynomial given by

$$\text{Stevedore's knot:} \quad \Delta = 2t^2 - 5t + 2. \tag{8.2.4}$$

The Alexander polynomials $\Delta$, furthermore, can be shown to obey the
following identity. Let $L_+$, $L_-$, and $L_0$ be three links that are completely
identical, except that, at one juncture, they have the topology as shown in Fig.
8.2. Then, the Alexander polynomial, for these three different links, satisfies
the following relation:

$$\Delta_{L_+} - \Delta_{L_-} = \left( \sqrt{t} - 1/\sqrt{t} \right) \Delta_{L_0}, \tag{8.2.5}$$

which is called the “skein relation.” 
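The skein relation (8.2.5) can be used to recover the clover leaf polynomial (8.2.2) by resolving one crossing at a time. The sketch below is an illustration under stated assumptions: Laurent polynomials in $s = \sqrt{t}$ are stored as dictionaries `{power of s: coefficient}`, the unknot is normalized to 1, and the crossings being resolved are taken to be of type $L_+$ (for the Alexander polynomial the opposite choice gives the same answer). The helper names `add` and `mul` are hypothetical.

```python
def add(p, q, sign=1):
    """Return p + sign * q for Laurent polynomials stored as {power of s: coeff}."""
    out = dict(p)
    for k, c in q.items():
        out[k] = out.get(k, 0) + sign * c
    return {k: c for k, c in out.items() if c != 0}


def mul(p, q):
    """Product of two Laurent polynomials in s = sqrt(t)."""
    out = {}
    for k1, c1 in p.items():
        for k2, c2 in q.items():
            out[k1 + k2] = out.get(k1 + k2, 0) + c1 * c2
    return {k: c for k, c in out.items() if c != 0}


Z = {1: 1, -1: -1}   # sqrt(t) - 1/sqrt(t)
unknot = {0: 1}
# Adding a curl to the unknot gives L+ = L- = unknot and L0 = two unlinked
# circles, so the skein relation forces Delta(unlink) = 0.
unlink = {}

# Delta(L+) = Delta(L-) + Z * Delta(L0)
hopf = add(unlink, mul(Z, unknot))      # L- = unlink, L0 = unknot
trefoil = add(unknot, mul(Z, hopf))     # L- = unknot, L0 = Hopf link

# Multiply by t = s**2 to clear negative powers, then convert powers of s to powers of t.
trefoil_in_t = {(k + 2) // 2: c for k, c in trefoil.items()}
print(trefoil_in_t)
```

The result is $t - 1 + t^{-1}$, which after multiplication by $t$ is the polynomial $t^2 - t + 1$ of Eq. (8.2.2); the Alexander polynomial is only defined up to such an overall power of $t$.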







However, Alexander polynomials have an important defect; they are not 
powerful enough to distinguish between all topologically distinct knots. In fact, 
they often fail to distinguish between elementary knots. For example, both the 
granny knot and the square knot have the same Alexander polynomial, but they 
are topologically distinct. Also, the Alexander polynomial cannot distinguish 
between knots that are mirror reflections of each other. 

Another important development in knot theory was Artin’s theory of braids 
[8]. Because knots are such difficult objects to manipulate, Artin introduced 
a much simpler system by which to analyze knots. A braid begins as a series 
of parallel strands with equal length arranged in a definite sequence. We are 
allowed to cross one strand over another by a braiding operator. Let $U_i$ be the
braiding operator that moves the $i$th line across the $(i + 1)$th line. Then, Artin
showed that these braiding operators form a group

$$\text{braid group:} \quad U_i U_j = U_j U_i, \quad |i - j| \geq 2; \qquad U_i U_{i+1} U_i = U_{i+1} U_i U_{i+1}. \tag{8.2.6}$$

FIGURE 8.2. ($L_+$, $L_-$, $L_0$)

FIGURE 8.3. (strands 1, 2, 3)


In Fig. 8.3, we have an example of a braid specified by the U operation. 
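The two defining relations of Artin's braid group, $U_i U_j = U_j U_i$ for $|i - j| \geq 2$ and $U_i U_{i+1} U_i = U_{i+1} U_i U_{i+1}$, can be checked in any concrete matrix representation. The sketch below uses the unreduced Burau representation, which is not discussed in the text and is introduced here only as a convenient test case; the value $t = 2/3$ is an arbitrary choice, and exact fractions are used so that the comparisons are exact.

```python
from fractions import Fraction


def burau(i, n, t):
    """Unreduced Burau matrix of the generator U_i (1-based) on n strands."""
    m = [[Fraction(int(r == c)) for c in range(n)] for r in range(n)]
    r = i - 1
    # Replace the 2x2 block acting on strands i and i+1 by [[1 - t, t], [1, 0]].
    m[r][r], m[r][r + 1] = 1 - t, t
    m[r + 1][r], m[r + 1][r + 1] = Fraction(1), Fraction(0)
    return m


def matmul(a, b):
    """Product of two square matrices given as nested lists."""
    n = len(a)
    return [[sum(a[r][k] * b[k][c] for k in range(n)) for c in range(n)]
            for r in range(n)]


t = Fraction(2, 3)
u1, u2, u3 = (burau(i, 4, t) for i in (1, 2, 3))

far_commute = matmul(u1, u3) == matmul(u3, u1)
braid_relation = matmul(matmul(u1, u2), u1) == matmul(matmul(u2, u1), u2)
neighbors_commute = matmul(u1, u2) == matmul(u2, u1)
print(far_commute, braid_relation, neighbors_commute)
```

Distant generators commute and neighboring generators satisfy the cubic relation, but neighboring generators do not commute, which is what distinguishes the braid group from an abelian group.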

The advantage of using the braid group is that we can form knots and links 
out of these braids. If we identify the two sets of endpoints of the braid, then 
the lines close on themselves and we get a link. For example, n parallel lines 
forming a braid, when wrapped in this fashion, make n closed loops, or unknots. 
All knots and links can be generated by wrapping a braid. (However, the power 
of this technique is limited because two different braids, when wrapped, may 
yield the same knot or link.) 

After years of slow progress in knot theory, two breakthroughs
came quite suddenly, within the last few years. The first breakthrough,
after an interval of almost 60 years, was the development of the Jones polynomial
in 1985 [9, 10], which associated a new polynomial to every knot, more
powerful than the earlier Alexander polynomial. With the Jones polynomial,
the classification of all possible knots suddenly seemed within reach. The origin
of the Jones polynomial, however, was quite obscure. Also, the Jones polynomial
still was not powerful enough to generate a one-to-one relation between
a polynomial and a knot.

The second development, which we will soon discuss, came from an entirely
unexpected area, two- and three-dimensional quantum field theories that
could be written in terms of knots. In fact, it became possible to rederive the
Jones polynomial from physics and to create infinitely many more polynomials.
Eventually, with this new generation of knot invariants coming from
physics, it may be possible to find a one-to-one description of knots in terms
of polynomials, although this is still not certain.




8.3 Jones Polynomial and the Braid Group 

Jones made the following observation [9]. In mathematics, there is the von
Neumann algebra $A_n$, a finite-dimensional algebra whose elements $e_i$ obey
the following relations:

$$e_i^2 = e_i, \qquad e_i^* = e_i,$$
$$e_i e_{i \pm 1} e_i = [t/(1 + t)^2]\, e_i, \tag{8.3.1}$$
$$e_i e_j = e_j e_i, \qquad |i - j| \geq 2.$$

Jones noticed the similarity between the von Neumann algebra $A_n$
and Artin’s braid group $B_n$. Specifically, one can make the following
correspondence between the two algebras

$$U_i \to \sqrt{t}\, [t e_i - (1 - e_i)]. \tag{8.3.2}$$
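As a check on the correspondence (8.3.2), the short derivation below shows that the image of $U_i$ satisfies Artin's cubic braid relation. It is a sketch that assumes only the relations (8.3.1); the shorthand $a = 1 + t$ is our own notation, not the text's.

```latex
% Write the image of the braid generator as
U_i = \sqrt{t}\,(a\,e_i - 1), \qquad a = 1 + t .

% Expand the triple product, using e_i^2 = e_i:
U_i U_{i+1} U_i = t^{3/2}\Big[ a^3\, e_i e_{i+1} e_i
   - a^2 \left( e_i e_{i+1} + e_{i+1} e_i \right)
   - a^2 e_i + 2a\, e_i + a\, e_{i+1} - 1 \Big] .

% Insert e_i e_{i+1} e_i = (t/a^2)\, e_i and collect the e_i terms:
a t - a^2 + 2a = a\,(t - (1 + t) + 2) = a ,

% so that
U_i U_{i+1} U_i = t^{3/2}\Big[ a \left( e_i + e_{i+1} \right)
   - a^2 \left( e_i e_{i+1} + e_{i+1} e_i \right) - 1 \Big] .
```

The final expression is symmetric under $i \leftrightarrow i+1$, so it equals $U_{i+1} U_i U_{i+1}$. The specific value $t/(1+t)^2$ in (8.3.1) is exactly what makes the $e_i$ and $e_{i+1}$ coefficients agree.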

Although this correspondence between two seemingly unrelated algebras
appears to have no consequence, there is an important difference: on the von
Neumann algebra (in contrast to the Artin algebra), it is possible to define the
trace operation, which sends elements of $A_n$ into a complex number, such that

$$\mathrm{Tr}(ab) = \mathrm{Tr}(ba),$$
$$\mathrm{Tr}(w e_{n+1}) = [t/(1 + t)^2]\, \mathrm{Tr}\, w \quad \text{if } w \text{ is in } A_n, \tag{8.3.3}$$
$$\mathrm{Tr}(a^* a) > 0 \quad \text{if } a \neq 0,$$

and $\mathrm{Tr}(1) = 1$. Because an invariant trace is defined on the von Neumann
algebra and a correspondence can be made between the generators of the von
Neumann algebra and the generators of the Artin algebra, we can now define
the invariant trace operation on the Artin algebra

$$V_L(t) = \left[ -(t + 1)/\sqrt{t} \right]^{n-1} \mathrm{Tr}[r_t(b)], \tag{8.3.4}$$

where $r_t$ is the operation that sends generators of one algebra into the other.

This invariant polynomial $V_L(t)$ is called the Jones polynomial. Like the
Alexander polynomial, it also satisfies a skein relationship

$$(1/t)\, V_{L_-} - t\, V_{L_+} = \left( \sqrt{t} - \frac{1}{\sqrt{t}} \right) V_{L_0}. \tag{8.3.5}$$

(Via the skein relation, one can recursively generate the Jones polynomials.
One starts with the unknot, which has the knot polynomial 1, and then
successively applies the skein relation to get the polynomial of increasingly more
complicated knots.)
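The recursion just described can be carried out mechanically. The sketch below applies the skein relation (8.3.5) to go from the unknot to the Hopf link and then to the clover leaf knot. It is an illustration under stated assumptions: Laurent polynomials in $s = \sqrt{t}$ are stored as `{power of s: coefficient}`, the crossing being resolved is always taken to be of type $L_+$, and the helper names are hypothetical. Because the orientation conventions of Fig. 8.2 are not recoverable here, the result may correspond to either the clover leaf or its mirror image (which differ by $t \to 1/t$).

```python
def add(p, q, sign=1):
    """Return p + sign * q for Laurent polynomials stored as {power of s: coeff}."""
    out = dict(p)
    for k, c in q.items():
        out[k] = out.get(k, 0) + sign * c
    return {k: c for k, c in out.items() if c != 0}


def mul(p, q):
    """Product of two Laurent polynomials in s = sqrt(t)."""
    out = {}
    for k1, c1 in p.items():
        for k2, c2 in q.items():
            out[k1 + k2] = out.get(k1 + k2, 0) + c1 * c2
    return {k: c for k, c in out.items() if c != 0}


def shift(p, k):
    """Multiply a Laurent polynomial by s**k (k = 2 is multiplication by t)."""
    return {e + k: c for e, c in p.items()}


Z = {1: 1, -1: -1}   # sqrt(t) - 1/sqrt(t)


def jones_plus(v_minus, v_zero):
    """Solve (1/t) V(L-) - t V(L+) = Z * V(L0) for V(L+)."""
    t_times_v_plus = add(shift(v_minus, -2), mul(Z, v_zero), sign=-1)
    return shift(t_times_v_plus, -2)


unknot = {0: 1}
unlink = {1: -1, -1: -1}   # -(sqrt(t) + 1/sqrt(t)) for two unlinked circles
# Consistency check: a curl on the unknot has L+ = L- = unknot, L0 = unlink.
curl_check = add(shift(unknot, -2), shift(unknot, 2), sign=-1) == mul(Z, unlink)

hopf = jones_plus(unlink, unknot)       # L- = unlink, L0 = unknot
trefoil = jones_plus(unknot, hopf)      # L- = unknot, L0 = Hopf link
trefoil_in_t = {k // 2: c for k, c in trefoil.items()}
print(curl_check, hopf, trefoil_in_t)
```

The clover leaf result is $t^{-1} + t^{-3} - t^{-4}$. Unlike the Alexander polynomial, this is not symmetric under $t \to 1/t$, which is how the Jones polynomial can tell a knot from its mirror image.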

The advantage of the Jones polynomial is that it reveals elegant relations
between knots and other types of algebras, as well as being much more powerful
than the Alexander polynomial. Some topologically distinct knots that have the
same Alexander polynomial can be shown to have different Jones polynomials.
(However, the Jones polynomial, as we mentioned, fails to establish a one-to-one
relationship between a polynomial and a knot, that is, two topologically
distinct knots may have the same Jones polynomial.) After the initial discovery
of the Jones polynomial, several other, more powerful, polynomials were then
discovered.

To analyze these higher polynomials, let us make a few definitions. If two
links $L_1$ and $L_2$ are topologically equivalent, we say that they are ambient
isotopic. It has been shown that two links are ambient isotopic if and only if
there is a finite sequence of moves (called Reidemeister moves) that can deform
$L_1$ into $L_2$. These moves, which come in three types, are shown in Fig. 8.4.

However, there is also a second concept, called regular isotopy. Two links
are regular isotopic if one can be deformed into the other by using only Type
II and Type III moves.

Regular isotopic links have a simple interpretation. Let us temporarily
replace each strand in a knot by a long, flat ribbon, which makes a “framed knot.”
An ordinary knot and a framed knot are topologically distinct because twisting
a ribbon within a framed knot produces a new configuration, while twisting a
strand of a knot does nothing.

Notice that Type I Reidemeister moves are not allowed for two-dimensional
or framed knots. If we execute the Reidemeister Type I move for a ribbon, we
find that the ribbon is left with a twist when it is straightened out.





Let us now introduce the HOMFLY polynomial $P_L(t, z)$ [11], named after Hoste,
Ocneanu, Millett, Freyd, Lickorish, and Yetter. It is a finite Laurent polynomial
in two variables and is more powerful than the Jones polynomial. If two links
are ambient isotopic, then they have the same HOMFLY polynomial. These
HOMFLY polynomials are defined via

$$t P_{L_+} - t^{-1} P_{L_-} = z P_{L_0}, \tag{8.3.6}$$

where $L_\pm$ and $L_0$ were defined in Fig. 8.2.

The HOMFLY polynomials are defined recursively. If we set the unknotted
knot as $P_0 = 1$, then, by successively knotting various parts of the line via $L_\pm$
and $L_0$, we can gradually build up knots and links of arbitrary complexity.

Both the Alexander polynomial and the Jones polynomial can be represented
as special cases of the HOMFLY polynomial. From Eqs. (8.2.5) and (8.3.5),
we see

$$P_L(t = 1,\; z = \sqrt{t} - 1/\sqrt{t}) = \text{Alexander polynomial},$$
$$P_L(t,\; z = -\sqrt{t} + 1/\sqrt{t}) = \text{Jones polynomial}. \tag{8.3.7}$$
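The two specializations follow by matching skein relations term by term. The lines below are a sketch of that comparison; to avoid a clash of symbols, the first argument of $P_L$ is written here as $\ell$ (our notation, not the text's), while $t$ denotes the variable of the Alexander and Jones polynomials.

```latex
% HOMFLY skein relation with first argument \ell:
\ell\, P_{L_+} - \ell^{-1} P_{L_-} = z\, P_{L_0} .

% Alexander: set \ell = 1 and z = \sqrt{t} - 1/\sqrt{t}, which reproduces (8.2.5):
P_{L_+} - P_{L_-} = \left( \sqrt{t} - 1/\sqrt{t} \right) P_{L_0} .

% Jones: multiply (8.3.5) by -1,
t\, V_{L_+} - t^{-1} V_{L_-} = \left( -\sqrt{t} + 1/\sqrt{t} \right) V_{L_0} ,

% which is the HOMFLY relation with \ell = t and z = -\sqrt{t} + 1/\sqrt{t}.
```

Since all three polynomials are normalized to 1 on the unknot and are generated recursively from their skein relations, agreement of the relations implies agreement of the polynomials.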

The deficiency of the HOMFLY polynomial, however, is that it is not powerful
enough to distinguish between all topologically distinct links, that is, as with the
Jones polynomial, it is possible to construct two links that are ambient isotopically
distinct but have the same HOMFLY polynomial.

Notice that the Alexander, Jones, and HOMFLY polynomials share the property
of categorizing links that are ambient isotopically the same. However, it
is possible to introduce another polynomial, the Kauffman polynomial [12],
that is used to analyze links which are only regular isotopically the same (i.e.,
for framed links or ribbons).

The Kauffman polynomial $R_L$ is defined by

$$R_{L_+} - R_{L_-} = z R_{L_0} \tag{8.3.8}$$

with the additional condition

$$R_{\hat L_+} = \alpha R_{\hat L_0}, \qquad R_{\hat L_-} = \alpha^{-1} R_{\hat L_0}, \tag{8.3.9}$$

where the $\hat L_\pm$ and $\hat L_0$ are given in Fig. 8.5. Because the HOMFLY and Kauffman
polynomials are related to ambient and regular isotopy, respectively, they cannot,
of course, be directly related to each other via an ambient isotopic equation. However,
it is possible to write the following relationship, which is defined only up to
regular isotopy

$$P_L(t = \alpha, z) = \alpha^{-w(L)} R_L(\alpha, z), \tag{8.3.10}$$

where $w(L)$, called the writhe, is given by

$$w(L) = \sum_p \epsilon(p), \tag{8.3.11}$$

where the sum runs over the crossings $p$ of the link and $\epsilon(L_\pm) = \pm 1$. The
important point is that the writhe is only a regular isotopic invariant, and hence,
there is no contradiction in the above equation. The
distinction between the ambient isotopically defined HOMFLY polynomial
and the regular isotopically defined Kauffman polynomial will soon become
important when we use quantum field theory to generate these knot invariants.

FIGURE 8.5.
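The writhe is simple to compute once the crossing signs of a diagram are known. The sketch below is a minimal illustration with hypothetical function names; the crossing-sign lists are supplied by hand and are assumptions of the example, not data from the text. It also evaluates the prefactor $\alpha^{-w(L)}$ that appears in Eq. (8.3.10).

```python
from fractions import Fraction


def writhe(crossing_signs):
    """Sum of the crossing signs epsilon(p) = +1 or -1 over all crossings p."""
    if any(s not in (1, -1) for s in crossing_signs):
        raise ValueError("each crossing sign must be +1 or -1")
    return sum(crossing_signs)


def homfly_prefactor(alpha, crossing_signs):
    """The factor alpha**(-w(L)) relating R_L(alpha, z) to P_L(t = alpha, z)."""
    return alpha ** (-writhe(crossing_signs))


three_crossings = [1, 1, 1]        # a diagram whose three crossings all have sign +1
with_curl = [1, 1, 1, -1]          # the same diagram after adding one negative curl
print(writhe(three_crossings), writhe(with_curl))
print(homfly_prefactor(Fraction(2), three_crossings))
```

Adding a curl is a Type I Reidemeister move: it changes the writhe by one unit, so the writhe is not an ambient isotopy invariant, while Type II and Type III moves leave it unchanged.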


8.4 Quantum Field Theory and Knot Invariants 


The remarkable feature about the Chern-Simons field theory is that the
correlation functions must be topological (since the metric never appears in the
action), and hence, it reproduces the known invariants of topology and
generates entirely new classes of invariants as well. For example, take the matrix
element between two gauge fields

$$\langle A^a_\mu(x) A^b_\nu(y) \rangle = \frac{i}{k}\, \delta^{ab}\, \epsilon_{\mu\nu\rho}\, \frac{(x - y)^\rho}{|x - y|^3}. \tag{8.4.1}$$

Notice that the two-point function in a Chern-Simons theory is not the usual
propagator, with a well-defined light cone and propagation of states. Also,
notice that the Chern-Simons theory is linear in derivatives.

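Next, let us power expand the Wilson loop as follows [13, 14]:

$$W(C) = \mathrm{Tr}\Big[ 1 + i \oint_C dx^\mu A_\mu(x) - \oint_C dx^\mu \int^x dy^\nu A_\nu(y) A_\mu(x)$$
$$\qquad - i \oint_C dx^\mu \int^x dy^\nu \int^y dz^\rho A_\rho(z) A_\nu(y) A_\mu(x)$$
$$\qquad + \oint_C dx^\mu \int^x dy^\nu \int^y dz^\rho \int^z dw^\sigma A_\sigma(w) A_\rho(z) A_\nu(y) A_\mu(x) + \cdots \Big]. \tag{8.4.2}$$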
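The nested integration limits in this expansion are what reproduce the factorials of an ordinary exponential when the fields commute. The sketch below checks this for the second-order term in the simplest setting, an abelian field restricted to the loop and represented by an ordinary function $f$ of the loop parameter; the choice of $f$ and the function names are assumptions made for illustration. In that case the ordered double integral should equal one half of the square of the full loop integral.

```python
import math


def full_integral(f, n):
    """Midpoint-rule estimate of the loop integral of f over the parameter range [0, 1]."""
    h = 1.0 / n
    return sum(f((k + 0.5) * h) for k in range(n)) * h


def ordered_double_integral(f, n):
    """Estimate of int_0^1 dx int_0^x dy f(y) f(x), keeping only pairs with y before x."""
    h = 1.0 / n
    total, running = 0.0, 0.0
    for k in range(n):
        value = f((k + 0.5) * h)
        total += value * running * h * h   # running = sum of f at all earlier points
        running += value
    return total


def f(t):
    # Hypothetical abelian field pulled back to the loop, chosen only for illustration.
    return 1.0 + 0.5 * math.cos(2.0 * math.pi * t)


loop = full_integral(f, 2000)
ordered = ordered_double_integral(f, 2000)
print(round(ordered, 4), round(0.5 * loop ** 2, 4))
```

For non-abelian fields the matrices at different points do not commute, so the ordering in Eq. (8.4.2) carries genuine information and this simple factorization no longer holds.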




The advantage of this power expansion is that we can now take the matrix 
element of the Wilson loop and calculate the topological invariants of knot 
theory. However, there are some ambiguities that have to be clarified. 

Expanding in powers of $1/k$, we find that the first term in the correlation
function is

$$\langle W(C) \rangle_0 = \dim R. \tag{8.4.3}$$

To the next order in $1/k$, we have

$$\langle W(C) \rangle_1 = -\mathrm{Tr}(R^b R^a) \oint_C dx^\mu \int^x dy^\nu\, \langle A^b_\nu(y) A^a_\mu(x) \rangle = -\frac{2\pi i}{k}\, \dim R\; c_2(R)\, \Phi(C), \tag{8.4.4}$$

where $R^a$ are the generators of the group, $c_2(R)\, \mathbf{1} = R^a R^a$, and

$$\Phi(C) = \frac{1}{4\pi} \oint_C dx^\mu \oint_C dy^\nu\; \epsilon_{\mu\nu\rho}\, \frac{(x - y)^\rho}{|x - y|^3}. \tag{8.4.5}$$

Notice that the above expression is actually ambiguous when $x = y$. Oddly
enough, the answer is dependent on how we regularize the correlation function.
There are many ways to regularize this function when $x$ and $y$ are coincident,
and the correlation function changes with each one. (This is actually
reasonable, because the regularization dependent terms are phases, which drop out
when calculating the squares of amplitudes.)

The most common procedure is to introduce a framing for each contour, that
is, to expand each line into a flat, two-dimensional ribbon. We can frame a
knot by introducing a vector $n^\mu$ that is orthogonal to $C$, that is, we can
replace the contour $C$ with a framing contour $C_f$ defined as

$$x^\mu(t) \to y^\mu(t) = x^\mu(t) + \epsilon\, n^\mu(t) \tag{8.4.6}$$

for small $\epsilon$. With this framing, $\phi(C)$ becomes the linking number of
$C$ and $C_f$. Thus, we have now reproduced a result from classical
mathematics, the Gauss winding number for a knot, from quantum field theory.
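
The framing prescription can be checked numerically. The following is a minimal sketch, assuming the Gauss integral has the standard form $\frac{1}{4\pi}\oint\oint \epsilon_{\mu\nu\rho}\, dx^\mu dy^\nu (x-y)^\rho / |x-y|^3$ and is evaluated between $C$ and its framing contour $C_f$. The helper names `gauss_linking` and `framed_circle`, the choice of a unit circle, and the values of `eps` and `n` are illustrative assumptions, not taken from the text.

```python
import math

def gauss_linking(curve_a, curve_b):
    """Approximate the Gauss linking integral of two closed polygonal curves.

    Each curve is a list of (x, y, z) points; the last point joins the first.
    Midpoint rule for (1/4pi) * sum of r . (da x db) / |r|^3 over segment pairs.
    """
    def segments(curve):
        n = len(curve)
        out = []
        for i in range(n):
            p, s = curve[i], curve[(i + 1) % n]
            mid = tuple((p[k] + s[k]) / 2 for k in range(3))
            d = tuple(s[k] - p[k] for k in range(3))
            out.append((mid, d))
        return out

    total = 0.0
    segs_b = segments(curve_b)
    for ma, da in segments(curve_a):
        for mb, db in segs_b:
            r = (ma[0] - mb[0], ma[1] - mb[1], ma[2] - mb[2])
            cross = (da[1] * db[2] - da[2] * db[1],
                     da[2] * db[0] - da[0] * db[2],
                     da[0] * db[1] - da[1] * db[0])
            dist = math.sqrt(r[0] ** 2 + r[1] ** 2 + r[2] ** 2)
            total += (r[0] * cross[0] + r[1] * cross[1] + r[2] * cross[2]) / dist ** 3
    return total / (4 * math.pi)

def framed_circle(twists, eps=0.3, n=400):
    """Unit circle C and its framing contour C_f = x + eps * n(t).

    The normal vector n(t) rotates `twists` times about the tangent of C.
    """
    c, cf = [], []
    for i in range(n):
        t = 2 * math.pi * i / n
        radial = (math.cos(t), math.sin(t), 0.0)
        a, b = math.cos(twists * t), math.sin(twists * t)
        normal = (a * radial[0], a * radial[1], b)
        c.append(radial)
        cf.append(tuple(radial[k] + eps * normal[k] for k in range(3)))
    return c, cf

for m in (0, 1, 2):
    c, cf = framed_circle(m)
    print(m, round(gauss_linking(c, cf), 3))
```

The magnitude of the result counts how many times the ribbon is twisted, which illustrates why the one-loop term depends on the choice of framing and not only on the knot itself.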

We can do better than just rederive known topological invariants; in fact, 
we can derive analytical expressions for knot invariants that are only known 
abstractly to mathematicians. 

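For example, taking the next order in $1/k$, we find an analytical expression
for the second coefficient of the Alexander polynomial [13, 14]:

$$\langle W(C) \rangle_2 = \left(\frac{2\pi}{k}\right)^2 \dim R \left[ -\tfrac{1}{2} c_2^2(R)\, \phi^2(C) + c_v\, c_2(R)\, \rho(C) \right], \tag{8.4.7}$$

where $\rho(C)$ is $\rho_1 + \rho_2$ and

$$\begin{aligned}
\rho_1(C) &= -\frac{1}{32\pi^3} \oint dx^\mu \int^x dy^\nu \int^y dz^\rho\; \epsilon^{\alpha\beta\gamma} \epsilon_{\mu\alpha\sigma} \epsilon_{\nu\beta\lambda} \epsilon_{\rho\gamma\tau}\; I^{\sigma\lambda\tau}(x, y, z), \\
I^{\sigma\lambda\tau}(x, y, z) &= \int d^3w\; \frac{(w-x)^\sigma (w-y)^\lambda (w-z)^\tau}{|w-x|^3\, |w-y|^3\, |w-z|^3},
\end{aligned} \tag{8.4.8}$$

and

$$\rho_2(C) = \frac{1}{8\pi^2} \oint dx^\mu \int^x dy^\nu \int^y dz^\rho \int^z dw^\sigma\; \epsilon_{\sigma\nu\alpha} \epsilon_{\rho\mu\beta}\; \frac{(w-y)^\alpha (z-x)^\beta}{|w-y|^3\, |z-x|^3}. \tag{8.4.9}$$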


In this way, we can obviously continue and obtain analytic expressions for 
knots of arbitrary complexity. However, we now would like to make contact 
with the Jones polynomial and the Kauffman polynomial. 

The essential point is that the matrix elements of Wilson lines are ill defined 
when two points coincide, and hence we must introduce framed links. Thus, the 
knot invariants that arise from quantum field theory must be regular isotopically 
defined links. 

Now, let us derive the knot invariants for arbitrary links via quantum field
theory. We will find it convenient to introduce yet another invariant, called
$S_L(\alpha, \beta, z)$, which is a function of three variables and is defined
for regularly isotopic links.

This new knot invariant satisfies [13, 14]:

$$\begin{aligned}
S_{\hat L_+} &= \alpha\, S_{\hat L_0}, \\
S_{\hat L_-} &= \alpha^{-1} S_{\hat L_0}, \\
\beta\, S_{L_+} - \beta^{-1} S_{L_-} &= z\, S_{L_0}.
\end{aligned} \tag{8.4.10}$$

We can relate this new knot invariant to the HOMFLY invariant via

$$P_L(t = \alpha\beta, z) = \alpha^{-w(L)}\, S_L(\alpha, \beta, z). \tag{8.4.11}$$

If we define $S_0 = 1$ for the unknot, then we can, via these generalized skein
relations, gradually build up links of arbitrary complexity.

The whole point of introducing yet another knot invariant is that correlation
functions of the Wilson loop generate $S_L$, which in turn can be related to
the HOMFLY, Kauffman, and Jones polynomials. To prove this, we will show
that $\langle W(C) \rangle$ satisfies these skein relations, which will be
sufficient to show that the correlation functions equal $S_L$ for a link of
arbitrary complexity. We will demonstrate that the correlation functions of
Wilson loops satisfy these generalized skein relations by explicit calculation.

Let us analyze the difference between $\hat L_\pm$ and $\hat L_0$ from the
quantum field point of view. We see that the only difference between these
three configurations is the insertion of a loop, which can be taken to be
infinitesimally small. Our task, therefore, is to see how a quantum correlation
function changes when we insert a small loop into a Wilson line.

We begin by defining a Wilson line from $x_1$ to $x_2$ and then inserting an
infinitesimally small loop at point $x$, which lies between the endpoints. Since
the line integral around a small loop generates the curvature tensor, the Wilson
line operator $U(x_1, x_2)$ becomes

$$U(x_1, x_2) \to U(x_1, x)\; i\,\Sigma^{\mu\nu} F^a_{\mu\nu} R^a\; U(x, x_2), \tag{8.4.12}$$

where

$$\Sigma^{\mu\nu} = dx^\mu\, dx^\nu \tag{8.4.13}$$

is the area tensor characterizing the small loop and $R^a$ is a generator of the
algebra.

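Fortunately, the insertion of $F^a_{\mu\nu}$ inside a correlation function is
equivalent to taking the functional derivative of $e^{iS}$ with respect to the
field. We can then integrate by parts to obtain the following:

$$\begin{aligned}
\langle F^a_{\mu\nu}(x)\, O_1 O_2 \cdots \rangle &= Z^{-1} \int DA \left( -\frac{4\pi i}{k} \right) \epsilon_{\mu\nu\rho}\, \frac{\delta e^{iS}}{\delta A^a_\rho(x)}\; O_1 O_2 \cdots \\
&= Z^{-1} \int DA\; \frac{4\pi i}{k}\, e^{iS}\, \epsilon_{\mu\nu\rho}\, \frac{\delta}{\delta A^a_\rho(x)} \left( O_1 O_2 \cdots \right).
\end{aligned} \tag{8.4.14}$$

The next step is to take the derivative of the various $U$, which pulls down an
$R^a$, so the insertion of $F^a_{\mu\nu}$ generates the insertion of

$$-i\,\frac{4\pi}{k}\, \delta^3(x - y)\, \epsilon_{\mu\nu\rho}\, \Sigma^{\mu\nu} dy^\rho \times \Big\langle \cdots U(x_1, x) \sum_a R^a R^a\, U(y, x_2) \cdots \Big\rangle. \tag{8.4.15}$$

Thus, we see that the insertion of a small loop at point $x$ yields the
insertion of the curvature tensor, which in turn generates a factor
proportional to

$$\delta^3(x - y)\, \epsilon_{\mu\nu\rho}\, \Sigma^{\mu\nu} dy^\rho, \tag{8.4.16}$$

which is a volume element oriented along the tangent to the Wilson line. We
can normalize this volume element to be 0 or $\pm 1$, depending on the
orientation of $\Sigma^{\mu\nu}$. Thus, the change in the correlation function
by inserting a small loop along the Wilson loop is

$$\delta \langle W(L) \rangle = \mp (i 4\pi / k)\, c_2(R)\, \langle W(L) \rangle. \tag{8.4.17}$$

Now, compare this to the first two of the generalized skein relations. We see
that we can now set

$$\begin{aligned}
\langle W(\hat L_+) \rangle &= \alpha\, \langle W(\hat L_0) \rangle, \\
\langle W(\hat L_-) \rangle &= \alpha^{-1} \langle W(\hat L_0) \rangle,
\end{aligned} \tag{8.4.18}$$

where we now have an explicit quantum theoretic expression for $\alpha$:

$$\alpha = 1 - i\,\frac{2\pi}{k}\, c_2(R) + O\!\left( \frac{1}{k^2} \right). \tag{8.4.19}$$

Now, to prove the last of the skein relations, we repeat the same steps as
before, except for the configurations $L_\pm$ and $L_0$.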



In order to reduce the skein relations to the insertion of small loops, we must
analyze the crossing of two Wilson lines. Notice that the crossing of $L_+$ can
be deformed into the crossing of $L_-$ if we insert a small loop at the precise
point of the crossing. For example, if the Wilson line $U(x_1, x_2)$ passes over
the line $U(x_3, x_4)$ at point $x$, then the insertion of a loop into
$U(x_1, x_2)$ at the crossing point $x$ changes the topology: now, $U(x_1, x_2)$
passes under $U(x_3, x_4)$. Concretely, we see that we can pass from $L_-$ to
$L_+$ by the insertion of this loop

$$\langle W(L_+) \rangle = \langle W(L_-) \rangle + \langle \cdots U(x_1, x)\; i\,\Sigma^{\mu\nu} F^a_{\mu\nu} R^a\; U(x, x_2) \cdots U(x_3, x_4) \cdots \rangle. \tag{8.4.20}$$

Repeating the same steps as before, we find that the insertion of the curvature
tensor is equivalent to taking the derivative with respect to the field, which
in turn pulls down an $R^a$. However, it is important to realize that the $R^a$
factors no longer occur at the same point along the Wilson loop

$$\langle W(L_+) \rangle = \langle W(L_-) \rangle - i\,\frac{4\pi}{k} \sum_a \langle \cdots U(x_1, x) R^a U(x, x_2) \cdots U(x_3, x) R^a U(x, x_4) \cdots \rangle. \tag{8.4.21}$$

Because the $R^a$ matrices are defined at different points along the path, we
must use the Fierz identity on the sum

$$\sum_a R^a_{ij} R^a_{kl} = \frac{1}{2}\, \delta_{il} \delta_{jk} - \frac{1}{2N}\, \delta_{ij} \delta_{kl}. \tag{8.4.22}$$
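
As a quick sanity check of this kind of identity, the following sketch verifies the $SU(N)$ Fierz identity for $N = 2$. It assumes the common normalization $R^a = \sigma^a / 2$ (so that $\mathrm{Tr}\, R^a R^b = \delta^{ab}/2$), for which the identity reads $\sum_a R^a_{ij} R^a_{kl} = \tfrac{1}{2} \delta_{il} \delta_{jk} - \tfrac{1}{2N} \delta_{ij} \delta_{kl}$; a different normalization of the generators rescales both terms. The function names are illustrative.

```python
# Pauli matrices; the generators are assumed to be R^a = sigma^a / 2.
SIGMA = [
    [[0, 1], [1, 0]],
    [[0, -1j], [1j, 0]],
    [[1, 0], [0, -1]],
]
N = 2

def fierz_lhs(i, j, k, l):
    """Sum over a of R^a_{ij} R^a_{kl}."""
    return sum((s[i][j] / 2) * (s[k][l] / 2) for s in SIGMA)

def fierz_rhs(i, j, k, l):
    """(1/2) delta_il delta_jk - (1/2N) delta_ij delta_kl."""
    def delta(a, b):
        return 1.0 if a == b else 0.0
    return 0.5 * delta(i, l) * delta(j, k) - delta(i, j) * delta(k, l) / (2 * N)

max_error = max(
    abs(fierz_lhs(i, j, k, l) - fierz_rhs(i, j, k, l))
    for i in range(N) for j in range(N) for k in range(N) for l in range(N)
)
print(max_error)
```

The first term on the right reconnects the color indices of the two lines, which is what turns a crossing into the uncrossed configuration $L_0$; the second term leaves the lines as they were.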


Notice that this Fierz identity, inserted at the crossing point $x$, has the
effect of changing the $SU(N)$ topology of the graph, so that the Wilson loop
defined at $L_+$ spins off terms related to $L_-$ and $L_0$ as follows:

$$\langle W(L_+) \rangle = \left( 1 + i\,\frac{2\pi}{kN} \right) \langle W(L_-) \rangle - i\,\frac{2\pi}{k}\, \langle W(L_0) \rangle. \tag{8.4.23}$$

Putting everything together, we find that we can satisfy the last of the
generalized skein relations [1, 13, 14]:

$$\beta\, \langle W(L_+) \rangle - \beta^{-1} \langle W(L_-) \rangle = z\, \langle W(L_0) \rangle, \tag{8.4.24}$$

where

$$\beta = 1 - i\,\frac{2\pi}{k}\,\frac{1}{2N} + O\!\left( \frac{1}{k^2} \right), \qquad z = -i\,\frac{2\pi}{k} + O\!\left( \frac{1}{k^2} \right). \tag{8.4.25}$$

All three of the generalized skein relations [Eq. (8.4.10)] are now satisfied,
so that we have a precise relationship (at least to order $1/k^2$) between
correlation functions of Wilson loops and knot polynomials.





In summary, we have been able to obtain new knot invariants, unite the
old ones, and generate analytic expressions for them by using Chern-Simons
theory. Because this theory is generally covariant without the presence of
a metric, its states must be topological and gauge invariants, that is, knot
polynomials.


8.5 Knots and Conformal Field Theory 


It is also possible to use knot theory to analyze conformal field theories directly, 
without having to use a three-dimensional theory. This reveals yet another 
application of knot theory to physics and gives us a powerful tool by which 
to analyze the properties of conformal field theories, such as their correlation 
functions. 

The braid group of Artin emerges quite naturally if one studies the monodromy
properties of N-point functions in conformal field theory, that is, taking the
points $z_i$ of an N-point function and then moving them around loops or
interchanging them. If, for example, we interchange two points $z_i$ and $z_j$
in an N-point function, this is mathematically equivalent to braiding the $i$th
string with the $j$th string. Thus, the N-point functions of conformal field
theory, via monodromy relations, give us a representation of the braid group.

For example, consider the Knizhnik-Zamolodchikov relations derived
earlier for the Kac-Moody algebra in Eq. (3.4.8):

$$\kappa\, \frac{\partial}{\partial z_i} \Psi = \sum_{j \neq i} \frac{t_i \cdot t_j}{z_i - z_j}\, \Psi. \tag{8.5.1}$$

Let us say that there are $I$ solutions to this equation. If we interchange the
$l$th and $(l+1)$th points, then we find a linear combination of these same
solutions [15], that is,

$$\Psi_i(z_1, \ldots, z_{l+1}, z_l, \ldots, z_n) = \sum_{j=1}^{I} B_{ij}\, \Psi_j(z_1, \ldots, z_l, z_{l+1}, \ldots, z_n). \tag{8.5.2}$$

Because the $B_{ij}$ operation simply interchanges the location of two points,
it can be shown that the matrices $B_{ij}$ form representations of the braid
group.

Let us now be concrete about our discussion of the properties of correlation
functions, by writing everything in terms of conformal blocks. We recall from
Chapter 3 that there are two operations that we can perform on these conformal
blocks, called B (corresponding to braiding or interchanging two points) and
F (corresponding to pinching the graph, that is, making an s-channel graph
into a t-channel graph).

On the space of conformal blocks, we wrote a representation of the B-twist
operation on a four-point function, representing the interaction of conformal
fields labeled by $i, j, k, l$:

$$\Phi^{i}_{jp}(z_1)\, \Phi^{p}_{kl}(z_2) = \sum_q B_{pq} \begin{bmatrix} j & k \\ i & l \end{bmatrix} \Phi^{i}_{kq}(z_2)\, \Phi^{q}_{jl}(z_1). \tag{8.5.3}$$

It is easy to see that the effect of the B operation is to interchange $z_1$ and
$z_2$, that is, it is a braiding operator.

Now we wish to relate the Yang-Baxter relationship and knot theory directly 
to conformal field theory. First, we note the similarity between Fig. 8.4 and 
Fig. 6.3, which represents the Yang-Baxter relationship. In fact, we see that 
they are the same. This means that the Yang-Baxter relationship is a specific 
realization of the braid group. 

Next, we notice that the Yang-Baxter relationship can be rewritten as a series 
of braiding operations on conformal blocks, that is, conformal field theory gives 
us a representation of the Yang-Baxter relationship in terms of the braiding 
matrix of conformal field theory. It is now a simple matter to represent the 
Yang-Baxter relationship on the space of braiding matrices by labeling all of 
the legs and singling out each braiding operation. At the end of the calculation, 
we will set the two equations equal to each other. 

By carefully following the strands making up the Yang-Baxter relationship 
in Fig. 8.6 and isolating the braiding matrices, we easily find [16]: 





h h 
j\ h 



(f) Hj 

h U 
h h 


\ h P £] (e) *“ 
<‘>Ma i\ 


h 7*4 

h h _ 


(0 


(0* 


PJ9 


J 2 J 3 
76 7*5 


(e). (8-5.4) 


In summary, we see here the tight relationship between knot theory, the
Yang-Baxter relationship, and the braiding matrices of rational conformal field
theory. We have used the equivalent topology of all three theories to write
an explicit representation of the Yang-Baxter relation in terms of the braiding
matrices of rational conformal field theory. Knot theory has thus proven to
be a powerful tool with which to analyze conformal field theory.

It should be no surprise, therefore, that knot theory is also a useful tool
with which to analyze statistical mechanical systems. Specifically, we will
show that knot theory gives us a new way in which to view the Yang-Baxter
relations, which we saw were the basis for the integrability of two-dimensional
statistical mechanical systems. To see how knot theory gives us a way in which
to reanalyze statistical mechanical systems, recall that in Chapter 6, we found
that there were two large classes of models, the vertex models and the IRF
models, and that the essence of commuting transfer matrices was the Yang-Baxter
relationship.

Now, let us probe the topological structure of the Yang-Baxter relationship.
Because the Yang-Baxter relationship is a tangle of indices, it helps to
introduce operators that are defined in the space of lattice indices, such that
the tangle of indices disappears. Let us now introduce a Yang-Baxter operator
$X_i(u)$, which we will show is identical to the braid operator. We will define
this operator as a generalization of the Boltzmann matrix shown above, operating
on the space of the lattice. Thus, it will consist of the Boltzmann matrix
multiplied by the unit matrix defined on the lattice. For the vertex model,
written in terms of the S matrix, this operator is explicitly

$$X_i(u) = \sum_{k,l,m,p} S^{km}_{lp}(u)\; I^{(1)} \otimes \cdots \otimes e^{(i)}_{pk} \otimes e^{(i+1)}_{ml} \otimes \cdots \otimes I^{(n)}, \tag{8.5.5}$$

where $I^{(i)}$ is the identity matrix at the $i$th position, and
$(e_{nk})_{ab} = \delta_{na} \delta_{kb}$. For the IRF representation, the
Yang-Baxter operator is also a series of delta functions multiplied by the
Boltzmann weight

$$[X_i(u)]^{l_0 \cdots l_n}_{p_0 \cdots p_n} = \prod_{j=0}^{i-1} \delta(p_j, l_j)\; w(l_i, l_{i+1}, p_i, l_{i-1}; u) \prod_{j=i+1}^{n} \delta(p_j, l_j). \tag{8.5.6}$$

(In other words, $X_i$ consists of a product of delta functions except for the
$i$th entry, which consists of the $w$ matrix.)

Then, the Yang-Baxter operator satisfies the relations

$$\begin{aligned}
X_i(u) X_j(v) &= X_j(v) X_i(u), \qquad |i - j| \geq 2, \\
X_i(u) X_{i+1}(u + v) X_i(v) &= X_{i+1}(v) X_i(u + v) X_{i+1}(u).
\end{aligned} \tag{8.5.7}$$

For $u = u + v = v$, we find precisely the braid group relations of Artin. We
will, in fact, take the limit $u, v \to \infty$ and set

$$\begin{aligned}
U_i &= \lim_{u \to \infty} X_i(u) / \rho(u), \qquad i = 1, 2, \ldots, n, \\
U_i^{-1} &= \lim_{u \to \infty} X_i(-u) / \rho(-u), \qquad i = 1, 2, \ldots, n,
\end{aligned} \tag{8.5.8}$$

so we arrive at

$$\begin{aligned}
U_i U_j &= U_j U_i, \qquad |i - j| \geq 2, \\
U_i U_{i+1} U_i &= U_{i+1} U_i U_{i+1}.
\end{aligned} \tag{8.5.9}$$

Notice that these relations form Artin's braid group found earlier in
Eq. (8.2.6).

Thus, the Yang-Baxter relationship has now been reduced to the relations
of braid group theory, which have proven to be one of the essential ingredients
in the study of knots.
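
Artin's relations can be checked on an explicit matrix representation. The sketch below is an illustration under stated assumptions: it uses the well-known $4 \times 4$ braid matrix associated with the six-vertex model (written with a parameter `q`), tensored with identities to act on neighboring two-state sites. The specific matrix, the value of `q`, and all helper names are assumptions chosen for the example rather than expressions quoted from the text.

```python
def kron(a, b):
    """Kronecker product of two matrices stored as nested lists."""
    return [[x * y for x in ra for y in rb] for ra in a for rb in b]

def matmul(a, b):
    cols = list(zip(*b))
    return [[sum(x * y for x, y in zip(row, col)) for col in cols] for row in a]

def identity(n):
    return [[1.0 if i == j else 0.0 for j in range(n)] for i in range(n)]

def braid_matrix(q):
    """Six-vertex braid matrix in the basis (11, 12, 21, 22); assumed form."""
    a = q - 1 / q
    return [[q, 0, 0, 0],
            [0, a, 1, 0],
            [0, 1, 0, 0],
            [0, 0, 0, q]]

def generator(i, n, q):
    """U_i acting on sites i and i+1 (1-based) of a chain of n two-state sites."""
    left = identity(2 ** (i - 1))
    right = identity(2 ** (n - i - 1))
    return kron(kron(left, braid_matrix(q)), right)

def max_diff(a, b):
    return max(abs(x - y) for ra, rb in zip(a, b) for x, y in zip(ra, rb))

q, n = 1.7, 4
u1, u2, u3 = (generator(i, n, q) for i in (1, 2, 3))
# U_1 U_2 U_1 = U_2 U_1 U_2
braid_error = max_diff(matmul(matmul(u1, u2), u1), matmul(matmul(u2, u1), u2))
# U_1 U_3 = U_3 U_1, since |1 - 3| >= 2
far_error = max_diff(matmul(u1, u3), matmul(u3, u1))
print(braid_error, far_error)
```

Both differences vanish up to rounding, so these matrices furnish a representation of the braid group on four strands.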





8.6 New Knot Invariants from Physics 


The Jones polynomial, after its discovery, quickly led to other proposals for 
new knot polynomials. There exists, however, a powerful way of generating 
new knot polynomials that comes from statistical mechanics. In fact, an infinite 
number of new knot polynomials can be generated in this fashion [17, 18]. 

The key to this program is the use of Markov moves [19]. If two closed
braids represent the same link, then it is possible to deform one link into the
other link by a succession of these Markov moves. Let $A$ and $B$ be elements in
Artin's braid group $B_n$. Then equivalent braids expressing the same link can
be mutually transformed by successive applications of two types of operations,
called Type I and Type II moves

$$\begin{aligned}
\text{Type I}: &\quad AB \to BA, \\
\text{Type II}: &\quad A \to A U_n, \quad A \to A U_n^{-1},
\end{aligned} \tag{8.6.1}$$

for $A, B \in B_n$, where the products with $U_n$ are defined to exist within
the braid group $B_{n+1}$.

Given Markov moves on equivalent braids, we now have sufficient information
to construct the desired link polynomial. For a link polynomial to be a
topological invariant, it must obey the relationships created by Markov moves.
Thus, any link polynomial, to be a topological invariant, must satisfy the
following properties:

$$\begin{aligned}
\alpha(AB) &= \alpha(BA), \\
\alpha(A U_n) &= \alpha(A U_n^{-1}) = \alpha(A).
\end{aligned} \tag{8.6.2}$$

The important point is to notice that any nontrivial function $\alpha$ that
obeys these rules is, via Markov moves, guaranteed to be a topological
invariant. If two links have different values of $\alpha$, then they must, by
construction, be topologically inequivalent. (However, as before, the converse
may not be true, that is, this is still not powerful enough to establish that
the correspondence between classes of topologically equivalent knots and
topological invariants is one-to-one.)

The next step is to notice that Artin’s braid relation has precisely the same 
topology as the Yang-Baxter relationship. Thus, the transfer matrices of statis¬ 
tical mechanics, because they have the same topological structure as the braid 
group, can be used to define new knot polynomials. The key step is to define, 
as with the Jones polynomial, a trace operation on the transfer matrices that 
obey the braid relations. Then, by construction, this trace must be the basis of 
a new knot polynomial. 

Let us define, for the moment, a linear functional $\phi$ that satisfies

$$\begin{aligned}
\phi(AB) &= \phi(BA), \\
\phi(A U_n) &= \tau\, \phi(A), \\
\phi(A U_n^{-1}) &= \bar\tau\, \phi(A),
\end{aligned} \tag{8.6.3}$$

where the constants $\tau$ and $\bar\tau$ are given by

$$\tau = \phi(U_i), \qquad \bar\tau = \phi(U_i^{-1}) \qquad \text{for all } i. \tag{8.6.4}$$

Then, the desired value of the knot polynomial is given by [17]:

$$\alpha(A) = (\tau \bar\tau)^{-(n-1)/2}\, (\bar\tau / \tau)^{e(A)/2}\, \phi(A), \tag{8.6.5}$$

where $e(A)$ is equal to the sum of the exponents appearing in the braid
representation of $A$.

Our goal, therefore, is to find a solution for the trace operation in terms of
two-dimensional quantum systems. The key to this will be the Yang-Baxter
relationship.

We now have an explicit representation of braid operators in terms of
Yang-Baxter operators for the N-state vertex model. This, in turn, allows us to
introduce the $\phi$ trace operator, which will allow us to create a polynomial
that is invariant under Markov moves.

The last step is filled by noticing that $\phi$ can be represented by the
ordinary trace over the transfer matrix

$$\phi(A) = \mathrm{Tr}(HA), \qquad A \in B_n, \tag{8.6.6}$$

for

$$H = h^{(1)} \otimes h^{(2)} \otimes \cdots \otimes h^{(n)}, \tag{8.6.7}$$

where 



( 8 . 6 . 8 ) 


(8.6.9) 


Thus, the final value of the invariant knot polynomial, in terms of transfer
matrices, is given by [17]:

$$\alpha(A) = \left[ t^{-(N-1)/2} \left( 1 + t + \cdots + t^{N-1} \right) \right]^{n-1} \left[ t^{(N-1)/2} \right]^{e(A)} \mathrm{Tr}(HA). \tag{8.6.10}$$

Our strategy is now simple. The trace operation in Eq. (8.6.10) is guaranteed
to generate knot invariants, by construction. Our task is simply to look up
the various S matrices that have been computed in the past for various
two-dimensional statistical models and insert them into Eq. (8.5.5), giving us
the generators $X_i(u)$ of the braid group constructed out of the S matrix.
Then, we select a particular knot, reexpress it in terms of braiding operators,
and insert them into the trace operation in Eq. (8.6.10). In this way, we are
using the S matrices found in statistical mechanics as a way in which to
generate braiding operators with a trace operation defined on them.

For example, for the N = 2 model (corresponding to the six-vertex model), 
we find the following S matrix 

*.1/21/2, ^ _ sinh(A. — u) 

W(M) " sinh(A) * (8.6.11) 

c-l/2-1/2 _ , 

1 / 21/2 — L - 

Inserting this S matrix into Eq. (8.5.5) and then inserting the resulting
braiding operators into Eq. (8.6.10), we find that we can rederive the old Jones
polynomial.
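
The $N = 2$ construction can be sketched numerically. The code below is a hedged illustration, not the book's own calculation: it assumes the braid generators are $U_i = q^{-3} \check R$ with the standard six-vertex braid matrix $\check R$ and $t = q^2$, and it assumes $h = \mathrm{diag}(1, t)/(1 + t)$, a choice for which $\tau = 1/(1+t)$ and $\bar\tau = t/(1+t)$. These normalizations and every function name are assumptions made for the example. With them, the trace formula of Eq. (8.6.10) gives 1 for the unknot and $t^{-1} + t^{-3} - t^{-4}$ for the braid $U_1^3$, which is the Jones polynomial of a trefoil in one of the two common conventions for $t$.

```python
import math

def kron(a, b):
    return [[x * y for x in ra for y in rb] for ra in a for rb in b]

def matmul(a, b):
    cols = list(zip(*b))
    return [[sum(x * y for x, y in zip(row, col)) for col in cols] for row in a]

def identity(n):
    return [[1.0 if i == j else 0.0 for j in range(n)] for i in range(n)]

def braid_generator(i, n, q, inverse=False):
    """Assumed normalization: U_i = q^-3 * R on strands i, i+1 of n strands."""
    a = q - 1 / q
    r = [[q, 0, 0, 0], [0, a, 1, 0], [0, 1, 0, 0], [0, 0, 0, q]]
    if inverse:
        # Hecke relation R^2 = a R + 1 implies R^-1 = R - a * identity.
        r = [[r[x][y] - (a if x == y else 0) for y in range(4)] for x in range(4)]
        scale = q ** 3
    else:
        scale = q ** -3
    u = [[scale * v for v in row] for row in r]
    return kron(kron(identity(2 ** (i - 1)), u), identity(2 ** (n - i - 1)))

def link_invariant(word, n, t):
    """alpha(A) from the N = 2 trace formula for a braid word in B_n.

    `word` lists generator indices; a negative entry means the inverse generator.
    """
    q = math.sqrt(t)
    a_mat = identity(2 ** n)
    for g in word:
        a_mat = matmul(a_mat, braid_generator(abs(g), n, q, inverse=g < 0))
    h = [[1 / (1 + t), 0.0], [0.0, t / (1 + t)]]
    big_h = [[1.0]]
    for _ in range(n):
        big_h = kron(big_h, h)
    size = 2 ** n
    trace = sum(big_h[r][c] * a_mat[c][r] for r in range(size) for c in range(size))
    exponent_sum = sum(1 if g > 0 else -1 for g in word)
    return (t ** -0.5 * (1 + t)) ** (n - 1) * (t ** 0.5) ** exponent_sum * trace

t = 1.7
print(link_invariant([1], 2, t))            # unknot written as U_1 in B_2
print(link_invariant([1, 1, 1], 2, t))      # trefoil written as U_1^3 in B_2
print(link_invariant([1, 1, 1, -2], 3, t))  # same link after a Type II move
```

The last two values agree, which is the Type II Markov invariance that the prefactors in the trace formula are designed to enforce.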

The great advantage of using this statistical mechanical construction, how¬ 
ever, is that there are an infinite number of such models, labeled by N, that 
allow us to generate new knot polynomials beyond the Jones polynomial. For 
example, for N = 3 (corresponding to the 19-vertex model), we find 


S] ,‘(w) = 
5qo (“) = 


^oo( M ) — 


sinh(A, — u) sinh(2^ — u) 

sinh X sinh(2A.) 
sinh(w) sinh(A. — u) 

sinh X sinh 2X 

sinh X sinh 2X — sinh u sinh(X — u) 
sinh X sinh 2X 


si, }(u) = 1, 


So?(«) 


sinh(A — u) 
sinh A. 


( 8 . 6 . 12 ) 


The power of this infinite set of knot polynomials is that they can distinguish 
between knots where the Jones polynomial fails. For example, Birman has 
shown that the two knots given in Fig. 8.7 by Artin’s braid relations have the 
same Jones polynomial 


A = (i/ lW ,)V^. (86n) 

B = U^Ul 1 . 

The Jones polynomial for both these knots is given by

$$\begin{aligned}
V_A = V_B = t^{3} \big( &t^{18} - t^{17} + 2t^{16} - 3t^{15} + 4t^{14} - 5t^{13} + 6t^{12} \\
&- 6t^{11} + 6t^{10} - 6t^{9} + 6t^{8} - 5t^{7} + 6t^{6} - 4t^{5} \\
&+ 4t^{4} - 3t^{3} + 2t^{2} - t + 1 \big).
\end{aligned} \tag{8.6.14}$$

FIGURE 8.7.

However, for $N = 3$, these two knots have different knot invariants.
Specifically [17]:

$$\begin{aligned}
\alpha(A) = t^{10} (&1, -1, 0, 2, -2, -1, 4, -2, -2, 4, -3, 0, 5, -7, 1, 11, -11, \\
&-4, 16, -8, -9, 14, -4, -8, 11, -3, -7, 12, -5, -7, 13, -7, -6, \\
&14, -7, -9, 16, -3, -10, 10, 1, -5, 4, -1, -1, 4, -4, -1, 5, -2, \\
&-2, 3, 0, -1, 1), \\
\alpha(B) = t^{12} (&1, -1, 0, 2, -2, -1, 4, -3, -2, 6, -4, -2, 8, -4, -4, 10, \\
&-6, -5, 12, -7, -5, 12, -7, -5, 13, -7, -5, 12, -7, -5, 12, \\
&-6, -6, 12, -5, -6, 12, -4, -6, 10, -3, -6, \\
&8, -2, -5, 6, -1, -3, 4, 0, -2, 2, 0, -1, 1),
\end{aligned} \tag{8.6.15}$$

where we only present the coefficients of each power of $t$, beginning with
$t^{54}$ for knots $A$ and $B$. In summary, we have succeeded in using the S
matrix found in vertex models in statistical mechanics to generate braiding
operators and a trace operation, out of which an invariant knot polynomial can
be constructed that gives us an infinite family of invariants beyond the Jones
polynomial.


8.7 Knots and Quantum Groups 

So far, our discussion of the Yang-Baxter relation has been rather ad hoc and 
not very systematic. In studying the symmetry properties of the Yang-Baxter 
equation and knots, one feels that there must be a deeper, underlying group 
theoretical origin to many of the relation’s miraculous properties. In fact, there 
is indeed a rich mathematical structure, called quantum groups [20-24], that 
accounts for many of the properties of the Yang-Baxter relation and gives us a 
systematic way in which to analyze them. It also gives us explicit
representations of the polynomial 6j equations studied in Chapter 3.

The name "quantum group" comes from the very specific way in which
the Yang-Baxter relationship is realized. We recall that the Yang-Baxter
relationship can be written as

$$R_{12}(u)\, R_{13}(u - v)\, R_{23}(v) = R_{23}(v)\, R_{13}(u - v)\, R_{12}(u), \tag{8.7.1}$$

where $R$, of course, is a matrix, and we have suppressed indices. Because the
Yang-Baxter relation gives solutions to complex two-dimensional quantum
mechanical systems, it will be a function of Planck's constant. Therefore,
sometimes the previous relationship is called the quantum Yang-Baxter relation.

In the limit of small $\hbar$, however, one should be able to obtain the
classical limit of the relation. Let us, therefore, power expand the $R$ matrix
in terms of Planck's constant

$$R_{ij}(u) = 1 - i\hbar\, r_{ij}(u) + O(\hbar^2). \tag{8.7.2}$$

Then, as a function of the $r$ matrix, the Yang-Baxter relationship reduces to

$$[r_{12}(u), r_{13}(u - v)] + [r_{12}(u), r_{23}(v)] + [r_{13}(u - v), r_{23}(v)] = 0. \tag{8.7.3}$$

This latter relationship is called the classical Yang-Baxter relation. In the
limit that $\hbar$ goes to zero, the quantum and the classical Yang-Baxter
relations become equivalent.

The important thing to notice is that the classical Yang-Baxter relationship 
is expressed in terms of ordinary commutators. In fact, we see that the classical 
Yang-Baxter relation can be expressed as the Jacobi identity associated with 
some Lie algebra. Thus, a classification of the classical Yang-Baxter equations 
in terms of standard Lie algebras is possible. 

The full quantum Yang-Baxter relation [Eq. (8.7.1)], however, cannot be
written in terms of commutators and is therefore considerably more complicated.
Lie algebras cannot express the full quantum Yang-Baxter equation written in
terms of $R$. However, because the limit $\hbar \to 0$ exists, one suspects
that a generalization, or deformation, of a Lie algebra-type structure must
exist for the full quantum Yang-Baxter relation, which reduces to the usual one
in the limit as $\hbar \to 0$.

It turns out that this conjecture is correct, and the generalization of the Lie
group is the quantum group. The quantum group is to the quantum Yang-Baxter
relation as the classical Lie group is to the classical Yang-Baxter equation

$$\begin{aligned}
\text{Lie group} &\to \text{classical Yang-Baxter}, \\
\text{quantum group} &\to \text{quantum Yang-Baxter}.
\end{aligned} \tag{8.7.4}$$

Notice that the quantum group is necessarily a function of $\hbar$, and hence,
possesses a smooth limit in which it reduces back to the usual classical Lie
group. We will find it convenient to introduce a parameter $q$, which depends
on $\hbar$ in such a way that $q \to 1$ as $\hbar \to 0$, so that

$$\lim_{q \to 1} \text{quantum group} = \text{Lie group}. \tag{8.7.5}$$

A quantum group must share many similarities with an ordinary classical 
Lie group, but it must also differ from the Lie group in subtle but important 
ways. There are at least two intuitive ways in which to see these differences. 

First, we notice that the conformal field theory formed out of a Kac-Moody 
algebra is based entirely on its primary fields and its fusion rules. The primary 
fields, in turn, are labeled by the irreducible representations of the ordinary 
classical Lie group. Thus, much of the information and detail contained within 
the Kac-Moody algebra is actually washed out in the process of constructing 
the conformal field theory. We only see remnants of the full Kac-Moody alge¬ 
bra in the fusion rules. Thus, at the conformal field theory level, the resulting 
group structure resembles that of an ordinary Lie group in many ways. How¬ 
ever, the resemblance cannot be exact because the original Kac-Moody algebra 
depended on central charges, while ordinary Lie algebras are not compatible 
with central charges, that is, the Jacobi identity of a classical Lie algebra cannot 
accommodate the c-number term. Because the conformal field theory associ¬ 
ated with a Kac-Moody algebra depends on k and c in important ways, the 
reduced system cannot, therefore, define a classical Lie group. Although most 
of the structure of a Kac-Moody algebra is lost in the transition to the fusion 
rules, the central charges still make their presence felt in many important ways 
in the conformal field theory. 

Second, another way to see the subtle difference between classical and
quantum groups is to notice that their braiding operations differ in a small
but important way. For a classical Lie group, the braiding operation simply
interchanges the representations forming the tensor product of two
representations. The braiding operation thus picks up factors of +1 or -1,
depending on whether we have symmetric or antisymmetric combinations of the two
representations. For example, tensor products can be constructed on the basis
of the Young tableau, where composite representations are based on
symmetrization or antisymmetrization. However, for a quantum group, the
braiding operation picks up crucial phase factors. In Eq. (3.5.13), we recall,
the braiding operator $\Omega$ applied to the tensor product of two primary
fields picked up phase factors, which were functions of the conformal weights
of the various representations. Because of the presence of these crucial phase
factors, we know that the group structure underlying the conformal field theory
of a Kac-Moody algebra cannot be an ordinary classical Lie group.

Now that we have established the differences and similarities between
classical and quantum groups, let us be specific. Let us take the deformed
algebra $SL(2)_q$ and show how it is related to the quantum Yang-Baxter
relation and how it may generate solutions to the polynomial equations. The
commutators of $SL(2)_q$ are given by the following:

$$[J_3, J_\pm] = \pm 2 J_\pm, \qquad [J_+, J_-] = [J_3], \tag{8.7.6}$$

where, by convention, the brackets in $[J_3]$ mean

$$[x] = \frac{q^{x/2} - q^{-x/2}}{q^{1/2} - q^{-1/2}}. \tag{8.7.7}$$

The algebra of the quantum group $SL(2)_q$ reduces to the usual algebra in
the limit $q \to 1$. This limit corresponds to taking Planck's constant to zero.
This also gives us a powerful way to see intuitively how quantum groups differ
from classical ones.
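
The classical limit of the bracket can be seen directly. The following is a small sketch, assuming the bracket is defined as $[x] = (q^{x/2} - q^{-x/2}) / (q^{1/2} - q^{-1/2})$ for real $q > 0$, $q \neq 1$; the function name `q_number` is illustrative.

```python
def q_number(x, q):
    """Quantum bracket [x] = (q^(x/2) - q^(-x/2)) / (q^(1/2) - q^(-1/2))."""
    return (q ** (x / 2) - q ** (-x / 2)) / (q ** 0.5 - q ** -0.5)

# As q approaches 1, the bracket [3] approaches the ordinary number 3.
for q in (2.0, 1.1, 1.001):
    print(q, q_number(3, q))
```

For integer $x$ the bracket is a symmetric polynomial in $q^{1/2}$ and $q^{-1/2}$; for example, $[2] = q^{1/2} + q^{-1/2}$ and $[3] = q + 1 + q^{-1}$.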

The representation of any quantum group necessarily mimics many of the 
properties of the usual group. In fact, many of the corresponding representa¬ 
tions for the quantum group are usually found by taking the usual representation 
and replacing all c numbers by their quantum analog given by the brackets. 

For example, the irreducible representations of $SL(2)_q$ are labeled by
integers or half-integers $j$ and have dimensions $2j + 1$. The representation
space of the group is spanned by $|j, m\rangle$, just as in the ordinary case,
and

$$J_\pm |j, m\rangle = \sqrt{[j \mp m]\,[j \pm m + 1]}\; |j, m \pm 1\rangle. \tag{8.7.8}$$
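
As a hedged check, the sketch below builds the matrices of $J_\pm$ in the spin-$j$ representation and verifies the deformed commutator $[J_+, J_-] = [J_3]$. It assumes the convention in which $J_3 |j, m\rangle = 2m\, |j, m\rangle$ (so that $[J_3, J_\pm] = \pm 2 J_\pm$), the bracket $[x] = (q^{x/2} - q^{-x/2}) / (q^{1/2} - q^{-1/2})$, and matrix elements $\sqrt{[j \mp m][j \pm m + 1]}$ for $J_\pm$. The function names and the ordering of the basis are choices made for this example.

```python
import math

def q_number(x, q):
    return (q ** (x / 2) - q ** (-x / 2)) / (q ** 0.5 - q ** -0.5)

def spin_matrices(j, q):
    """Return J_+, J_-, and [J_3] in the basis |j, m>, m = j, j-1, ..., -j."""
    dim = int(round(2 * j)) + 1
    ms = [j - i for i in range(dim)]
    j_plus = [[0.0] * dim for _ in range(dim)]
    j_minus = [[0.0] * dim for _ in range(dim)]
    for col, m in enumerate(ms):
        if col > 0:  # J_+ |j, m> is proportional to |j, m+1>
            j_plus[col - 1][col] = math.sqrt(q_number(j - m, q) * q_number(j + m + 1, q))
        if col < dim - 1:  # J_- |j, m> is proportional to |j, m-1>
            j_minus[col + 1][col] = math.sqrt(q_number(j + m, q) * q_number(j - m + 1, q))
    # J_3 has eigenvalue 2m in the assumed convention, so [J_3] is diag([2m]).
    bracket_j3 = [[q_number(2 * ms[r], q) if r == c else 0.0 for c in range(dim)]
                  for r in range(dim)]
    return j_plus, j_minus, bracket_j3

def matmul(a, b):
    cols = list(zip(*b))
    return [[sum(x * y for x, y in zip(row, col)) for col in cols] for row in a]

def commutator_error(j, q):
    jp, jm, b3 = spin_matrices(j, q)
    pm, mp = matmul(jp, jm), matmul(jm, jp)
    dim = len(b3)
    return max(abs(pm[r][c] - mp[r][c] - b3[r][c]) for r in range(dim) for c in range(dim))

print(commutator_error(1.5, 1.3))
```

The check rests on the identity $[a][b+1] - [b][a+1] = [a - b]$, which is the deformed counterpart of the elementary identity $a(b+1) - b(a+1) = a - b$.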


In fact, if we label the various representations by $V^j$, then the usual
Clebsch-Gordan tensor product decomposition remains the same

$$V^{j_1} \otimes V^{j_2} = \bigoplus_{j = |j_1 - j_2|}^{j_1 + j_2} V^{j}. \tag{8.7.9}$$

For these representations, $V^j$, we will now introduce two operations that
have their direct analog in the usual Lie group theory. Let the operator
$K^{j}_{j_1 j_2}$ project the $j$th representation out of the tensor product of
the $j_1$ and $j_2$ representations. In other words

$$K^{j}_{j_1 j_2} : V^{j_1} \otimes V^{j_2} \to V^{j}. \tag{8.7.10}$$

This is the counterpart of the usual Clebsch-Gordan coefficient.

Similarly, let us introduce the operator $R^{j_1 j_2}$ to be the braiding
operator that interchanges 1 and 2:

$$R^{j_1 j_2} : V^{j_1} \otimes V^{j_2} \to V^{j_2} \otimes V^{j_1}. \tag{8.7.11}$$



As we mentioned earlier, the quantum braiding operator differs from the usual
braiding operator found in classical Lie group theory because it picks up
important phase factors when acting on tensor products.

We now wish to construct identities for the $K$ and $R$ operators. We define

$$\begin{aligned}
R^{j_1 j_2} R^{j_1 j_3} R^{j_2 j_3} &= R^{j_2 j_3} R^{j_1 j_3} R^{j_1 j_2}, \\
R^{j j_3} K^{j}_{j_1 j_2} &= K^{j}_{j_1 j_2} R^{j_1 j_3} R^{j_2 j_3}, \\
K^{j}_{j_1 j_2} R^{j_2 j_1} &= (-1)^{j_1 + j_2 - j}\, q^{(c_j - c_{j_1} - c_{j_2})/2}\, K^{j}_{j_2 j_1},
\end{aligned} \tag{8.7.12}$$

with $c_j = j(j+1)$. We recognize the first relation as the Yang-Baxter
equation. The first two relations are operator expressions acting on the tensor
product of three representations

$$V^{j_1} \otimes V^{j_2} \otimes V^{j_3}. \tag{8.7.13}$$

The last relation can be understood by examining it graphically, that is, it
represents the twisting of the two legs of a three-vertex.

The advantage of using quantum groups is that they give us an explicit
expression for the "universal R matrix":

$$R = q^{J_3 \otimes J_3 / 4} \sum_{n \geq 0} \frac{(1 - q^{-1})^n}{[n]!}\, q^{-n(n-1)/4} \left( q^{n J_3 / 4} (J_+)^n \right) \otimes \left( q^{-n J_3 / 4} (J_-)^n \right). \tag{8.7.14}$$

The $R^{j_1 j_2}$ matrix can be defined in terms of the universal R matrix by
evaluating it on the tensor product $V^{j_1} \otimes V^{j_2}$ (and permuting
indices).

Let us compare these relations for the $K$ operation with the twist of two legs
of a three-point function. In the WZW model $SU(2)_k$, we find that twisting
two legs yields a phase factor

$$(-1)^{j_1 + j_2 - j} \exp\left[ i\pi \left( \Delta_j - \Delta_{j_1} - \Delta_{j_2} \right) \right], \tag{8.7.15}$$

where the conformal weight of the primary field is given by

$$\Delta_j = \frac{j(j+1)}{k+2}. \tag{8.7.16}$$

Comparing the two expressions for twisting the legs of the three-point function
for $SL(2)_q$ in Eq. (8.7.12) and the WZW model in Eq. (8.7.15), we are then
led to postulate that they are the same, provided we make the identification

$$q = e^{2\pi i/(k+2)}. \tag{8.7.17}$$


In fact, by examining other identities as well, we can show that the braiding 
properties of the WZW model at level k are determined by the representation 
theory of quantum groups if we make the above correspondence. As we men¬ 
tioned, the presence of the crucial phase factors in the braiding operation in 
Eq. (8.7.12) separates a quantum group from an ordinary Lie group. 
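
The identification can be illustrated numerically. The sketch below assumes that the quantum-group twist carries the phase $q^{(c_j - c_{j_1} - c_{j_2})/2}$ with $c_j = j(j+1)$, that the WZW twist carries $\exp[i\pi(\Delta_j - \Delta_{j_1} - \Delta_{j_2})]$ with $\Delta_j = j(j+1)/(k+2)$, and that $q = e^{2\pi i/(k+2)}$; the common sign $(-1)^{j_1 + j_2 - j}$ is omitted from both sides. The function names are illustrative.

```python
import cmath

def casimir(s):
    """c_j = j(j+1), the assumed exponent appearing in the quantum-group twist."""
    return s * (s + 1)

def twist_phase_quantum_group(j, j1, j2, k):
    q = cmath.exp(complex(0, 2 * cmath.pi / (k + 2)))
    return q ** ((casimir(j) - casimir(j1) - casimir(j2)) / 2)

def twist_phase_wzw(j, j1, j2, k):
    weights = [casimir(s) / (k + 2) for s in (j, j1, j2)]
    return cmath.exp(complex(0, cmath.pi * (weights[0] - weights[1] - weights[2])))

for k in (1, 3, 10):
    a = twist_phase_quantum_group(1, 0.5, 0.5, k)
    b = twist_phase_wzw(1, 0.5, 0.5, k)
    print(k, abs(a - b))
```

As $k$ grows, both phases approach 1, which is the numerical counterpart of the statement that the deformation disappears as $q \to 1$.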

We can also make a trivial check on the relationship between quantum
groups and $SU(2)_k$. If we take the limit $q \to 1$, the quantum group reduces
to an ordinary group. For the WZW model, we find from Eq. (8.7.17) that the
corresponding limit is $k \to \infty$, which is also the limit in which a
Kac-Moody algebra reduces to an ordinary one.

This relationship is quite remarkable, and once again it reveals the richness
of conformal systems. On the left is the parameter labeling the quantum group
$SL(2)_q$, and on the right, we have an expression labeling the level number $k$
of $SU(2)_k$. Apparently, there is a deep relationship between these two, which
enables us to compute the values of the $B$ and $F$ matrices exactly.

In fact, with a bit of work, we can find the exact numerical value of the
$B$ and $F$ matrices for $SL(2)_q$. In this way, we can make a correspondence
between the $B$ and $F$ matrices of $SU(2)_k$ and the $3j$ and $6j$ coefficients for
$SL(2)_q$. For example, it is possible to show that the $B$ and the $F$ matrices can
be represented exactly as follows:


71 J2 J 
h U 7 ' 


f "[ j > *]=[/ 


(Cji+Cji-Cj-Cj ')/ 2 


J2 J 1 J 

h U /’ 


(8.7.18) 


where the matrices on the right are the deformed analog of the usual $3j$ and $6j$
symbols found in ordinary quantum mechanics. Specifically, these deformed
$3j$ and $6j$ symbols can be written as [24]:


h h H = y/V-jn + 1 ][2 J23 + l](-iy ,+jW_;i2 

_ 73 7 723 J 

x A(y*i, y 2 » J12)A(7*3, 7, 7 i 2 )A(7i, 7, 7 2 3)A(7 3 , ji, 723) 

x + l V-{[z - h ~ H ~ jnV-[z - h ~ J - 712]! 

z>0 

x [z - 71 - j - jxV-lz - h ~ h ~ jnV-lji + h + h + j — zV- 
x [ji + h + 723 — £]!L/2 + j + 712 + 723 — z]l} , (8.7.19) 

where 

\[ \Delta(a, b, c) = \sqrt{[-a + b + c]!\,[a - b + c]!\,[a + b - c]!/[a + b + c + 1]!}. \tag{8.7.20} \]


Thus, it seems near miraculous that the deformation of the $3j$ and $6j$
symbols found in ordinary quantum mechanics can provide us with explicit
representations of the $B$ and $F$ matrices found in conformal field theory.

We have tried to present a discussion of quantum groups that stresses the 
intuitive relationship between them and Kac-Moody algebras. However, before 
concluding our remarks about quantum groups, let us generalize some of our 
previous statements. 

The algebra of a quantum group is a special case of a larger class of algebras,
that is, an associative Hopf algebra $A$, which has three operations that
generalize the usual operations of multiplication and of taking the inverse:

(1) The relation $\Delta$ (comultiplication) maps

\[ \Delta : A \to A \otimes A, \qquad \Delta(ab) = \Delta(a)\Delta(b). \tag{8.7.21} \]

Here, $\Delta$ is the generalization of the addition of angular momentum. This
rule defines the associativity of comultiplication.

(2) The relation $\gamma$ (antipode) maps

\[ \gamma : A \to A, \qquad \gamma(ab) = \gamma(b)\gamma(a). \tag{8.7.22} \]

This rule defines the antipode, which is the generalization of the inverse.

(3) The relation $\epsilon$ (co-unit) maps

\[ \epsilon : A \to \mathbb{C}, \qquad \epsilon(ab) = \epsilon(a)\epsilon(b), \tag{8.7.23} \]

where $a, b \in A$.


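These three relations satisfy the following:

\[ \begin{aligned}
(\mathrm{id} \otimes \Delta)\Delta(a) &= (\Delta \otimes \mathrm{id})\Delta(a), \\
m(\mathrm{id} \otimes \gamma)\Delta(a) &= m(\gamma \otimes \mathrm{id})\Delta(a) = \epsilon(a) 1, \\
(\epsilon \otimes \mathrm{id})\Delta(a) &= (\mathrm{id} \otimes \epsilon)\Delta(a) = a,
\end{aligned} \tag{8.7.24} \]

where $m$ is the multiplication in the algebra. This general Hopf algebra,
based on these multiplication rules, becomes a “quasi-triangular Yang-Baxter
algebra” if we make a few more restrictions on these operations.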

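Let $\sigma$ represent the permutation map

\[ \sigma(x \otimes y) = y \otimes x. \tag{8.7.25} \]

Then, we realize that $\Delta$ and $\Delta' = \sigma \cdot \Delta$ are two different comultiplication
operators. First, we define the universal $R$ as the operator that establishes the
link between $\Delta$ and $\sigma \cdot \Delta$ by conjugation

\[ \sigma \cdot \Delta(a) = R \Delta(a) R^{-1}. \tag{8.7.26} \]

Second, we impose the following conditions:

\[ \begin{aligned}
(\mathrm{id} \otimes \Delta)(R) &= R_{13} R_{12}, \\
(\Delta \otimes \mathrm{id})(R) &= R_{13} R_{23}, \\
(\gamma \otimes \mathrm{id})(R) &= R^{-1}
\end{aligned} \tag{8.7.27} \]

[i.e., $(\mathrm{id} \otimes \Delta)(R) \in A \otimes A \otimes A$, and $R_{13}$ acts as the identity on the second
factor and as $R$ in the first and third factors.]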

These definitions, of course, exist independent of conformal field theory
and were introduced because they generalize the usual definitions of
multiplication, etc. Their application to conformal field theory arises as follows. In
ordinary Lie group theory, when taking the tensor product of a large number
of representations, we would like to know when the resulting product yields
irreducible representations and how many copies of each irreducible
representation appear. In general, this is a difficult question. However, one convenient
way in which to analyze this question is to construct the “centralizer.”

If we are taking tensor products of representations $V^j$, then let $A$ and $B$
be two algebras that act on these spaces and map $V \to V$. Then, roughly
speaking, $B$ is the centralizer of the action of $A$ if it commutes with the action
of $A$ [24]. For ordinary Lie groups, this usually means that the centralizer
$B$ is the set of braiding operations generated by permutations on the factors
appearing in the tensor product $V \otimes V \otimes V \otimes \cdots$. The important point is
that the irreducible representations appearing in this large tensor product are
labeled by the irreducible representations of $B$. This gives us a convenient way
in which we can quickly see the irreducible representations arising from this
tensor product.
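As a small illustration of this labeling, the following sketch takes the classical case of $n$ spin-1/2 representations of $SU(2)$, where the centralizer is the symmetric group $S_n$. It counts how often each total spin $j$ appears and compares that multiplicity with the dimension of the $S_n$ representation whose Young diagram has rows $(n-k, k)$, with $k = n/2 - j$. The function names and the choice $n = 4$ are assumptions made only for this example.

```python
from fractions import Fraction
from math import comb

def spin_multiplicities(n):
    """Multiplicity of each total spin j in the n-fold tensor product of spin 1/2."""
    mult = {Fraction(0): 1}  # empty product: one copy of the trivial representation
    for _ in range(n):
        new = {}
        for j, m in mult.items():
            # j (x) 1/2 = (j + 1/2) (+) (j - 1/2); the second term is absent for j = 0
            for jj in (j + Fraction(1, 2), j - Fraction(1, 2)):
                if jj >= 0:
                    new[jj] = new.get(jj, 0) + m
        mult = new
    return mult

def two_row_dimension(n, k):
    """Dimension of the S_n representation with Young diagram (n - k, k)."""
    return comb(n, k) - (comb(n, k - 1) if k > 0 else 0)

n = 4  # assumed sample value
mult = spin_multiplicities(n)
for j in sorted(mult, reverse=True):
    k = int(Fraction(n, 2) - j)  # length of the second row of the diagram
    print(f"j = {j}: multiplicity {mult[j]}, S_{n} dimension {two_row_dimension(n, k)}")
```

The two columns agree for every $j$, which is the statement that the copies of each irreducible representation are organized by a representation of the centralizer.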

Now, let us reanalyze Eq. (8.7.26) from this perspective. For our purposes,
we notice that $R$ is the centralizer of the quantum group because it commutes
with the comultiplication $\Delta$. This gives us a general way in which to extract
the quantum group associated with a conformal field theory. Given the fusion
rules of any conformal field theory, the tensor products obey certain braiding
operations. By treating the braiding operator $R$ as the centralizer of an algebra,
we can construct the quantum group via Eq. (8.7.26).

The braiding operator $R$, however, obeys certain complicated relations given
by the $6j$ rules. These hexagon and pentagon rules must also be part of the
definition of the quantum group. In this light, we can reanalyze Eq. (8.7.27),
which we now recognize as containing the information of the hexagon graphs.

In sum, by using the theory of centralizers appearing in ordinary classical Lie
group theory, we are led to define the braiding operator $R$ as the centralizer
of the quantum group since $R$ commutes with the comultiplication operation
defining the quantum group in Eq. (8.7.26). Second, the $6j$ polynomial
equations found in Chapter 3 are now reinterpreted as Eq. (8.7.27).

There still remains one last step. We have to show that Eqs. (8.7.26) and
(8.7.27), which restrict the Hopf algebra, generate the Yang-Baxter relation.
This can be accomplished in a few steps. Let us define the $R$ operator as

\[ R = \sum_i a_i \otimes b_i. \tag{8.7.28} \]

Let the $R$ operator become $R_{ij}$ when it acts on the specific representations
$i$ and $j$. We are interested in its action on the tensor product of three
representations $V^1$, $V^2$, and $V^3$. For example, we have the following identity
[24]:

\[ R_{13} R_{23} = \sum_i \sum_j a_i \otimes a_j \otimes b_i b_j. \tag{8.7.29} \]

(Since the index 3 appears twice on the left-hand side, we notice that the third
factor in the product contains the ordinary product of two $b$'s.)



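Now, let us construct the following sequence of operations, using only the
definitions appearing in the Hopf algebra

\[ \begin{aligned}
(\sigma \cdot \Delta \otimes \mathrm{id})R &= \sum_i \Delta'(a_i) \otimes b_i \\
&= R_{12} \Big( \sum_i \Delta(a_i) \otimes b_i \Big) R_{12}^{-1} \\
&= R_{12} \big[ (\Delta \otimes \mathrm{id})(R) \big] R_{12}^{-1} \\
&= R_{12} R_{13} R_{23} R_{12}^{-1}.
\end{aligned} \tag{8.7.30} \]

But, we also know

\[ \begin{aligned}
(\sigma \cdot \Delta \otimes \mathrm{id})(R) &= \sigma_{12}[(\Delta \otimes \mathrm{id})(R)] \\
&= \sigma_{12}(R_{13} R_{23}) = R_{23} R_{13}.
\end{aligned} \tag{8.7.31} \]

By equating these two expressions, we now have

\[ R_{12} R_{13} R_{23} = R_{23} R_{13} R_{12}, \tag{8.7.32} \]

which is the Yang-Baxter relation.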

In summary, given a conformal field theory, we can always define the braiding
operator as the universal $R$ matrix and $R_{ij}$. Treating the braiding operator
as a centralizer, we can then construct the quantum group by the statement that
$R$ commutes with the comultiplication defined on the quantum group; $R_{ij}$, in
turn, satisfies the Yang-Baxter relation.


Example: $SU(2)_q$

Some examples will help to clarify these rather arbitrary definitions, which
were originally motivated by examining integrable systems. For $SU(2)_q$, for
example, on the generators of the algebra,

\[ [X^+, X^-] = [H], \qquad [H, X^\pm] = \pm 2X^\pm, \tag{8.7.33} \]

we can define the comultiplication, antipode, and co-unit operations

\[ \begin{aligned}
\Delta(H) &= H \otimes 1 + 1 \otimes H, \\
\Delta(X^\pm) &= X^\pm \otimes q^{H/4} + q^{-H/4} \otimes X^\pm, \\
\epsilon(H) &= \epsilon(X^\pm) = 0, \qquad \epsilon(1) = 1, \\
\gamma(H) &= -H, \qquad \gamma(X^\pm) = -q^{\pm 1/2} X^\pm.
\end{aligned} \tag{8.7.34} \]

We can show that the universal $R$ matrix given earlier satisfies Eqs. (8.7.26)
and (8.7.27) if we choose the above rules for the comultiplication, antipode,
and co-unit operations.
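A quick numerical check of these statements can be made in the spin-1/2 representation. The sketch below is only an illustration under stated assumptions: it takes $H = \mathrm{diag}(1, -1)$, uses the sample value $q = 1.7$, and assumes the universal $R$ matrix has the form $R = q^{H \otimes H/4} \sum_n \frac{(1-q^{-1})^n}{[n]!} q^{-n(n-1)/4} (q^{nH/4}(X^+)^n) \otimes (q^{-nH/4}(X^-)^n)$, whose sum stops at $n = 1$ here because $(X^+)^2 = 0$. It tests that the comultiplication preserves $[X^+, X^-] = [H]$, that $R$ intertwines $\Delta$ and $\sigma \cdot \Delta$ as in Eq. (8.7.26), and that the Yang-Baxter relation (8.7.32) holds.

```python
import numpy as np

q = 1.7                                    # assumed generic real value of q
I2 = np.eye(2)
H = np.diag([1.0, -1.0])                   # spin 1/2: eigenvalues of H are +1, -1
Xp = np.array([[0.0, 1.0], [0.0, 0.0]])    # raising operator X^+
Xm = Xp.T                                  # lowering operator X^-
K = np.diag(q ** (np.diag(H) / 4))         # q^{H/4}
Kinv = np.linalg.inv(K)                    # q^{-H/4}

def qbracket_diag(D):
    """[D] = (q^{D/2} - q^{-D/2}) / (q^{1/2} - q^{-1/2}) for a diagonal matrix D."""
    d = np.diag(D)
    return np.diag((q ** (d / 2) - q ** (-d / 2)) / (q ** 0.5 - q ** -0.5))

# Comultiplication of Eq. (8.7.34) on the generators
dH = np.kron(H, I2) + np.kron(I2, H)
dXp = np.kron(Xp, K) + np.kron(Kinv, Xp)
dXm = np.kron(Xm, K) + np.kron(Kinv, Xm)

# Assumed universal R matrix evaluated on spin 1/2 (x) spin 1/2 (terms n = 0, 1 only)
R = np.diag(q ** (np.diag(np.kron(H, H)) / 4)) @ (
    np.eye(4) + (1 - 1 / q) * np.kron(K @ Xp, Kinv @ Xm))

# Permutation sigma(x (x) y) = y (x) x as a 4 x 4 matrix
P = np.zeros((4, 4))
for a in range(2):
    for b in range(2):
        P[2 * b + a, 2 * a + b] = 1.0

homomorphism_ok = np.allclose(dXp @ dXm - dXm @ dXp, qbracket_diag(dH))
intertwiner_ok = all(np.allclose(P @ d @ P @ R, R @ d) for d in (dH, dXp, dXm))

# R_12, R_23, R_13 on the triple tensor product
R12 = np.kron(R, I2)
R23 = np.kron(I2, R)
P23 = np.kron(I2, P)
R13 = P23 @ R12 @ P23
yang_baxter_ok = np.allclose(R12 @ R13 @ R23, R23 @ R13 @ R12)

print(homomorphism_ok, intertwiner_ok, yang_baxter_ok)
```

Only the lowest representation is tested, so the check does not probe the $n \geq 2$ terms of the sum; it does confirm that the comultiplication rules and the intertwining property fit together in this case.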





8.8 Hecke and Temperley-Lieb Algebras 

So far, we have seen that the Yang-Baxter relation, in various forms, appears 
at the very heart of knot theory, the polynomial equations of conformal field 
theory, quantum groups, etc. 

However, it turns out that the generators that we have been studying actually 
obey additional relations beyond those which define the Yang-Baxter relation. 

In this section, we will try to investigate these additional constraints and 
rigorize some of our discussion of the Yang-Baxter relation by introducing 
the Hecke and Temperley-Lieb algebras, which allow us to systematically 
explore the mathematical structure of these various formulations. This will 
allow us to tie up the various loose ends and link up the various themes that 
we have stressed in the past two chapters. 

A Hecke algebra is one in which the generators $p_i$ obey the following
relations:

\[ \text{Hecke algebra:} \quad \begin{aligned}
p_i p_{i\pm 1} p_i &= p_{i\pm 1} p_i p_{i\pm 1}, \\
p_i p_j &= p_j p_i, \qquad |i - j| \ge 2, \\
p_i^2 &= (1 - q)p_i + q.
\end{aligned} \tag{8.8.1} \]

We immediately recognize that the first relation is equivalent to the Yang-
Baxter relation. The first two relations, in fact, are nothing but the braid
relations. However, the third relation places a new constraint on the generators
beyond the usual braid relations.

We will find it useful to construct yet another algebra out of the Hecke 
algebra, called the Temperley-Lieb algebra. 

We start with the generators $p_i$ of the Hecke algebra and then impose one
additional constraint

\[ p_i p_{i+1} p_i - p_i p_{i+1} - p_{i+1} p_i + p_i + p_{i+1} - 1 = 0. \tag{8.8.2} \]


Let us define the generators $e_i$ of the Temperley-Lieb algebra as follows:

\[ e_i = \frac{1 - p_i}{1 + q}. \tag{8.8.3} \]

With this new constraint and definition, we can show that these new
generators $e_i$ satisfy

\[ \text{Temperley-Lieb algebra:} \quad \begin{aligned}
e_i e_j &= e_j e_i, \qquad |i - j| \ge 2, \\
e_i^2 &= e_i, \\
e_i e_{i\pm 1} e_i &= \beta^{-1} e_i,
\end{aligned} \tag{8.8.4} \]

where $\beta = 2 + q + q^{-1}$.

The advantage of introducing the Hecke and Temperley-Lieb algebras is that 
we can now rigorously isolate the mathematical form that the Yang-Baxter and 
braid relations take in conformal field theory, quantum groups, and knot theory. 





Let us now recast our previous discussion of quantum groups in terms of
the Hecke and Temperley-Lieb algebras. If we study $SL(N)_q$, for example,
we find that the $R$ matrix commutes with the comultiplication operator $\Delta$, that
is, the Yang-Baxter operators are the centralizers of $SL(N)_q$. More precisely,
the centralizer of $SL(N)_q$ is given by the Hecke generators $p_i$ with parameter
$q$.

To see this, let us find an explicit representation of this algebra. We introduce
the symbol $e_{ij}$, which is an $N \times N$ matrix such that the only nonvanishing
element is equal to 1 and is located in the $i, j$ position in the matrix. Because
$e_{ij}$ actually has four sets of indices, we will suppress the $N \times N$ indices. Now
define the $R$ matrix as

\[ R = \sum_{i \ne j} e_{ij} \otimes e_{ji} + q^{1/2} \sum_i e_{ii} \otimes e_{ii} + (q^{1/2} - q^{-1/2}) \sum_{i<j} e_{jj} \otimes e_{ii}. \tag{8.8.5} \]

(Notice that $R$ is the tensor product of two separate $N \times N$ matrix spaces.)
Now define the multiplication operation on $R$ such that the two individual
tensor spaces multiply separately, so that $R^2$ lies in the same space as $R$.

Then it is an easy matter to show the following:

\[ R^2 = (q^{1/2} - q^{-1/2})R + 1. \tag{8.8.6} \]

Now define the $p_i$ matrix as follows:

\[ p_i = -q^{1/2}\, 1 \otimes \cdots \otimes R_{i,i+1} \otimes \cdots \otimes 1, \tag{8.8.7} \]

where the indices $i, i + 1$ indicate the spaces in the tensor product where $p_i$
acts nontrivially.

Then, by explicit calculation, we can show that the $p_i$ satisfy the defining
relations of the Hecke algebra. Thus, the Hecke algebra and quantum groups
are related in the most intimate way via the centralizer.
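The explicit calculation can be sketched numerically. The example below assumes the reading $R = \sum_{i \ne j} e_{ij} \otimes e_{ji} + q^{1/2} \sum_i e_{ii} \otimes e_{ii} + (q^{1/2} - q^{-1/2}) \sum_{i<j} e_{jj} \otimes e_{ii}$ of Eq. (8.8.5) and $p_i = -q^{1/2} R_{i,i+1}$ for Eq. (8.8.7), and uses the sample values $N = 3$, $q = 0.6$ on three tensor factors. It checks Eq. (8.8.6), the braid relation, the quadratic Hecke relation, and the idempotency of $e_1 = (1 - p_1)/(1 + q)$; the helper names are hypothetical.

```python
import numpy as np

def unit(N, i, j):
    """N x N matrix unit e_ij: a single 1 in row i, column j."""
    e = np.zeros((N, N))
    e[i, j] = 1.0
    return e

def r_matrix(N, q):
    """R of Eq. (8.8.5), as read above, written as an N^2 x N^2 matrix."""
    s = np.sqrt(q)
    R = np.zeros((N * N, N * N))
    for i in range(N):
        R += s * np.kron(unit(N, i, i), unit(N, i, i))
        for j in range(N):
            if i != j:
                R += np.kron(unit(N, i, j), unit(N, j, i))
            if i < j:
                R += (s - 1 / s) * np.kron(unit(N, j, j), unit(N, i, i))
    return R

N, q = 3, 0.6                              # assumed sample values
s = np.sqrt(q)
R = r_matrix(N, q)
one = np.eye(N)
identity = np.eye(N ** 3)

# Hecke generators on three tensor factors, Eq. (8.8.7)
p1 = -s * np.kron(R, one)
p2 = -s * np.kron(one, R)

r_squared_ok = np.allclose(R @ R, (s - 1 / s) * R + np.eye(N * N))    # Eq. (8.8.6)
braid_ok = np.allclose(p1 @ p2 @ p1, p2 @ p1 @ p2)                    # first line of (8.8.1)
hecke_ok = all(np.allclose(p @ p, (1 - q) * p + q * identity) for p in (p1, p2))

e1 = (identity - p1) / (1 + q)                                        # Eq. (8.8.3)
idempotent_ok = np.allclose(e1 @ e1, e1)

print(r_squared_ok, braid_ok, hecke_ok, idempotent_ok)
```

With only three tensor factors the commutation relation for $|i - j| \ge 2$ is not exercised, and the additional constraint (8.8.2) is not tested here.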

However, there is one additional constraint that we must impose if we wish
to study $SU(N)_q$. The irreducible representations of $SU(N)_q$ have the same
Young tableaux as the representations of the classical Lie group $SU(N)$. The
difference, however, between the two sets of Young tableaux is that they have
different operators which symmetrize the indices of the tableaux. For an ordinary
Lie group, ordinary transpositions can symmetrize the indices, which form
the symmetric group $S_n$. However, for the quantum group, the symmetrizer is
given by the braiding operator which interchanges the indices. In other words,
the symmetrizer is given by a generator of the Hecke algebra. However, we
know from ordinary classical group theory that the Young tableaux can have at
most $N$ rows. In other words, the $(N + 1)$-row antisymmetrizer vanishes. Since
manipulations of the Young tableaux are given by the generators of the Hecke
algebra, this, in turn, gives us an additional constraint on the generators.

Written out explicitly, we find that the extra constraint on the generators is
given precisely by Eq. (8.8.2). But this extra constraint allows us to rewrite the
Hecke generators in terms of the Temperley-Lieb generators. Thus, the algebra
of the centralizers that we have been studying in this chapter for $SU(N)_q$ is
actually the Temperley-Lieb algebra [24].

Another example where the Hecke and Temperley-Lieb algebras play an
important role is in knot theory, where the Hecke relations are equivalent to
the skein relations. We recall that the skein relations express a relationship
between knot invariants defined for $L_\pm$ and $L_0$. Let us call the knots $K_\pm$ and
$K_0$ formed by $L_\pm$ and $L_0$ by tying opposite pairs of strings together (such that
$K_+$ consists of a link, $K_-$ consists of two separate unknots, and $K_0$ reduces
to a single unknot). Then these three knots are related to each other by the
following operations: $K_+ = \sigma_1^2 K_-$ and $K_0 = \sigma_1 K_-$. Now let us examine the
skein relation, defined in terms of the knot polynomials and the variable $q$.
Therefore we have the relation

\[ K_+ - qK_- = (q - 1)K_0, \tag{8.8.8} \]

which, in turn, is equivalent to the relation $\sigma_1^2 = (q - 1)\sigma_1 + q$. This relation,
in turn, is identical to the constraint found in the Hecke algebra in Eq.
(8.8.1) once we identify $p_i = -\sigma_i$. In this way, starting from the skein relation,
we recover the constraint which defines the Hecke algebra. This shows the
relation between these two formalisms.

In statistical mechanics, we also see the importance of these algebras.
Previously, we were able to show that the vertex models could be written in
terms of an $S$ matrix $S^{rs}_{pq}$ and that the IRF models could be written in terms
of $w(p, q | r, s)$. Both these operators, in turn, could be expressed in terms of
$X_i(u)$, which in turn obeyed the braid relations.

However, these $X_i(u)$ operators can be shown to obey the Temperley-Lieb
relations. We recall that these operators obeyed the braid relations

\[ \begin{aligned}
X_i(u) X_j(v) &= X_j(v) X_i(u), \qquad |i - j| \ge 2, \\
X_i(u) X_{i+1}(u + v) X_i(v) &= X_{i+1}(v) X_i(u + v) X_{i+1}(u).
\end{aligned} \tag{8.8.9} \]

Now let us redefine

\[ X_i(u) = I + \beta^{1/2} [\sin u / \sin(\lambda - u)]\, e_i, \tag{8.8.10} \]

where $e_i$ obey the Temperley-Lieb relation, and where $\beta = 4\cos^2 \lambda$. Thus, the
operators of the IRF models obey the Temperley-Lieb algebra.
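The statement that $X_i(u) = I + \beta^{1/2} [\sin u / \sin(\lambda - u)] e_i$ obeys the braid relation with spectral parameters whenever the $e_i$ obey the Temperley-Lieb relations can be tested in a concrete representation. The sketch below uses a standard spin-1/2 matrix representation of the Temperley-Lieb generators, which is an assumption of this example and is not taken from the text, together with the sample values $\lambda = 0.7$, $u = 0.2$, $v = 0.3$.

```python
import numpy as np

lam = 0.7                                  # assumed sample value of lambda
a = np.exp(1j * lam)                       # a = q^{1/2}, so 2 + q + 1/q = 4 cos^2(lambda)
beta = 4 * np.cos(lam) ** 2

# Assumed two-site matrix E with E^2 = sqrt(beta) E; the generator is e = E / sqrt(beta)
E = np.array([[0, 0, 0, 0],
              [0, 1 / a, -1, 0],
              [0, -1, a, 0],
              [0, 0, 0, 0]], dtype=complex)
I2 = np.eye(2)
I8 = np.eye(8)
e1 = np.kron(E, I2) / np.sqrt(beta)
e2 = np.kron(I2, E) / np.sqrt(beta)

# Temperley-Lieb relations of Eq. (8.8.4)
idempotent_ok = np.allclose(e1 @ e1, e1) and np.allclose(e2 @ e2, e2)
cubic_ok = (np.allclose(e1 @ e2 @ e1, e1 / beta)
            and np.allclose(e2 @ e1 @ e2, e2 / beta))

def X(e, u):
    """X(u) = I + sqrt(beta) [sin u / sin(lambda - u)] e, Eq. (8.8.10)."""
    return I8 + np.sqrt(beta) * np.sin(u) / np.sin(lam - u) * e

u, v = 0.2, 0.3                            # assumed sample spectral parameters
lhs = X(e1, u) @ X(e2, u + v) @ X(e1, v)
rhs = X(e2, v) @ X(e1, u + v) @ X(e2, u)
yang_baxter_ok = np.allclose(lhs, rhs)

print(idempotent_ok, cubic_ok, yang_baxter_ok)
```

The check works because the coefficient $f(u) = \sin u / \sin(\lambda - u)$ satisfies $f(u+v)[1 - f(u)f(v)] = f(u) + f(v) + 2\cos\lambda\, f(u) f(v)$, which is the one condition left after the Temperley-Lieb relations are used to reduce both sides.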

Let us now summarize the results of the last few chapters and isolate once 
again the remarkable relationship between the Yang-Baxter relation (or more 
specifically, the Hecke and Temperley-Lieb algebras) and a bewildering variety 
of quantum systems, such as conformal field theory, quantum groups, knot 
theory, soliton theory, and statistical mechanics. 

First, we encountered these relations when we studied rational conformal
field theories for $c < 1$ and then constructed their conformal blocks, which
are the building blocks for the theory. These conformal blocks, in turn, obeyed
certain identities when we twisted or pinched internal lines. These identities
were called the polynomial equations, such as the pentagon and hexagon
relation. The algebras obeyed by these twisting and pinching operations, in turn,
were equivalent to the Yang-Baxter relation.

Second, we investigated various statistical mechanical models, such as the 
Ising model or the RSOS model. We found that the partition function for these 
models could all be expressed in terms of the transfer matrix. The essential 
feature which allowed us to solve these models exactly was that the transfer 
matrices commuted. The algebraic statement of commuting transfer matrices, 
in turn, was the Yang-Baxter relation. 

Third, we studied solitons, which are exactly solvable, topologically stable
nonlinear solutions of two-dimensional wave equations. Again, we found
that the dynamics of their evolution could be governed by a transfer matrix,
and that the essential feature which made the system solvable was commuting
transfer matrices. Expressed mathematically, this gave us the Yang-Baxter
relation.

Fourth, we investigated knot theory. The essential step in knot theory was 
to cut the knot and form a braid. Then we could systematically deform the 
topology of the knot by braiding the strands, that is, using the braid relations 
of Artin. Defining a trace operation on the braid relations, in turn, gave us 
the various knot polynomials, such as the Jones, HOMFLY, and Kauffman 
polynomials. These braid relations, in turn, were identical to the Yang-Baxter 
relation. 

Fifth, we investigated quantum groups, which differ from ordinary Lie 
groups by a continuous parameter q. The relation to the Yang-Baxter relation 
could be established in a number of ways. Crudely, we could say that the Jacobi 
relation found in ordinary classical group theory corresponds to the Yang- 
Baxter relation for quantum groups. We also saw that the Clebsch-Gordan 
coefficients created by taking tensor products of various representations obeyed 
certain braiding relations when we twisted lines, which gave us a one-to-one 
correspondence between these coefficients and conformal blocks. We also 
found that from the generators, we could define a comultiplication operator $\Delta$.
We found that the $R$ matrix commutes with $\Delta$: $R\Delta = \Delta R$. We say that the
Temperley-Lieb algebra is the centralizer of the quantum group.

In summary, we find that the essential reason why these various two- 
dimensional systems are exactly solvable is because they are ultimately based 
on the Yang-Baxter relation, or, more precisely, on either the Hecke algebra 
or the Temperley-Lieb algebra. We summarize this by 


conformal field theory -> polynomial equations -> Yang-Baxter,
statistical mechanics -> commuting T matrices -> Yang-Baxter,
soliton theory -> commuting T matrices -> Yang-Baxter,          (8.8.11)
knot theory -> braid relation -> Yang-Baxter,
quantum groups -> centralizer -> Yang-Baxter.





8.9 Summary 

The latest method by which to categorize conformal field theories is the use of
Chern-Simons gauge theory and knot theory. It is the only method that tries
to explain conformal field theory by starting with simpler structures in three
dimensions to explain the “accidents” of two-dimensional physics.

Our starting point is the action

\[ L = \frac{k}{8\pi} \int \epsilon^{ijk}\, \mathrm{Tr}\Big[ A_i(\partial_j A_k - \partial_k A_j) + \frac{2}{3} A_i [A_j, A_k] \Big], \tag{8.9.1} \]

which is generally covariant without introducing a metric tensor (because $\epsilon^{ijk}$
transforms as a tensor density). The physical states will then be composed of
Wilson loops

\[ W_R(C) = \mathrm{Tr}_R\, P \exp i \oint_C A_i\, dx^i, \tag{8.9.2} \]

and the correlation functions consist of invariants defined with the topology of
knots and links

\[ \Big\langle \prod_i W_{R_i}(C_i) \Big\rangle = \int DA\, \exp(iL) \prod_i W_{R_i}(C_i). \tag{8.9.3} \]

To quantize the system, in the Coulomb gauge, we can impose the gauge 
constraint 


\[ \epsilon^{ij} F_{ij} = 0, \tag{8.9.4} \]

which shows that the physical space is spanned by the moduli of flat curvatures 
modulo gauge transformations. This space is well known to mathematicians, 
and it is the space of conformal blocks. 

If we take a time slice of a three-dimensional space, the resulting surface 
has a complex structure, and the Hilbert space consists of conformal blocks. 
Thus, we have made a transition from a theory of infinite degrees of freedom 
to a topological system of finite degrees of freedom. 

We can also make the link to the WZW model by solving the gauge
constraint. A solution is given by

\[ A = -(dU)U^{-1}, \tag{8.9.5} \]

which has zero curvature. Assuming $\Sigma$ is bounded, we can plug this into the
original action and find

\[ S = kS_{\rm WZW} = \frac{k}{4\pi} \int_{\partial Y} \mathrm{Tr}(U^{-1} \partial_\phi U\, U^{-1} \partial_t U)\, d\phi\, dt + \frac{k}{12\pi} \int_Y \mathrm{Tr}(U^{-1} dU)^3, \tag{8.9.6} \]

where $\phi$ is an angular variable defined around the perimeter of $\Sigma$. This is a
version of the WZW action. Thus, all the previous results on conformal field
theory, including a complete representation of the rational conformal field
theories, emerge from Chern-Simons gauge theory.

Because the states are completely generally covariant, the correlation functions
must be topological invariants defined on knots and links. The classical
Gauss linking number

\[ \Phi(C_a, C_b) = \frac{1}{4\pi} \oint_{C_a} dx^i \oint_{C_b} dy^j\, \epsilon_{ijk} \frac{(x - y)^k}{|x - y|^3} \tag{8.9.7} \]

(which tells us the degree to which a series of knots is intertwined) emerges
when we compute correlation functions.
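The Gauss double integral is easy to evaluate numerically, which gives a feel for how it counts linking. The sketch below discretizes $\Phi(C_a, C_b) = \frac{1}{4\pi} \oint dx^i \oint dy^j \epsilon_{ijk} (x-y)^k / |x-y|^3$ for two circles; the particular circles, the grid size, and the function names are assumptions chosen for illustration. The overall sign depends on the orientations of the curves, so only the absolute value is compared.

```python
import numpy as np

def circle(center, u, v, n=400):
    """Points and tangent vectors of the circle center + cos(t) u + sin(t) v."""
    t = 2 * np.pi * np.arange(n) / n
    points = center + np.outer(np.cos(t), u) + np.outer(np.sin(t), v)
    tangents = -np.outer(np.sin(t), u) + np.outer(np.cos(t), v)
    return points, tangents

def gauss_linking_number(curve_a, curve_b):
    """Riemann-sum approximation of the Gauss linking integral of two closed curves."""
    x, dx = curve_a
    y, dy = curve_b
    r = x[:, None, :] - y[None, :, :]                   # x - y for every pair of points
    cross = np.cross(dx[:, None, :], dy[None, :, :])    # epsilon_ijk dx^i dy^j
    integrand = np.sum(cross * r, axis=-1) / np.linalg.norm(r, axis=-1) ** 3
    dt_a = 2 * np.pi / len(x)
    dt_b = 2 * np.pi / len(y)
    return integrand.sum() * dt_a * dt_b / (4 * np.pi)

ex, ey, ez = np.eye(3)
ring_a = circle(np.zeros(3), ex, ey)                     # unit circle in the x-y plane
ring_b = circle(np.array([1.0, 0.0, 0.0]), ex, ez)       # passes through the center of ring_a
ring_far = circle(np.array([5.0, 0.0, 0.0]), ex, ez)     # same circle moved far away

linked = gauss_linking_number(ring_a, ring_b)
unlinked = gauss_linking_number(ring_a, ring_far)
print(abs(linked), abs(unlinked))
```

For the interlocked pair the absolute value comes out close to 1, and for the separated pair it comes out close to 0, as expected for an integer-valued topological invariant.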

The goal of knot theory is to find a set of invariants (polynomials) that are
in one-to-one correspondence with topologically distinct knots. The classical
Alexander polynomial $\Delta$, for example, is defined recursively via the skein
relations

\[ \Delta_{L_+} - \Delta_{L_-} = (\sqrt{t} - 1/\sqrt{t})\, \Delta_{L_0}. \tag{8.9.8} \]

A series of knot polynomials has been discovered within the last few years,
beginning with the celebrated Jones polynomial, which satisfies

\[ (1/t)\, V_{L_-} - t\, V_{L_+} = (\sqrt{t} - 1/\sqrt{t})\, V_{L_0}. \tag{8.9.9} \]
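To see how a skein relation determines a polynomial recursively, the sketch below applies the relation in the form $(1/t) V_{L_-} - t V_{L_+} = (\sqrt{t} - 1/\sqrt{t}) V_{L_0}$, normalized so that the unknot has $V = 1$. Laurent polynomials in $\sqrt{t}$ are stored as dictionaries mapping the power of $\sqrt{t}$ to its coefficient. Which diagram counts as $L_+$ is a convention fixed by the figure, so the assignment used here is an assumption, and the results may be the mirror images of those in another convention. All helper names are hypothetical.

```python
def add(p, r, scale=1):
    """Return p + scale * r for Laurent polynomials in sqrt(t)."""
    out = dict(p)
    for e, c in r.items():
        out[e] = out.get(e, 0) + scale * c
    return {e: c for e, c in out.items() if c != 0}

def mul(p, r):
    """Product of two Laurent polynomials in sqrt(t)."""
    out = {}
    for e1, c1 in p.items():
        for e2, c2 in r.items():
            out[e1 + e2] = out.get(e1 + e2, 0) + c1 * c2
    return {e: c for e, c in out.items() if c != 0}

def shift(p, k):
    """Multiply p by sqrt(t)**k."""
    return {e + k: c for e, c in p.items()}

Z = {1: 1, -1: -1}                          # sqrt(t) - 1/sqrt(t)

def v_plus(v_minus, v_zero):
    """Solve (1/t) V_- - t V_+ = (sqrt(t) - 1/sqrt(t)) V_0 for V_+."""
    return shift(add(shift(v_minus, -2), mul(Z, v_zero), scale=-1), -2)

unknot = {0: 1}
unlink = {1: -1, -1: -1}                    # -(sqrt(t) + 1/sqrt(t)) for two separate unknots

# The unlink value follows from the relation with L_+ = L_- = unknot, L_0 = unlink
unlink_ok = mul(Z, unlink) == add(shift(unknot, -2), shift(unknot, 2), scale=-1)

hopf = v_plus(unlink, unknot)               # undo one crossing of the Hopf link
trefoil = v_plus(unknot, hopf)              # undo one crossing of the trefoil
print(hopf, trefoil)
```

With these conventions the Hopf link gives $-t^{-5/2} - t^{-1/2}$ and the trefoil gives $t^{-1} + t^{-3} - t^{-4}$. The same three helper functions can be reused for any knot once a sequence of crossing changes reducing it to unknots has been chosen.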

The HOMFLY polynomial contains both the Jones and Alexander polynomials
as subsets and satisfies

\[ t P_{L_+} - t^{-1} P_{L_-} = z P_{L_0}, \tag{8.9.10} \]

where $L_\pm$ and $L_0$ are defined in Fig. 8.2. The Kauffman polynomial $R_L$ is
actually defined on knots made of ribbons (framed knots) and satisfies

\[ R_{L_+} - R_{L_-} = z R_{L_0} \tag{8.9.11} \]

with the additional condition

\[ R_{L_+} = \alpha R_{L_0}, \qquad R_{L_-} = \alpha^{-1} R_{L_0}, \tag{8.9.12} \]

where the $L_\pm$ and $L_0$ are given in Fig. 8.5. Fortunately, quantum field theory
can generate all these knot invariants (and give analytic expressions for all of
them) and infinite classes of new ones.

If we quantize the Chern-Simons theory covariantly, we have

KWm) = < 8 - 9 - 13 > 

so we can, by brute force, power expand the Wilson loops and take the matrix
elements of the power series. We find, to first order in $1/k$,

(W(C)) = —Tr (R b R a )j> dx» j y dy^A^Alix)) 

2tc 

— -dim Rc 2 {R) 4 >(C ), 

k 


(8.9.14) 





where $c_2(R)\,\mathbf{1} = R^a R^a$ and

* = (8 ' 915> 

So, the Gauss linking number comes out of the $1/k$ expansion of the
Chern-Simons correlation functions over Wilson loops.

The complete correlation functions over Wilson loops can be shown to 
satisfy 


$L + — 

SL-=<x~ 1 Sl 0 , (8-9.16) 

ps L+ -r l s L _ = zs Lo , 

which, in turn, generate a new knot invariant. We can relate this new knot 
invariant to the HOMFLY invariant via 


P L {t = a, 0, z ) = cT^SUa, 0, z). (8.9.17) 


(This new knot invariant is defined only on ribbons, or framed knots, because 
quantum field theory is ambiguous when two strands are defined at the same 
point.) 

An even larger class of knot invariants can be constructed via conformal field
theory. We first note that the correlation functions of conformal field theory
obey certain relations when we change the order of the points $z_i$:


i 

Vl(zu • •• > Zi+\,Zi, . . . ,Zn) = , Zi, Zi+U ..., Zn)- 

7 = 1 

(8.9.18) 

These $B$ matrices, in turn, form a representation of Artin’s braid group.
(Braids can be turned into knots by wrapping the ends of the various strands
together.) In addition, by examining the $6j$ transformation rules of rational
conformal field theory, we can generate a representation of the Yang-Baxter
relation. Let the $B$ matrix correspond to twisting two external lines of a four-
point correlation function, and let $F$ represent the matrix corresponding to
fusing two legs (so an $s$-channel graph turns into a $t$-channel graph). Then,
these matrices satisfy



J 3 
ji 


B 


JU9 


h 7*4 d . h h 

P js J pn L ji 79 _ 


J1P 


7*3 

74 b ■ 

h 

74 

R 

72 

7*3 

.76 

75 Mt j 

.7i 

p. 

°PJ9 

_ 76 

7*5 _ 


(8.9.19) 


which is the Yang-Baxter relation. 

Notice that the topology of the Yang-Baxter relationship has precisely
the topology found in Artin’s braid group representation. Thus, we suspect
that a representation of the knot invariants should be possible via statistical
mechanics.

To show this, we note that if two closed braids represent the same link, then 
it is possible to deform one link into the other link by a succession of Markov 
moves. Thus, a knot invariant must necessarily remain invariant under these 
Markov moves: 


\[ \begin{aligned}
\alpha(AB) &= \alpha(BA), \\
\alpha(AU_n) &= \alpha(AU_n^{-1}) = \alpha(A).
\end{aligned} \tag{8.9.20} \]


The problem, therefore, is to find an object in statistical mechanics that is 
invariant under these Markov moves. However, we notice that the matrices 
found in statistical mechanics obey precisely the relations of the braid group 
because they satisfy the Yang-Baxter relations. Thus, given a braid A in this 
representation, the knot invariant associated with A is given by a trace 

a(A) = 2 ]"- 1 j t (HA). (8.9.21) 

One advantage of this formalism is that we have a vast number of statistical 
mechanical models obeying the Yang-Baxter relationship and, hence, forming 
a representation of the braid group. By forming the appropriate traces over 
these braid matrices, we can generate all the known link polynomials, as well 
as infinite classes of new link polynomials. 

Last, we note that when constructing modular invariants out of Kac-Moody
algebras in terms of primary fields, much of the information concerning the
structure of the group is lost in the process. We only need to manipulate the
primary fields and their infinite descendants in order to construct these
characters. However, we are not reducing the Kac-Moody algebras to ordinary Lie
algebras, because the latter is not compatible with the central charge. Thus,
the reduction process cannot yield the usual Lie algebras, but something more
general. These are the quantum groups, which were originally discovered by
examining the Yang-Baxter relation.

The defining relation of the quantum group $SU(2)_q$ is

\[ [J_3, J_\pm] = \pm 2J_\pm, \qquad [J_+, J_-] = [J_3], \tag{8.9.22} \]

where the brackets in $[J_3]$ mean

\[ [x] = \frac{q^{x/2} - q^{-x/2}}{q^{1/2} - q^{-1/2}}. \tag{8.9.23} \]


Because only the last commutator is changed, much of the representation of the 
quantum groups is identical to the usual representation of the classical groups. 
Notice that in the limit of q -> 1, we retrieve the usual classical theory. 
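The behavior of the bracket is easy to explore numerically. The sketch below evaluates $[x] = (q^{x/2} - q^{-x/2})/(q^{1/2} - q^{-1/2})$ in two regimes: near $q = 1$, where $[x]$ returns to the ordinary number $x$, and at the root of unity $q = e^{2\pi i/(k+2)}$, where $[n] = \sin(\pi n/(k+2))/\sin(\pi/(k+2))$ and therefore $[k+2] = 0$. The function name and the sample level $k = 3$ are assumptions made for illustration.

```python
import cmath
import math

def q_number(x, q):
    """q-deformed number [x] = (q^{x/2} - q^{-x/2}) / (q^{1/2} - q^{-1/2})."""
    return (q ** (x / 2) - q ** (-x / 2)) / (q ** 0.5 - q ** -0.5)

# Classical limit: for q close to 1 the bracket returns to the ordinary number
near_classical = q_number(3, 1 + 1e-6)

# Root of unity matching level k of the WZW model
k = 3                                       # assumed sample level
q = cmath.exp(2j * math.pi / (k + 2))
values = [q_number(n, q) for n in range(1, k + 3)]
sines = [math.sin(math.pi * n / (k + 2)) / math.sin(math.pi / (k + 2))
         for n in range(1, k + 3)]

print(near_classical)
print([round(abs(v), 6) for v in values])
```

The vanishing of $[k+2]$ is the numerical counterpart of the truncation of the allowed spins at level $k$, while the first check reproduces the statement that the classical theory is recovered as $q \to 1$.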

Last, by examining the “Clebsch-Gordan” coefficients generated by these
algebras, one can make an association between the $q$ found in quantum groups
and the $k$ found in Kac-Moody algebras

\[ q = e^{2\pi i/(k+2)}. \tag{8.9.24} \]





References 


1. E. Witten, Comm. Math. Phys. 121, 351 (1989). 

2. S. Elitzur, G. Moore, A. Schwimmer, and N. Seiberg, Nucl. Phys. B326, 108 
(1989). 

3. G. Moore and N. Seiberg, Phys. Lett. 220B, 422 (1989). 

4. S. Moran, The Mathematical Theory of Knots and Braids , North-Holland, 
Amsterdam (1983). 

5. D. Rolfsen, Knots and Links , Publish or Perish, Berkeley (1976). 

6. J. H. Conway, in Computational Problems in Abstract Algebra, Pergamon, Oxford 
(1970). 

7. J. W. Alexander, Proc. Natl. Acad. Sci. 9, 93 (1928); Trans. Amer. Math. Soc. 20, 
275 (1923). 

8. E. Artin, Ann. Math. 48, 101 (1947). 

9. V. F. R. Jones, Invent. Math. 72, 1 (1983); Bull. Amer. Math. Soc. 12, 103 (1985); 
Ann. of Math. 12, 239 (1985). 

10. J. S. Birman, Invent. Math. 81, 138 (1985). 

11. P. Freyd, D. Yetter, J. Hoste, W. B. R. Lickorish, K. Millet, and A. Ocneanu, 
Bull. Amer. Math. Soc. 12, 239 (1985). 

12. L. Kauffman, Topology 26, 395 (1987); On Knots , Princeton University Press, 
Princeton, NJ (1987). 

13. E. Guadagnini, M. Martellini, M. Mintchev, Nucl. Phys. B330, 575 (1990). 

14. P. Cotta-Ramusino, E. Guadagnini, M. Martellini, M. Mintchev, Nucl. Phys. B330,
557 (1990).

15. A. Tsuchiya and Y. Kanie, in Conformal Field Theory and Solvable Lattice 
Models , Advances in Studies in Pure Mathematics 16, 297 (1988); Lett. Math. 
Phys. 13, 303 (1987). 

16. G. Moore and N. Seiberg, Lectures on RCFT , 1986 Summer Trieste Summer 
School. 

17. Y. Akutsu, T. Deguchi, and M. Wadati, Phys. Rep. 180, 248 (1989); J. Phys.
Soc. Japan. 56, 3039 (1987); 57, 757 (1988); 57, 1905 (1988).

18. J. Frohlich, Nonperturbative Quantum Field Theory, 1987 Cargese Lectures, 
Plenum Press, New York (1987). 

19. A. A. Markov, Recueil Math. 1, 73 (1935). 

20. V. G. Drinfeld, Proceedings of the International Congress of Mathematics , 
Berkeley, CA (1986). 

21. M. Jimbo, Lett. Math. Phys. 10, 63 (1985); 11, 247 (1986); Comm. Math. Phys. 
102, 537 (1986). 

22. L. D. Faddeev, N. Yu. Reshetikhin, and L. A. Takhtajan, LOMI preprint E-14-87 
(1987). 

23. A. Kirillov and N. Yu. Reshetikhin, LOMI preprint E9-88 (1988).

24. L. Alvarez-Gaume, C. Gomez, and G. Sierra, Nucl. Phys. B330, 347 (1990); 
Phys. Lett. 220B, 142 (1989); “Topics in Conformal Field Theory,” in Physics 
and Mathematics of Strings, World Scientific, Singapore (1990). 



Part II 


Nonperturbative Methods 




CHAPTER 9 


String Field Theory 


9.1 First Versus Second Quantization 

Although the methods of conformal field theory have given us a wealth of 
possible string vacuums and a framework in which to begin phenomenology, 
there are still severe deficiencies in this formulation. First, conformal field 
theory is necessarily a perturbative formulation. It is based on the first quantized 
string model propagating on various compactified manifolds. The problem is 
that the first quantized functional formulation [Eq. (1.2.1)] is based on the sum 
over conformally inequivalent Riemann manifolds of genus g , which yields 
a perturbative series of Feynman diagrams. The success of this formulation 
is that it yields a finite formulation of gravity interacting with quarks and 
leptons. However, its drawback is that millions of conformal field theories 
can be constructed using the methods presented in the previous chapters, and 
there is absolutely no concrete way in which to choose which, if any, of these 
millions of vacuums corresponds to our real world. 

What is needed, however, is a second quantized string field theory [1,2] 
that is not necessarily wedded to the sum over Riemann surfaces. We saw 
that perturbation theory by itself was not sufficient to compactify 10- or 26- 
dimensional space-time to a realistic four-dimensional manifold. Thus, an 
entirely new approach is required which allows us to calculate nonperturbative 
results, which we hope will be able to tell us which of the millions upon millions 
of conformal field theories are stable and which one, if any, our universe prefers. 

Second, it is not clear whether the perturbation series makes any sense. 
Investigations of the high-energy behavior of the higher-order graphs indicate 
that the perturbation series is not Borel summable. Ordinary gauge theories, 
such as QED, QCD, or the electro-weak theory, are also not Borel summable, 
which means that we must treat their perturbation series as an asymptotic one. 






Although QED, for example, rapidly converges to the correct value of the S 
matrix for electron-photon processes at low orders in the coupling constant, 
eventually the perturbation series must diverge. This is not a problem for gauge 
theories, because we can always say that they must be embedded into a more 
realistic theory of the universe, which is Borel summable. 

However, since string theory makes the pretense that it is the unifying theory 
of the universe, we cannot take refuge by embedding it into a higher theory. 
The meaning of this is that perturbation theory around conformal field theory 
is a potentially dangerous path and that the final formulation of string theory 
must necessarily be nonperturbative. 

Third, the first quantized string at higher genus g is actually ill defined 
because of a century-old problem, the triangulation of moduli space. When we 
write the “sum over conformally inequivalent surfaces” in the path integral, the 
sum is actually ambiguous because of the problem of finding specific moduli 
for higher genus surfaces. The dimension of moduli space is well known, 
$6g - 6 + 2N$, but finding specific coordinates that implement this triangulation
is exceedingly difficult. For the past century, this problem in classical
mathematics was unsolved. In the last few years, three triangulations of moduli 
space have been given: 

(1) Light cone coordinates: we will discover that the simplest string field 
theory, the light cone theory, solves this century-old problem in a simple 
way, via twists, string lengths, and propagation times [3]. 

(2) Harer coordinates: we will find that covariant open string field theory, 
given by Witten, implements this set of coordinates [4]. 

(3) Penner coordinates: so far, no string field theory can reproduce this set of 
coordinates [5]. 

In summary, we find that the first quantized functional in Eq. (1.2.1) is 
actually not well defined, but that the second quantized theory gives us explicit 
triangulations of moduli space. In fact, of the three known triangulations of 
moduli space in the mathematical literature, two of them come from string 
field theory. (The nonpolynomial theory, to be discussed in the next chapter, 
gives a fourth triangulation of moduli space, but the full details have yet to be 
worked out.) 

At present, there have been various proposals for a nonperturbative formulation
of string theory, which we will discuss, such as string field theory [1,
2, 6-10], which is a second quantized theory of strings. More nonperturbative 
formulations such as M-theory will also be presented later. 

For the general theory of relativity, the equivalence principle enabled Einstein
to see that general covariance lay at the foundation of any theory of
gravity. Then, it was straightforward to find the mathematical language in 
which to formulate the equivalence principle and general covariance. 

At the present time, the string counterparts of the equivalence principle 
and general covariance are still not known, and hence, this is the heart of the 
problem in finding the right framework in which to formulate the theory. In 



9.1 First Versus Second Quantization 277 


this sense, string theory has been evolving backward, ever since its accidental 
discovery in 1968. With these remarks, we now begin with a discussion of 
string field theory, and the difference between first and second quantization. 

A first quantized theory is formulated in terms of the coordinates describing 
a particle’s motion. For a point-particle, for example, its relativistic action is 
given by the invariant length swept out by its path. Let $x_\mu(\tau)$ represent a vector
that points from the origin to the location of a particle. As the particle moves,
it sweeps out a line, parametrized by $\tau$. The action is

$$S = -m \int d\tau \, \sqrt{-\dot{x}_\mu^2} \sim \text{length}, \tag{9.1.1}$$

which is invariant under reparametrizations of the path

$$\tau \to f(\tau). \tag{9.1.2}$$

This reparametrization invariance allows us to select a particular gauge choice, 
which we may choose to be 


$$x_0 = t = \tau, \tag{9.1.3}$$


in which case the action assumes the familiar nonrelativistic form in the limit 
of small velocities 


$$S \simeq \frac{1}{2} m \int dt \, v_i^2. \tag{9.1.4}$$


The advantage of this nonrelativistic formulation is that the theory is manifestly 
ghost free because all references to $x_0(\tau)$ have been eliminated.
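As a quick numerical illustration of this limit (a minimal sketch, not part of the original derivation): in the gauge $x_0 = t = \tau$ and in units with $c = 1$, the Lagrangian of Eq. (9.1.1) is $-m\sqrt{1 - v^2}$. For small $v$ it differs from $-m + \frac{1}{2} m v^2$ only at order $v^4$, and the velocity-independent term $-m$ does not affect the equations of motion. The function names below are illustrative choices.

```python
import math

def relativistic_lagrangian(m: float, v: float) -> float:
    """L = -m * sqrt(1 - v^2), the point-particle Lagrangian with c = 1."""
    return -m * math.sqrt(1.0 - v * v)

def nonrelativistic_lagrangian(m: float, v: float) -> float:
    """Small-velocity expansion: rest-mass term plus the usual kinetic term."""
    return -m + 0.5 * m * v * v

m = 1.0
for v in (0.3, 0.1, 0.01):
    exact = relativistic_lagrangian(m, v)
    approx = nonrelativistic_lagrangian(m, v)
    # The difference should shrink roughly like m * v**4 / 8.
    print(f"v = {v:5.2f}  exact = {exact:.10f}  approx = {approx:.10f}  "
          f"diff = {abs(exact - approx):.2e}")
```

The printed differences fall off rapidly as $v$ decreases, which is the sense in which the kinetic term of Eq. (9.1.4) is recovered.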

The scattering amplitudes are defined by imposing, from the outside, the 
set of topologies over which the particle interacts, and then taking the Fourier 
transform 


$$A_n = \sum_{\text{topologies}} \int d\mu \, Dx_\mu(\tau) \, e^{i \int dt \, L} \, e^{i \sum_j p_j \cdot x_j}, \tag{9.1.5}$$


where the sum over topologies represents a sum over predetermined Feynman 
graphs. 

Several conclusions can be immediately drawn from this relatively simple 
example: 


(1) The first quantized formulation is necessarily perturbative. We must impose
from the outside the set of Feynman paths over which to integrate, and
each set of graphs represents a certain order in the perturbation theory.
Nonperturbative phenomena cannot be seen to any finite order in perturbation
theory.

(2) The counting and coefficient of each graph is not clear. It is ambiguous
which weights we assign to the various graphs and which graphs are
included and which are excluded.





(3) The first quantized formulation is not manifestly unitary. Although the 
free theory, via gauge fixing, can be seen to be totally free of ghosts, it 
is not clear that the final perturbative S matrix is unitary. (Hopefully, the 
constraint of unitarity will eventually fix the counting of all graphs.) 

(4) The first quantized formulation is basically on the mass shell. Thus, some 
of the most interesting questions are out of reach of the first quantized 
theory. 

For string theory, there is also an additional complication at the level of 
perturbation theory. In principle, the “sum over all conformally inequivalent 
Riemann surfaces” appears to be an elegant statement of how string perturbation
theory is constructed. However, it tells us virtually nothing about how
to set up coordinates for these genus g surfaces. In fact, as we mentioned 
earlier, choosing moduli that can triangulate the higher Riemann surfaces 
is a notoriously difficult mathematical problem, dating back to the time of 
Riemann. 

To remedy all these difficulties, we now pass to the second quantized theory. 
The first quantized formulation was based on the coordinates $x_\mu$, which describe
the motion of a point-particle. The transition to the second quantized
formulation begins when we introduce a field $\phi(x)$, which is a function of the
coordinates.

In contrast to the first quantized string theory, the second quantized string 
field theory has an explicit dependence on $\Phi^3$ or higher terms, meaning that the
interactions are all fixed ahead of time. The weights and measures are hence 
uniquely fixed, and unitarity to all orders in perturbation theory is almost trivial 
to show. Because of the presence of explicit interaction terms, we now have 
an explicit triangulation of moduli space in terms of string field theory. Thus, 
a century-old mathematics problem, finding the correct moduli for genus g 
Riemann surfaces, is almost trivially solved. 

The second quantized theory is also inherently an off-shell theory, so symmetry
breaking, in principle, can be investigated by the theory. In addition, one
does not have to resort to perturbation theory. In fact, quantum field theory is 
the only formalism in which a variety of techniques have been developed to 
handle nonperturbative phenomena. 

In Chapter 1, we stressed that there are at least three ways in which a 
point-particle or a string can be quantized, the Gupta-Bleuler approach (where 
Lorentz covariance is maintained and ghosts are allowed to propagate, but the 
physical states must be ghost free), the light cone approach (where the theory 
is formulated entirely in terms of ghost-free physical states, but Lorentz invariance
must be carefully checked), and the BRST approach (where covariance
and unitarity are maintained by ensuring that the physical states are BRST
invariant).

We start by trying to compute the canonical momenta corresponding to Eq. 
(9.1.1). We find that its momenta are not independent, but constrained 

$$p_\mu = \frac{\delta L}{\delta \dot{x}^\mu}, \qquad p_\mu^2 + m^2 = 0. \tag{9.1.6}$$





In the Gupta-Bleuler approach, we apply the constraint directly on the fields: 

$$(p_\mu^2 + m^2)\,\phi(x) = 0, \tag{9.1.7}$$

which is just the usual Klein-Gordon equation. This equation, in turn, can be
derived from the standard covariant second quantized action

$$S = \frac{1}{2} \int d^4x \, \phi(x)\,[\Box - m^2]\,\phi(x). \tag{9.1.8}$$

The next method is the light cone approach, where ghosts are explicitly 
eliminated. We start with the gauge-invariant action 

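$$S = \int d\tau \left[ p_\mu \dot{x}^\mu - \frac{1}{2} e\,(p_\mu^2 + m^2) \right], \tag{9.1.9}$$

which is invariant under:

$$\delta x_\mu = \epsilon \dot{x}_\mu, \qquad \delta p_\mu = \epsilon \dot{p}_\mu, \qquad \delta e = \frac{d(\epsilon e)}{d\tau}. \tag{9.1.10}$$

By calculating the equations of motion for $e$ and $p_\mu$, and then eliminating them,
we retrieve the usual first quantized action in terms of $x_\mu$ alone.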

Let us now, however, select the light cone gauge 

$$x^+ = \tau \tag{9.1.11}$$

and solve explicitly for $p^-$ via the constraint

$$p_\mu^2 + m^2 = p_i^2 - 2p^- p^+ + m^2 = 0. \tag{9.1.12}$$

If we apply the gauge on the action, we find that the term $p^- \dot{x}^+$ becomes
$p^-$, that is, $p^-$ is the new Hamiltonian in the light cone gauge. Thus, solving
the constraint, we find that the Hamiltonian is given by

$$H = p^- = \frac{1}{2p^+}\,(p_i^2 + m^2). \tag{9.1.13}$$
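The step from the constraint to the Hamiltonian can be checked numerically. The sketch below assumes the light cone decomposition $p_\mu^2 = p_i^2 - 2p^- p^+$ used in Eq. (9.1.12); the function names and the sample momenta are hypothetical and chosen only for illustration.

```python
def light_cone_hamiltonian(p_transverse, p_plus, m):
    """H = p^- = (p_i^2 + m^2) / (2 p^+), as in Eq. (9.1.13)."""
    p_i_squared = sum(p * p for p in p_transverse)
    return (p_i_squared + m * m) / (2.0 * p_plus)

def mass_shell_constraint(p_transverse, p_plus, p_minus, m):
    """Left-hand side of Eq. (9.1.12): p_i^2 - 2 p^- p^+ + m^2."""
    p_i_squared = sum(p * p for p in p_transverse)
    return p_i_squared - 2.0 * p_minus * p_plus + m * m

# Arbitrary sample values (two transverse directions, as in four dimensions).
p_i = [0.3, -1.2]
p_plus = 0.7
mass = 1.5

p_minus = light_cone_hamiltonian(p_i, p_plus, mass)
residual = mass_shell_constraint(p_i, p_plus, p_minus, mass)
print(f"p^- = {p_minus:.6f}, constraint residual = {residual:.2e}")
```

Only the transverse momenta and $p^+$ enter as independent inputs, which reflects the fact that $p^-$ is no longer a dynamical variable once the constraint is solved.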

Let us now take the Fourier transform of the field with respect to $x^-$, so that
the field becomes $\phi_{p^+}(x_i)$. Then, the equation of motion for the field, which is
a function of only transverse coordinates, becomes

$$\left\{ i\frac{\partial}{\partial\tau} - H \right\} \phi_{p^+}(x_i) = 0 \tag{9.1.14}$$

and the new second quantized action is given by the Schrödinger-like equation

$$\int Dx_i \, dp^+ \, \phi^\dagger_{p^+}(x_i) \left\{ i\frac{\partial}{\partial\tau} - H \right\} \phi_{p^+}(x_i). \tag{9.1.15}$$

9.2 Light Cone String Field Theory 

Next, we make the transition to the free string and find that there is a remarkable 
correspondence between the point-particle and the string approach. In fact, at 





the free level, the equations can be practically transported from one to the other. 
(The major complication, we shall see, comes at the level of the interactions, 
which are highly nontrivial.) 

The field theory of strings is based on $\Phi(X)$, which is a functional, that is,
it is a function of every point $X_\mu(\sigma)$ along the string for all possible values of
$\sigma$. Thus, the expression $\Phi[X(\sigma)]$ is actually incorrect. The correct functional
dependence is given by

$$\Phi(X) = \Phi[X_\mu(\sigma_1), X_\mu(\sigma_2), \ldots, X_\mu(\sigma_N)], \tag{9.2.1}$$

where we let $N \to \infty$.

We can also decompose this string functional in any basis we wish. The 
most convenient basis contains Hermite polynomials. We can write 

$$\Phi(X) = \langle X | \Phi(x_0) \rangle, \tag{9.2.2}$$

where

$$|\Phi(x_0)\rangle = \phi(x_0)|0\rangle + A_\mu(x_0)\, a_1^{\mu\dagger}|0\rangle + g_{\mu\nu}(x_0)\, a_1^{\mu\dagger} a_1^{\nu\dagger}|0\rangle + \cdots, \tag{9.2.3}$$

and $x_0$ is the usual four vector representing ordinary space-time. Here, we
see the explicit decomposition of the field functional in terms of the tachyon
field $\phi(x_0)$, the Maxwell field $A_\mu(x_0)$, a massive graviton field $g_{\mu\nu}(x_0)$, etc.

Let us now construct free actions for the string field in each of the three gauge 
formalisms that we studied in Chapter 1. In the Gupta-Bleuler approach, we 
wish to impose the following conditions: 

$$P_\mu^2 + \frac{X_\mu'^2}{\pi^2} = 0, \qquad P_\mu X'^\mu = 0. \tag{9.2.4}$$

By taking the Fourier moments of these constraints, we arrive at

$$L_n|\Phi\rangle = 0, \qquad (L_0 - 1)|\Phi\rangle = 0. \tag{9.2.5}$$

We shall interpret the second of these constraints as the propagator of string
states, so our action becomes

$$S = \int DX_\mu \, \Phi(X)\{L_0 - 1\}\Phi(X), \tag{9.2.6}$$

subject to the constraint that $L_n \Phi(X) = 0$, and where

$$DX_\mu = \prod_\mu \prod_n dX_{\mu n} = \prod_\mu \prod_\sigma dX_\mu(\sigma). \tag{9.2.7}$$

Although the Gupta-Bleuler formalism is quite elegant, in actual practice, 
the elimination of the Virasoro constraints requires very difficult projection 
operators, whose complexity precludes their widespread application. 

The light cone gauge, because it has eliminated all ghost modes, does not 
suffer from this difficulty. In the light cone gauge, we start with the first 





quantized action 


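$$S = \int d\tau \, d\sigma \left[ P_\mu \dot{X}^\mu - \lambda \left( P_\mu^2 + \frac{X_\mu'^2}{\pi^2} \right) + \rho\, P_\mu X'^\mu \right]. \tag{9.2.8}$$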


As before, by eliminating $\lambda$, $\rho$, and $P_\mu$ via their equations of motion, we can
show that the action is equal to the area swept out by the string in Eq. (1.1.7).
We wish to impose

$$X^+ = p^+ \tau. \tag{9.2.9}$$

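While solving explicitly for the constraints [Eq. (9.2.9)], we find that, as before,
$P^-$ becomes the new Hamiltonian

$$H = \int d\sigma \, P^-(\sigma) = \frac{\pi}{2} \int d\sigma \left( P_i^2 + \frac{X_i'^2}{\pi^2} \right), \tag{9.2.10}$$

and the equation of motion therefore becomes

$$\left( i\frac{\partial}{\partial\tau} - H \right) \Phi_{p^+}(X_i) = 0, \tag{9.2.11}$$

and the free action becomes [1]:

$$\int DX_i \, dp^+ \, \Phi^\dagger_{p^+}(X_i) \left( i\frac{\partial}{\partial\tau} - H \right) \Phi_{p^+}(X_i). \tag{9.2.12}$$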

To generalize the light cone theory to interactions, however, requires a nontrivial
extension of our results for the free theory. Some of the pioneers in
quantum physics, such as Heisenberg and Yukawa, spent years trying to devise
a nonlocal quantum theory. However, the problem with nonlocal theories
is that they inevitably violate causality or relativity. When one vibrates one 
point in space, the interactions in these nonlocal theories travel faster than the 
speed of light. 

The light cone theory solves this perplexing question of maintaining both 
causality and relativity. The theory is not a nonlocal theory, in the usual sense, 
but it is a multilocal theory. Interactions do not violate causality or Lorentz 
covariance because strings break instantaneously, and the interactions travel 
down the strings at speeds less than or equal to the speed of light. 

The light cone interacting theory is based on the observation that open 
strings interact by breaking instantaneously at one point, or by reforming at 
their endpoints with other open strings. In Fig. 9.1(a), for example, we see the 
topology of the scattering of several strings in the light cone gauge, such that 
strings can only break in their interiors or reform at their endpoints. To actually 
see that this yields the usual $N$-point amplitudes, consider the conformal map
that takes the upper half-plane and maps it into the configuration in Fig. 9.1(b). 





FIGURE 9.1. Panels A and B.


We take the Mandelstam map [11]: 


$$\rho(z) = \sum_{i=1}^{N} \alpha_i \ln(z - z_i). \tag{9.2.13}$$

Let us now derive this conformal map, which takes us from the upper half $z$
plane to the complex $\rho = \tau + i\sigma$ plane. The simplest way is via the
Schwarz-Christoffel transformation.

We recall that the Schwarz-Christoffel equation transforms the z plane into 
a polygon. Specifically, we wish to map the real axis onto the perimeter of this 
polygon. 

For an $N$-sided polygon, let $z_i$ be $N$ points along the real $z$ axis. Each $z_i$ will
be mapped to a point $\rho_i$ in the $\rho$ plane, which corresponds to a corner of the
polygon. Let $\alpha_i$ equal the interior angle of the corner of the polygon at point $\rho_i$.
For a square, for example, this angle is equal to $\pi/2$. Then, the map that takes
us from the upper half $z$ plane to the complex $\rho$ plane is given by

$$\frac{d\rho(z)}{dz} = k \prod_{i=1}^{N} (z - z_i)^{\alpha_i/\pi - 1} \tag{9.2.14}$$

or

$$\rho(z) = k \int_{z_0}^{z} dz \prod_{i=1}^{N} (z - z_i)^{\alpha_i/\pi - 1}, \tag{9.2.15}$$





where we have the condition

$$\sum_{i=1}^{N} \alpha_i = (N - 2)\pi. \tag{9.2.16}$$

To understand the last condition, let us take the limit as $z \to \infty$. In this limit,
we wish the function $\rho$ to be finite. Since $d\rho/dz$ behaves as $z$ raised to the power
$\sum_i (\alpha_i/\pi - 1)$, the previous condition fixes this power to be $-2$, so that $\rho$
indeed approaches a finite constant.
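The angle condition can be illustrated with a short numerical check. This is a minimal sketch under the assumption that the exponents are exactly those of Eq. (9.2.14); when Eq. (9.2.16) holds they sum to $-2$, so $d\rho/dz$ falls off as $z^{-2}$ and $\rho$ stays finite at large $z$. The helper names are hypothetical.

```python
import math

def schwarz_christoffel_exponent_sum(interior_angles):
    """Sum of the exponents (alpha_i / pi - 1) appearing in Eq. (9.2.14)."""
    return sum(alpha / math.pi - 1.0 for alpha in interior_angles)

def satisfies_angle_condition(interior_angles, tol=1e-12):
    """Check Eq. (9.2.16): the interior angles sum to (N - 2) * pi."""
    n = len(interior_angles)
    return abs(sum(interior_angles) - (n - 2) * math.pi) < tol

square = [math.pi / 2] * 4      # four right angles
triangle = [math.pi / 3] * 3    # equilateral triangle

for name, angles in (("square", square), ("triangle", triangle)):
    ok = satisfies_angle_condition(angles)
    total = schwarz_christoffel_exponent_sum(angles)
    print(f"{name}: angle condition holds = {ok}, exponent sum = {total:.12f}")
```

Any set of angles obeying Eq. (9.2.16) gives the same exponent sum, independent of the number of corners.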

We notice that as $z \to z_i$, the mapping becomes

$$\frac{d\rho(z)}{dz} \sim k\,(z - z_i)^{\alpha_i/\pi - 1}. \tag{9.2.17}$$

Near $z_i$, let us assume that

$$z - z_i \sim \epsilon e^{i\theta} \tag{9.2.18}$$

for small $\epsilon$.

We see that a line on the real axis that approaches $z_i$, hops over it via a small
semicircle, and then continues along the real axis is mapped into a bent line in
the $\rho$ plane and rotated by angle $\pi(\alpha_i/\pi - 1)$, which forms an interior corner
of the polygon with angle $\alpha_i$, as desired.

Next, we will write the interaction Lagrangian for the light cone string field 
theory. For open strings, an examination of Fig. 9.2 shows that strings can join 
at their endpoints (or break at an interior point). The interaction Lagrangian is 
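thus a $\Phi^3$ term, with a Dirac delta function sandwiched in between [1]:

$$S_3 = \int \prod_{r=1}^{3} dp^+_r \, \delta\Big( \sum_r p^+_r \Big) \, \delta_{123} \, \Phi^\dagger(X_1)\, \Phi^\dagger(X_2)\, \Phi(X_3) + \text{h.c.}, \tag{9.2.19}$$

where

$$\delta_{123} = \prod_{i} \prod_{\sigma} \delta\big[ X_3(\sigma_3) - \theta(\pi\alpha_1 - \sigma)\, X_1(\sigma_1) - \theta(\sigma - \pi\alpha_1)\, X_2(\sigma_2) \big], \tag{9.2.20}$$

where the string variables are defined as

$$\begin{aligned}
\sigma_1 &= \sigma, & 0 &< \sigma < \pi\alpha_1, \\
\sigma_2 &= \sigma - \pi\alpha_1, & \pi\alpha_1 &< \sigma < \pi(\alpha_1 + \alpha_2), \\
\sigma_3 &= \pi(\alpha_1 + \alpha_2) - \sigma, & 0 &< \sigma < \pi(\alpha_1 + \alpha_2).
\end{aligned} \tag{9.2.21}$$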


FIGURE 9.2. 





with the condition $\sum_r \alpha_r = 0$. Using the formalism developed by Mandelstam
for light cone diagrams, we can then show that the above interaction is sufficient 
to derive most of the interacting amplitudes of string theory [11, 12]. 

The full open string theory, however, is more complicated than the closed 
string case in the light cone gauge. We necessarily must add four-string interactions
and higher point interactions to the open string action. If we let $\Phi$ ($\Psi$)
represent open (closed) strings, then the interactions for the open and closed
strings symbolically have the structures [1]:

$$\begin{aligned}
L_{\text{open}} &= \Phi^3 + \Phi^4 + \Phi^2\Psi + \Psi^3 + \Phi\Psi, \\
L_{\text{closed}} &= \Psi^3.
\end{aligned} \tag{9.2.22}$$

In other words, the open string vertex function by itself cannot generate all 
string amplitudes, so we must necessarily include closed strings as well. Thus, 
even if we started out with an open string theory without any gravitons, we 
find that gravitons necessarily creep back into the theory. There is no choice: 
string theory is by its very nature a theory of quantum gravity. 

There are several ways to see why the open string theory has five interactions 
and the closed string theory is cubic. The most direct way is to examine the 
string amplitudes to see if the postulated interactions reproduce the string 
theory. Let us take the real part of the Mandelstam map 

$$\text{Re}\, \rho(z) = \tau = \sum_{i=1}^{N} \alpha_i \ln |z - z_i|. \tag{9.2.23}$$

Notice that lines of equal $\tau$ correspond to equipotential lines created by charged
sources placed at $z_i$ with charges proportional to $\alpha_i$. However, lines of equal $\tau$
in the $\rho$ plane trace the topology of the interacting string. Thus, by graphically
examining the equipotential lines formed by charges placed on the perimeter 
of a circle or on the real line, we can trace the topology of interacting strings. 
The real part of the Mandelstam map is a map that takes the equipotential lines 
defined on a disk or the upper half-plane and maps them into the vertical lines 
in the $\rho$ plane.
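The electrostatic picture can be made concrete with a small sketch. It evaluates $\tau = \sum_i \alpha_i \ln|z - z_i|$ for three sources on the real axis and locates the point where $d\rho/dz = \sum_i \alpha_i/(z - z_i)$ vanishes, which is where the equipotential lines cross and the strings join or split. The closed-form root assumes $\alpha_1 + \alpha_2 + \alpha_3 = 0$, so the quadratic term in the numerator cancels; the sample charges and positions are hypothetical.

```python
import math

def tau(z, sources):
    """Re rho(z) = sum_i alpha_i * ln|z - z_i| for a list of (alpha_i, z_i)."""
    return sum(alpha * math.log(abs(z - z_i)) for alpha, z_i in sources)

def drho_dz(z, sources):
    """Derivative of the Mandelstam map: sum_i alpha_i / (z - z_i)."""
    return sum(alpha / (z - z_i) for alpha, z_i in sources)

def three_string_interaction_point(sources):
    """Zero of d rho / dz for three sources whose alphas sum to zero.

    Multiplying d rho / dz by prod_i (z - z_i) gives a polynomial whose z^2
    coefficient is alpha_1 + alpha_2 + alpha_3 = 0, leaving a linear equation.
    """
    (a1, z1), (a2, z2), (a3, z3) = sources
    linear = a1 * (z2 + z3) + a2 * (z1 + z3) + a3 * (z1 + z2)
    constant = a1 * z2 * z3 + a2 * z1 * z3 + a3 * z1 * z2
    return constant / linear

# Two incoming strings (positive alpha) and one outgoing string (negative alpha).
sources = [(1.0, 0.0), (2.0, 1.0), (-3.0, 3.0)]
z_star = three_string_interaction_point(sources)
print(f"interaction point z* = {z_star:.6f}")
print(f"d rho/dz at z*       = {drho_dz(z_star, sources):.2e}")
print(f"interaction time tau = {tau(z_star, sources):.6f}")
```

For these sample values the interaction point lies on the real axis between the first two sources, consistent with two strings joining at their endpoints.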

By examining Fig. 9.3, we see that all five interactions must be present in 
the interacting string action. It is not hard to write the explicit form for all five 
interactions, since each is given by a Dirac delta function that describes the 
topological change described by the interaction. Thus, there is a finite region 
of moduli space that generates equipotential lines that cannot be described by 
three-string open vertices. This means that the integration region of the Koba-Nielsen
variables $z_i$ cannot be filled completely if we use only three-string
vertices, that is, there are missing regions of the integration region that can 
only be filled by postulating four-string and higher interactions. 

However, the closed string situation is dramatically different. We find that 
with cubic interactions alone, we can completely fill the integration region 
of the Shapiro-Virasoro amplitude. This is highly nontrivial because it gives 
us the first triangulation of moduli space in over a century, thereby solving a 
long-standing mathematical problem dating back to Riemann. The amplitudes 






FIGURE 9.3. 


generated by the light cone string theory have automatically subtracted the 
redundancy due to the mapping class group. 

It may seem strange that the open and closed string light cone interactions 
have such different characteristics in string field theory. We will see that this 
situation becomes much more complicated with midpoint interactions, and that 
the closed string theory becomes nonpolynomial. One suspects that there must 
be a deeper, group-theoretical reason why open and closed interactions have 
such startlingly different structures.


9.3 Free BRST Action 

The third method of passing from a first to a second quantized action is via 
the BRST method. Instead of quantizing the action expressed as the length of 
a point world line, we will quantize the following action instead: 

$$S = \int d\tau \left( e^{-1} \dot{x}_\mu^2 - e m^2 \right), \tag{9.3.1}$$





which is invariant under reparametrizations, given by

$$\delta x_\mu = \epsilon \dot{x}_\mu, \qquad \delta e = \frac{d(\epsilon e)}{d\tau}, \tag{9.3.2}$$

where $e$ is a one-dimensional metric tensor on the world line of a moving
point-particle. We will choose the gauge

$$e = 1 \tag{9.3.3}$$


and calculate the Faddeev-Popov ghost action that emerges from this choice. 
The Faddeev-Popov determinant is given by 


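$$\Delta_{FP} = \det|\partial_\tau| = \int D\theta \, D\bar{\theta} \, \exp\left( i \int d\tau \, \bar{\theta} \partial_\tau \theta \right). \tag{9.3.4}$$

Then, we calculate the BRST operator

$$Q = \theta(\partial_\mu^2 - m^2), \tag{9.3.5}$$

which must be applied onto physical states

$$Q|\phi(x, \theta)\rangle = 0. \tag{9.3.6}$$

Solving this constraint, one is left once again with only physical states. So, a
natural choice for the BRST invariant action is given by

$$\int Dx_\mu \, d\theta \, d\bar{\theta} \; \phi\, Q\, \phi, \tag{9.3.7}$$

which is invariant under

$$\delta\phi = Q\Lambda \tag{9.3.8}$$

because $Q$ is nilpotent.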

Now, let us generalize our discussion to strings. Let us first take the 
conformal gauge 


$$g_{ab} = \delta_{ab} \tag{9.3.9}$$

which eliminates reparametrization invariance and also local scale invariance. 
The Faddeev-Popov ghost factor can be exponentiated by introducing two sets 
of anticommuting ghosts, b and c. The resulting first quantized action becomes 

$$L = \frac{1}{\pi}\left( \partial_z X_\mu \partial_{\bar z} X^\mu + b\,\partial_{\bar z} c + \bar{b}\,\partial_z \bar{c} \right) \tag{9.3.10}$$

which possesses a global BRST symmetry. Its generator is given by Q, so that 
the physical states must satisfy 

$$Q|\Phi(X, b, c)\rangle = 0. \tag{9.3.11}$$

At this point, there are two possible BRST actions. The first is the straightforward
generalization of the Gupta-Bleuler formalism, where we have
[13-16]:

$$L = \langle \Phi(X, b, c) | (L_0^{X} + L_0^{\text{gh}} - 1) | \Phi(X, b, c) \rangle, \tag{9.3.12}$$





where we have explicitly split the $L_0$ operator in terms of its string and ghost
oscillators, and where $|\Phi(X, b, c)\rangle$ can have any ghost number.

However, there is also a second BRST formalism in which the gauge
invariance is manifest. Let us choose the action [2]:

$$S = \int DX \, Db \, Dc \, D\bar{b} \, D\bar{c} \; \Phi Q \Phi, \tag{9.3.13}$$

where $\Phi$ has fixed ghost number $-\frac{1}{2}$. This field is therefore a truncation of the
field in the previous BRST formalism.

The advantage of this second approach is that one can see explicitly the 
gauge invariance of the theory. The theory is invariant under 

$$\delta\Phi = Q\Lambda \tag{9.3.14}$$


because Q is nilpotent. 

To analyze the states within $|\Phi\rangle$, let us write the double vacuums $|\pm\rangle$ of the
zero modes of the $b$ and $c$ oscillators

$$c_0|+\rangle = 0, \qquad b_0|-\rangle = 0, \tag{9.3.15}$$

where $|+\rangle = c_0|-\rangle$. Then, the field $|\Phi\rangle$ can be decomposed into two pieces

$$|\Phi\rangle = \psi|-\rangle + \phi|+\rangle, \tag{9.3.16}$$

where $\psi$ and $\phi$ have no zero modes.

Let us now gauge fix the BRST gauge invariant theory in order to obtain 
the other two gauge fixed formalisms. We will first show that the BRST gauge 
invariant theory is, in fact, equivalent on shell to the light cone theory, which 
is defined totally in terms of physical transverse states [17, 18]. Also, we will 
discuss covariant gauges as well. 

We begin by noting that any operator $E$ that can be written as a BRST
commutator

$$E = [Q, S]_\pm \tag{9.3.17}$$

automatically vanishes on the BRST invariant states of the covariant field
theory. To see this, we note that a BRST invariant state is one in which

$$Q|\Phi\rangle = 0, \qquad |\Phi\rangle \neq Q|\Lambda\rangle \ \text{for all } |\Lambda\rangle. \tag{9.3.18}$$

Let us now apply $E$ onto a BRST invariant state

$$E|\Phi\rangle = QS|\Phi\rangle \mp SQ|\Phi\rangle = QS|\Phi\rangle = Q|\Lambda\rangle \sim 0. \tag{9.3.19}$$


Thus, E annihilates BRST invariant states, modulo a gauge transformation. 
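The algebra behind this statement can be seen in a finite-dimensional toy model. The sketch below is purely illustrative and is not the string BRST operator: it assumes a $2 \times 2$ nilpotent matrix for $Q$, an arbitrary matrix for $S$, and takes $E$ to be their anticommutator. For any state annihilated by $Q$, the result $E|\Phi\rangle$ then equals $Q$ acting on $S|\Phi\rangle$, that is, a pure gauge state.

```python
def matmul(a, b):
    """Product of two square matrices stored as lists of rows."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def matadd(a, b):
    """Entrywise sum of two matrices."""
    return [[x + y for x, y in zip(row_a, row_b)] for row_a, row_b in zip(a, b)]

def matvec(a, v):
    """Matrix acting on a column vector stored as a list."""
    return [sum(a_ij * v_j for a_ij, v_j in zip(row, v)) for row in a]

# Toy nilpotent "BRST charge" and an arbitrary operator S (both hypothetical).
Q = [[0, 1],
     [0, 0]]
S = [[2, -1],
     [3, 5]]

# E = QS + SQ, the anticommutator version of Eq. (9.3.17).
E = matadd(matmul(Q, S), matmul(S, Q))

phi = [1, 0]                 # satisfies Q phi = 0
gauge_parameter = matvec(S, phi)

print("Q^2         =", matmul(Q, Q))
print("Q phi       =", matvec(Q, phi))
print("E phi       =", matvec(E, phi))
print("Q (S phi)   =", matvec(Q, gauge_parameter))
```

The last two printed vectors coincide, which is the content of Eq. (9.3.19): on $Q$-closed states, $E$ produces only $Q$-exact states.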

Now, we use the fact that there exists an operator of this type that can be 
written as 


$$E = N_T - N, \tag{9.3.20}$$





where $N_T$ is the level number of a transverse state and $N$ is the total level number.
The statement that

$$E|\Phi\rangle = 0 \tag{9.3.21}$$

means that $N_T = N$ on such states or that $|\Phi\rangle$ is purely transverse. Our task,
therefore, is to explicitly construct operators $E$ and $S$ that satisfy both Eqs.
(9.3.17) and (9.3.20).

Let us define the following operators: 


$$D_n = \oint \frac{dz}{2\pi i z} \, \frac{z^n}{k \cdot P(z)}, \tag{9.3.22}$$

where $P^\mu(z) = \sqrt{\pi}\, i z\, (d/dz)\, X^\mu(-i \ln z, 0)$. Then, it is possible to show the
following explicit expressions for $E$ and $S$:


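$$\begin{aligned}
E &= (D_0 - 1)L_0 + \sum_{n=1}^{\infty} (D_{-n}L_n + L_{-n}D_n) + \sum_{n=1}^{\infty} n\,(c_{-n}\bar{c}_n + \bar{c}_{-n}c_n), \\
S &= \sum_{n=-\infty}^{\infty} \bar{c}_{-n}(D_n - \delta_{n0}).
\end{aligned} \tag{9.3.23}$$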

One can prove that $E$ can be written in terms of transverse operators, as in
Eq. (9.3.20), as well as in terms of $D_n$. Notice that both forms of $E$ obey the
following commutation relations:

$$\begin{aligned}
[L_m, E] &= m L_m, \\
[D_m, E] &= -m D_m, \\
[V^i_m, E] &= 0,
\end{aligned} \tag{9.3.24}$$

where $V^i_m$ is an operator that creates or destroys transverse states. However, it
can be shown that $\{D_m, L_m, V^i_m\}$ is a complete set of operators for the Fock
space, and hence the two expressions for $E$ in Eqs. (9.3.20) and (9.3.23) must
be the same.

To apply this argument to string field theory, we notice that we can split the
field $|\Phi\rangle$ into two pieces

$$|\Phi\rangle = |\Phi\rangle_T + |\Phi\rangle_L, \tag{9.3.25}$$

where we have split the field into transverse ($T$) and longitudinal and ghost
($L$) sectors. By the gauge invariance $\delta|\Phi\rangle = Q|\Lambda\rangle$, we can (up to states that
vanish on shell) choose a gauge that removes the longitudinal and ghost states
$|\Phi\rangle_L$.

Thus, the only part that is left after gauge fixing is

$$\int DX_i \, Dc^0 \; \langle \Phi_T | c^0 (L_0 - 1)_T | \Phi \rangle_T \tag{9.3.26}$$

up to terms that vanish on shell. The integration over the ghost c° is trivial, so 
we are left with the usual light cone action in Eq. (9.2.12). (In the proof, we 





had to throw away pieces that vanished on shell. This means that off shell the 
BRST and light cone theories are actually different. However, this makes no 
difference to our discussion because we only need the equivalence on shell. 
Thus, when we calculate the expectation value between sets of BRST invariant 
asymptotic states at infinity, the actions for the BRST and light cone theory 
are different off shell, but they produce the same on shell matrix elements.) 

Last, we wish to show that we can choose a covariant gauge so that the gauge 
invariant BRST action [Eq. (9.3.13)] becomes the gauge-fixed BRST action 
[Eq. (9.3.12)]. Let us choose the Siegel gauge [13]: 

$$b_0|\Phi\rangle = 0. \tag{9.3.27}$$

This eliminates about half the states within the field in Eq. (9.3.16). 

However, whenever we fix any local gauge invariance, we must also add the 
Faddeev-Popov determinant factor. The Faddeev-Popov ghost determinant 
factor can be written as 


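$$\langle \Lambda | b_0 Q | \Lambda \rangle \tag{9.3.28}$$

which arises from the gauge invariance $\delta|\Phi\rangle = Q|\Lambda\rangle$. Since $|\Phi\rangle$ has ghost
number $-\frac{1}{2}$ and $Q$ has ghost number 1, this means that $|\Lambda\rangle$ has ghost number
$-\frac{3}{2}$ and $\langle\Lambda|$ has ghost number $\frac{3}{2}$.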

Notice, however, that this ghost action, in turn, has its own gauge invariance 

$$\delta|\Lambda\rangle = Q|\Lambda'\rangle \tag{9.3.29}$$

which means that we must add yet another ghost term to the action, with ghost
number equal to $-\frac{5}{2}$. In fact, every gauge fixing, in turn, yields yet another
ghost action with a gauge invariance. This is the “ghosts-for-ghosts” effect, 
which introduces an infinite number of fields with differing values of the ghost 
number [19]. 
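The ghost number bookkeeping of this tower can be sketched in a few lines. The only inputs assumed are the ones stated in the text: the string field has ghost number $-\frac{1}{2}$, $Q$ raises ghost number by 1, and each gauge invariance has the form $\delta|\cdot\rangle = Q|\cdot'\rangle$. The function name is a hypothetical choice.

```python
from fractions import Fraction

def gauge_parameter_tower(field_ghost_number, q_ghost_number, depth):
    """Ghost numbers of successive gauge parameters Lambda, Lambda', ...

    Each level obeys gh(parameter) + gh(Q) = gh(previous field), because the
    variation of a field must carry the same ghost number as the field itself.
    """
    tower = []
    current = field_ghost_number
    for _ in range(depth):
        current = current - q_ghost_number
        tower.append(current)
    return tower

tower = gauge_parameter_tower(Fraction(-1, 2), Fraction(1), depth=4)
print([str(g) for g in tower])
```

Each further level of gauge fixing lowers the ghost number by one more unit, so the tower never terminates, which is why a field of arbitrary ghost number is needed.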

The net effect of all of this is simple: we can introduce a single field $|\Phi\rangle$,
which has arbitrary ghost number, such that the gauge fixed action is

$$\langle \Phi | (L_0^{X} + L_0^{\text{gh}} - 1) | \Phi \rangle \tag{9.3.30}$$

which is just the gauge fixed action found earlier from Gupta-Bleuler 
quantization in Eq. (9.3.12). 


9.4 Interacting BRST String Field Theory 

We have seen the remarkable economy of string field theory at the free level. 
The entire theory of free open bosonic strings can be encapsulated into one 
simple action $\langle\Phi|Q|\Phi\rangle$. However, the situation with interactions is considerably
more complicated. As we mentioned, the first quantized string theory
summed over the set of all conformally inequivalent topologies. This conveniently
concealed many difficult questions concerning how to place coordinates





on Riemann surfaces. The principal problem is that, until recently, mathematicians
have been unable to successfully triangulate moduli space for genus $g$
Riemann surfaces, even after a century of experience with these surfaces. Remarkably,
string field theory gives an exact triangulation of moduli space, thus
solving a long-standing mathematical problem. 

Let us begin our discussion by first requiring that open string field theory 
be a gauge theory that satisfies the axioms of gauge theory. Specifically, we 
need to postulate the existence of a derivative Q and a product operation *. 
We postulate the following five axioms due to Witten [2]: 

(1) The existence of a nilpotent derivative $Q$ such that $Q^2 = 0$.

(2) The associativity of the $*$ product:

$$[A * B] * C = A * [B * C]. \tag{9.4.1}$$

(3) The Leibniz rule:

$$Q[A * B] = QA * B + (-1)^{|A|} A * QB. \tag{9.4.2}$$

(4) The product rule:

$$\int A * B = (-1)^{|A||B|} \int B * A. \tag{9.4.3}$$

(5) The integration rule:

$$\int QA = 0, \tag{9.4.4}$$

where $(-1)^{|A|}$ is $-1$ if $A$ is Grassmann odd and $+1$ if $A$ is Grassmann even.
We postulate that the field A has the following transformation rule: 

$$\delta A = Q\Lambda + A * \Lambda - \Lambda * A. \tag{9.4.5}$$

Then we can construct a curvature form given by

$$F = QA + A * A \tag{9.4.6}$$

such that

$$\delta F = F * \Lambda - \Lambda * F. \tag{9.4.7}$$

It is easy therefore to show that the following is a total derivative:

$$\int F * F = \int Q\left[ A * QA + \tfrac{2}{3} A * A * A \right]. \tag{9.4.8}$$

Therefore, the Chern-Simons form is gauge invariant [2]:

$$L = A * QA + \tfrac{2}{3} A * A * A. \tag{9.4.9}$$

The Chern-Simons form is preferable to the usual $F^2$ form found in ordinary
gauge theory, because $Q$ already has two derivatives contained within it.






This formalism works for any gauge theory, not just strings. Our task is to 
find a multiplication operation that satisfies the postulates of the * product. 
Then, gauge invariance is automatic, without any more work. 

We notice, first of all, that the * operation is symmetric in all three strings. 
There is only one unique configuration that is symmetrical in all three fields, 
and that is given in Fig. 9.4, where the midpoint of the strings has been singled 
out. 

The multiplication operation 

$$|X_3\rangle = |X_1\rangle * |X_2\rangle \tag{9.4.10}$$

simply means that we have exchanged the Fock spaces of strings 1 and 2 for 
string 3, such that the points along 1 and 2 have been identified with points 
along string 3. In analogy with Eq. (9.2.19) in the light cone theory, we will 
define the triple product (without ghosts) as a delta function 

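$$\begin{aligned}
\Phi * \Phi * \Phi = {} & \int DX_1 \, DX_2 \, DX_3 \; \Phi(X_1)\Phi(X_2)\Phi(X_3) \\
& \times \prod_{r=1}^{3} \prod_{0 \le \sigma_r \le \pi/2} \delta\big[ X_{r,\mu}(\sigma_r) - X_{r-1,\mu}(\pi - \sigma_r) \big].
\end{aligned} \tag{9.4.11}$$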

Let us now write the ghost number for all the operators in the theory. The 
c ghost has ghost number 1, the b ghost has ghost number —1, so that Q has 
ghost number 1. This, in turn, fixes the ghost number of the $A$ field to be $-\frac{1}{2}$,
since the action contains a term $\langle A|Q|A\rangle$, which must have total ghost number
0.

The ghost number of the gauge parameter $\Lambda$ and the $*$ operation can be fixed
by observing the variation of the $A$ field in Eq. (9.4.5). In order for the left-hand
side (with ghost number $-\frac{1}{2}$) to equal the ghost number of the right-hand side,
the ghost number of $\Lambda$ must be $-\frac{3}{2}$ and the ghost number of the $*$ operation
must be $+\frac{3}{2}$.

Similarly, we can fix the ghost number of the $\int$ operation by demanding
that the action have total ghost number zero. Putting everything together, we 





have the following set of ghost numbers [2]: 


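$$c: 1, \quad b: -1, \quad Q: 1, \quad A: -\tfrac12, \quad *: \tfrac32, \quad \int: -\tfrac32, \quad \Lambda: -\tfrac32.$$ (9.4.12) 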
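
The ghost-number bookkeeping can be checked mechanically. The following is a minimal sketch in Python, assuming the assignments $c: 1$, $b: -1$, $Q: 1$, $A: -\tfrac12$, $*: \tfrac32$, $\int: -\tfrac32$, $\Lambda: -\tfrac32$; the dictionary keys and the helper `total` are hypothetical names introduced only for this illustration.

```python
from fractions import Fraction as F

# Ghost numbers assumed from the discussion above; key names are illustrative.
GHOST = {
    "c": F(1), "b": F(-1), "Q": F(1),
    "A": F(-1, 2), "star": F(3, 2), "integral": F(-3, 2), "Lambda": F(-3, 2),
}

def total(*symbols):
    """Add up the ghost numbers of the listed symbols."""
    return sum((GHOST[s] for s in symbols), F(0))

# Kinetic term: integral of A * Q A
kinetic = total("integral", "A", "star", "Q", "A")
# Cubic term: integral of A * A * A
cubic = total("integral", "A", "star", "A", "star", "A")
# Each term in the variation of A must carry the ghost number of A itself.
variation_terms = {
    "Q Lambda": total("Q", "Lambda"),
    "A * Lambda": total("A", "star", "Lambda"),
}

print("kinetic:", kinetic, "cubic:", cubic)
print({name: str(value) for name, value in variation_terms.items()})
```

Both action terms come out with total ghost number zero, and both terms of the gauge variation carry the ghost number of $A$, which is the consistency requirement used in the text.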

To enforce these ghost numbers poses no problem. On the $A$ field, this means 
taking all possible sums of monomials constructed out of the $c$ and $b$ oscillators 
acting on the vacuum and then projecting out the $-\tfrac12$ ghost number part. 

More difficult is the $*$ operation. In addition to the Dirac delta functional for 
the string variable $X_\mu$ in Eq. (9.4.11), we must also include the ghost part. The major 
complication (which becomes more severe as we progress to the superstring) 
is that there must be ghost insertion operators placed at the midpoint of the 
vertex function. This is because there is an anomaly in the ghost current. 

There are two ways to correct for this, depending on whether we bosonize 
the ghost fields or not. If we bosonize the ghosts, we must note that the 
energy-momentum tensor for the bosonized field has a screening charge. To see this, 
let us bosonize the $b, c$ ghost system with a weight 0 field $\sigma$, with 

$$c = {:}e^{\sigma}{:}, \qquad b = {:}e^{-\sigma}{:}.$$ (9.4.13) 

Let us calculate the energy-momentum tensor for this field, repeating our 
earlier discussion in Eqs. (3.1.29)-(3.1.33). We have 

$$T(z) = -\tfrac12\,\partial\sigma\,\partial\sigma + k\,\partial^2\sigma$$ (9.4.14) 

and we take its operator product expansion to calculate its central charge 

$$T(w)T(z) \sim 2\left(\tfrac12\right)^2\langle\partial_w\sigma\,\partial_z\sigma\rangle^2 + k^2\langle\partial_w^2\sigma\,\partial_z^2\sigma\rangle + \cdots.$$ (9.4.15) 

So, the central charge is 

$$c = 1 + 12k^2.$$ (9.4.16) 

The central charge for the $b, c$ system is $-26$, so we have $k^2 = -\tfrac94$. This 
gives us an imaginary value of $k$, which we can rectify by reversing the overall 
sign of the kinetic energy term. Thus, the final energy-momentum tensor is 

$$T(z) = +\tfrac12\,\partial\sigma\,\partial\sigma + \tfrac32\,\partial^2\sigma.$$ (9.4.17) 
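
The arithmetic behind the value of $k$ can be spelled out in a few lines. This is a hedged sketch, assuming that reversing the sign of the kinetic term flips the sign of the propagator and therefore turns $c = 1 + 12k^2$ into $c = 1 - 12k^2$; the function names `k_squared` and `exact_sqrt` are hypothetical.

```python
from fractions import Fraction
from math import isqrt

def k_squared(central_charge, kinetic_sign=+1):
    """Solve c = 1 + kinetic_sign * 12 * k**2 for k**2 (kinetic_sign is +1 or -1)."""
    return kinetic_sign * Fraction(central_charge - 1, 12)

def exact_sqrt(value):
    """Exact square root of a non-negative Fraction that is a perfect square."""
    root = Fraction(isqrt(value.numerator), isqrt(value.denominator))
    assert root * root == value, "value is not a perfect square"
    return root

# Original sign convention: k**2 is negative, so k would be imaginary.
k2_original = k_squared(-26)
# Assumed convention after reversing the kinetic term: k is real.
k2_flipped = k_squared(-26, kinetic_sign=-1)
k_flipped = exact_sqrt(k2_flipped)

print(k2_original, k2_flipped, k_flipped)
```

With $c = -26$ the first convention gives a negative $k^2$, while the flipped convention gives a real $k$, matching the coefficient of the linear term in the final energy-momentum tensor.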

We have a linear term in the energy-momentum tensor for the ghost field. 
This, in turn, means that the action itself, in terms of the bosonized field, must 
contain a linear term as well. Because we will be concerned with world sheets 
that have curvature singularities in them at the points where strings break, we 
will need the fully generally covariant generalization of this formalism. The 
full energy-momentum tensor of the bosonized ghost system, with general 
covariance put back in, is 

$$T_{ab} = \partial_a\sigma\,\partial_b\sigma + \tfrac12\,g_{ab}(\partial\sigma)^2 - \tfrac32\,(\partial_a\partial_b - g_{ab}\partial^2)\sigma.$$ (9.4.18) 






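Our next task is to calculate the action that yields this energy-momentum 
tensor $T_{ab}$, via the definition 

$$\delta S = \frac{1}{2\pi}\int d^2\xi\,\sqrt{g}\,\delta g^{ab}\,T_{ab}.$$ (9.4.19) 

The final action is not too difficult to find. We recall from general relativity 
how to take the variation of the curvature tensor 

$$\delta\int d^2\xi\,\sqrt{g}\,R\,\sigma = -\int d^2\xi\,\sqrt{g}\,\delta g^{ab}\bigl[(\partial_a\partial_b - g_{ab}\partial^2)\sigma\bigr] + \cdots.$$ (9.4.20) 

Therefore, our final result for the action is given by [20]: 

$$S = \frac{1}{2\pi}\int d^2\xi\,\sqrt{g}\,g^{ab}\,\partial_a\sigma\,\partial_b\sigma + \frac{3}{2\pi}\left[\frac12\int d^2\xi\,\sqrt{g}\,R\,\sigma + \int dl\,k\,\sigma\right],$$ (9.4.21) 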


where we have explicitly added the contribution from the boundary of the surface, 
and where $k$ is the extrinsic geodesic curvature of the boundary and $dl$ is the 
line element along this boundary. 

If $\sigma$ is a constant, then the term linear in $\sigma$ reduces to a multiple of the 
Euler number of the surface 

$$\chi = \frac{1}{2\pi}\left[\frac12\int d^2\xi\,\sqrt{g}\,R + \int dl\,k\right].$$ (9.4.22) 

The point of going through this exercise is that the curvature $R$ is zero for 
most of the surface. However, for the breaking point of three strings, $R$ has a 
delta function singularity, and the Euler number is equal to $-\tfrac12$. Thus, for a 
self-consistent ghost system, we must insert an extra factor [2]: 

$$e^{3\sigma/2}$$ (9.4.23) 


into the vertex function precisely at the midpoint. Then, the rest of the ghost part 
of the vertex function is a Dirac delta functional, just as for the string variable, 
representing the continuity across the vertex. The modification of Eq. (9.4.11) 
therefore involves inserting Eq. (9.4.23) at the midpoint and multiplying by 
the Dirac delta functional for the $\sigma$ field (in analogy with the three strings). 
A careful analysis of the resulting vertex function shows that it satisfies the 
correct properties of multiplication and that we can successfully reproduce the 
Veneziano model [21-14]. 


9.5 Four-Point Amplitude 


We now begin a discussion of constructing the four-point Veneziano amplitude 
in Witten's string field theory. The four-point scattering amplitude will have the 
geometry shown in Fig. 9.5. We wish to show that the scattering amplitude 
given by the field theory 

$$A_4(s,t) = \int_0^\infty d\tau\,\langle V_{125}|\,b_0\,e^{-\tau(L_0-1)}\,|V_{534}\rangle + (s \leftrightarrow t)$$ (9.5.1) 



294 9. String Field Theory 




gives us back the familiar Veneziano formula. (Note the insertion of the factor 
$b_0$ in the propagator, which arises when we choose the $b_0\Phi = 0$ gauge.) 

In order to perform this calculation, we need several ingredients, including: 

(1) the conformal map taking us from the upper half-plane to the world sheet 
of the string scattering; 

(2) the Jacobian of the transformation from $\tau$, defined on the string world 
sheet, to $x$, the Koba-Nielsen variable; and 

(3) the ghost contribution. 

First, let us calculate the conformal map, which will take us from the upper 
half-plane to the configuration shown in Fig. 9.5. In contrast to the light cone 
gauge, we need a conformal map that has a Riemann cut, as in the case of the 
three-vertex function. Using the Schwarz-Christoffel transformation, we find 
that the following map has the desired properties [21]: 

$$\frac{dw}{dz} = \frac{N}{2}\,\frac{\sqrt{z^2+\gamma^2}\,\sqrt{z^2+\delta^2}}{(z^2-\alpha^2)(z^2-\beta^2)},$$ (9.5.2) 





where the Riemann cut goes from $i|\gamma|$ to $-i|\gamma|$ and from $\pm i|\delta|$ to $\pm\infty$, as 
shown in the figure. 

Now, let us place boundary conditions on the map so that all external strings 
have equal string lengths: 

(1) In order that the strip width at A equal that at B, we demand 

$$\alpha\beta = \gamma\delta.$$ (9.5.3) 

(2) The strip width at A must be $\pi$. This gives, near $z = \alpha$, 

$$\frac{dw}{dz} \sim \frac{1}{z-\alpha},$$ (9.5.4) 

which gives us the normalization of $N$: 

$$N = 2\alpha\,\frac{\beta^2-\alpha^2}{\sqrt{\alpha^2+\gamma^2}\,\sqrt{\alpha^2+\delta^2}}.$$ (9.5.5) 


(3) The segment FE has half the length of the strip at A. This condition is a 
bit more difficult, because it requires us to actually perform the integration 
from $(0, i\gamma)$ to $(i\gamma, i\delta)$. 

The integral in question requires the theory of first and third elliptic 
integrals. For example, we have the standard definition of a first elliptic 
integral 

$$\int_0^{y}\frac{dt}{\sqrt{(1-t^2)(1-k^2t^2)}} = \int_0^{\varphi}\frac{d\theta}{\sqrt{1-k^2\sin^2\theta}} = \int_0^{u_1}du = u_1 = \mathrm{sn}^{-1}(y,k) = F(\varphi,k),$$ (9.5.6) 


where $y = \sin\varphi$ and $\varphi = \mathrm{am}\,u_1$. 

The first complete elliptic integral is given by the definite integral 

$$K(k) = \int_0^{\pi/2}\frac{d\theta}{\sqrt{1-k^2\sin^2\theta}}, \qquad k^2 = \frac{\gamma^2}{\delta^2}, \qquad k'^2 = 1-k^2.$$ (9.5.7) 
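
For orientation, the complete elliptic integral of the first kind is easy to evaluate numerically. The sketch below is an illustration only: it uses the classical arithmetic-geometric mean identity $K(k) = \pi / (2\,\mathrm{AGM}(1, k'))$ and cross-checks it against a direct midpoint-rule evaluation of the defining integral. The function names and the sample modulus are assumptions made for this example, not values taken from the text.

```python
import math

def K_agm(k):
    """Complete elliptic integral of the first kind via the arithmetic-geometric mean."""
    a, b = 1.0, math.sqrt(1.0 - k * k)   # b is the complementary modulus k'
    for _ in range(60):                   # AGM converges quadratically
        if abs(a - b) <= 1e-15 * a:
            break
        a, b = 0.5 * (a + b), math.sqrt(a * b)
    return math.pi / (2.0 * a)

def K_quad(k, n=2000):
    """The same integral from its definition, using the midpoint rule on [0, pi/2]."""
    h = (math.pi / 2.0) / n
    total = 0.0
    for i in range(n):
        theta = (i + 0.5) * h
        total += 1.0 / math.sqrt(1.0 - (k * math.sin(theta)) ** 2)
    return total * h

k_sample = 0.5   # hypothetical modulus, standing in for gamma/delta
print(K_agm(k_sample), K_quad(k_sample))
```

The two evaluations agree to many digits, and $K(0) = \pi/2$ is recovered in the limit of vanishing modulus.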

The integral of Eq. (9.5.2) for boundary values in (3) is given by 
7i N f8 2 -y 2 \ f K(k) 

2 = -y(^r )l 

x _1_1_1 

_ 1 — rj\sn 2 (k, u) 1 — r)\sn 2 (k, u) J ’ 


where 




y 2 p 2 + 5 2 


2 


8 2 P 2 + y 2 ' 

Performing the integral, the condition reduces to 
$$\tfrac12 = \Lambda_0(\theta_1, k) - \Lambda_0(\theta_2, k),$$ 


(9.5.8) 


(9.5.9) 


(9.5.10) 





where 

$$\sin^2\theta_1 = \frac{\beta^2}{\beta^2+\gamma^2}, \qquad \sin^2\theta_2 = \frac{\alpha^2}{\alpha^2+\gamma^2},$$ (9.5.11) 

and where $\Lambda_0$ is Heuman's lambda function given by 


$$\Lambda_0(\beta, k) = \frac{2}{\pi}\bigl[E(k)F(\beta, k') + K(k)E(\beta, k') - K(k)F(\beta, k')\bigr].$$ (9.5.12) 

(4) The segment DE has length $\tau$. Once again, we must perform the integral 
of Eq. (9.5.2), which yields 

$$\tau/2 = K(k')\bigl[Z(\theta_2, k') - Z(\theta_1, k')\bigr],$$ (9.5.13) 

where the Jacobi zeta function is given by 

$$Z(\beta, k) = E(\beta, k) - (E/K)\,F(\beta, k).$$ (9.5.14) 
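
The two special functions entering these conditions can be evaluated by direct quadrature. The following is a minimal sketch, assuming the standard definitions of Heuman's lambda function and the Jacobi zeta function in terms of the Legendre integrals $F$ and $E$; the Simpson-rule helper and all function names are hypothetical, and the modulus used is a sample value rather than one taken from the map. At amplitude $\pi/2$, Legendre's relation forces $\Lambda_0 = 1$, and the zeta function vanishes, which provides a simple check.

```python
import math

def _simpson(f, a, b, n=2000):
    """Composite Simpson rule with an even number n of subintervals."""
    h = (b - a) / n
    s = f(a) + f(b)
    for i in range(1, n):
        s += (4 if i % 2 else 2) * f(a + i * h)
    return s * h / 3.0

def ell_F(phi, k):
    """Incomplete elliptic integral of the first kind, F(phi, k)."""
    return _simpson(lambda t: 1.0 / math.sqrt(1.0 - (k * math.sin(t)) ** 2), 0.0, phi)

def ell_E(phi, k):
    """Incomplete elliptic integral of the second kind, E(phi, k)."""
    return _simpson(lambda t: math.sqrt(1.0 - (k * math.sin(t)) ** 2), 0.0, phi)

def heuman_lambda0(beta, k):
    """Heuman's lambda function built from complete and incomplete integrals."""
    kp = math.sqrt(1.0 - k * k)           # complementary modulus k'
    K, E = ell_F(math.pi / 2, k), ell_E(math.pi / 2, k)
    return (2.0 / math.pi) * (E * ell_F(beta, kp) + K * ell_E(beta, kp) - K * ell_F(beta, kp))

def jacobi_zeta(beta, k):
    """Jacobi zeta function Z(beta, k) = E(beta, k) - (E/K) F(beta, k)."""
    K, E = ell_F(math.pi / 2, k), ell_E(math.pi / 2, k)
    return ell_E(beta, k) - (E / K) * ell_F(beta, k)

k_sample = 0.6   # hypothetical modulus
print(heuman_lambda0(math.pi / 2, k_sample), jacobi_zeta(math.pi / 2, k_sample))
```

In practice one would solve the two conditions numerically for the map parameters; the sketch only shows how the building blocks can be evaluated and sanity-checked.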


We would also like the explicit relationship between the parameters in 
the map and x , the Koba-Nielsen variable that will appear in the Veneziano 
formula. We find 


x — 



(9.5.15) 


Also, we want the Jacobian relating $d\tau$, defined on the string 
world sheet describing four-string scattering, to the parameters of the upper 
half-plane. An explicit calculation gives us 

$$d\tau = \frac{2\pi}{K(\gamma^2)}\,\frac{d\alpha}{\sqrt{1+\alpha^2\gamma^2}\,\sqrt{\alpha^2+\gamma^2}}.$$ (9.5.16) 


Last, we must add the contribution of the ghosts. Let er l *+ represent the 
bosonized ghost contribution coming from the b ghosts. We find that the ghost 
part of the amplitude Aq is given by 


-f 


dz dz 


2ni dw 


exp \-J2 (<p(j)4>(k)) + {4>(j)4>+(z )) • (9.5.17) 

_ j<k j 

It is easy to evaluate the contraction over these ghost fields. They are given by 


{<t>(j)<l>(k)) = (In \zj -z k \+ In I Zj - z k [), 
(<Kj)<t>+iz)) = -{[Hzj - z) + In (z,j - z)]. 


Thus, the ghost contribution is given by 


A a = (a 2 



dz dz 1 

2 ni dw ( a 2 — z 2 )(or 2 — z 2 ) 


[ —y/a 1 + y 2 y/\+ a 2 y 2 ( 1 - a 4 )a~ 3 K(y 2 ). 
J 2n 


(9.5.19) 





Now, let us put all factors together. Notice that the $K(\gamma^2)$ factor coming 
from the ghost cancels precisely the same factor coming from the Jacobian. In 
fact, once the contribution from the string itself is included, we have the final 
form of the $s$-channel graph [21]: 

$$A_s = -2\int d\alpha\,\frac{1-\alpha^4}{\alpha^3}\exp\Bigl[-\sum_{j<k}P_j\cdot P_k\,\langle X(j)X(k)\rangle\Bigr] = -\frac14\int_{1/2}^{1}dx\,x^{2P_1\cdot P_2}(1-x)^{2P_2\cdot P_3}.$$ (9.5.20) 

When we add the t -channel contribution, the line integral goes from 0 to 1, and 
we retrieve the four-string Veneziano amplitude [Eq. (1.2.8)], as promised. 
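
The way the two channels combine can be illustrated numerically. The Veneziano amplitude is an Euler beta function, $\int_0^1 dx\,x^{a-1}(1-x)^{b-1} = \Gamma(a)\Gamma(b)/\Gamma(a+b)$, and the sketch below splits the integration range at $x = 1/2$ into two pieces that are exchanged under $x \to 1-x$, $a \leftrightarrow b$. The exponents `a` and `b` are hypothetical stand-ins for the momentum-dependent exponents, chosen so that the integral converges; the function names are likewise illustrative.

```python
import math

def koba_nielsen_piece(a, b, lo, hi, n=20000):
    """Midpoint-rule estimate of the integral of x**(a-1) * (1-x)**(b-1) over [lo, hi]."""
    h = (hi - lo) / n
    total = 0.0
    for i in range(n):
        x = lo + (i + 0.5) * h
        total += x ** (a - 1) * (1.0 - x) ** (b - 1)
    return total * h

def euler_beta(a, b):
    """Euler beta function expressed through gamma functions."""
    return math.gamma(a) * math.gamma(b) / math.gamma(a + b)

a, b = 2.5, 3.5                                   # hypothetical exponents
s_piece = koba_nielsen_piece(a, b, 0.5, 1.0)      # range covered by one channel
t_piece = koba_nielsen_piece(a, b, 0.0, 0.5)      # range supplied by the other channel
print(s_piece + t_piece, euler_beta(a, b))
```

Neither half is a beta function on its own; only their sum reproduces the full integral from 0 to 1, which mirrors the statement that both channels are needed.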


9.6 Superstring Field Theory 

Buoyed by the relatively easy successes of the bosonic open string field theory, 
we now wish to generalize our discussion to the case of NS-R superstring field 
theory. Unfortunately, the ease with which the bosonic open string field theory 
was constructed rapidly disappears when we begin to generalize our discussion 
to the case of superstrings. Let us review the NS-R superstring theory and 
isolate where the problem lies. 

We saw earlier that the NS-R theory has, in addition to the usual fermionic 
$b, c$ ghost system, a bosonic ghost system $\beta, \gamma$ in Eq. (1.3.19). The 
complication arises when we analyze the zero modes of these ghost operators. For the 
sake of convenience, let us combine both sets of ghosts into one set $b$ and $c$, 
such that the ghost action becomes [26]: 

$$S = \frac{1}{\pi}\int d^2z\;b\,\bar\partial c + \text{c.c.},$$ (9.6.1) 

where the $b$ field has conformal weight $\lambda$ and the $c$ field has conformal weight 
$1-\lambda$. 

Then, the energy-momentum tensor is 

$$T(z) = -\lambda\,b\,\partial c + (1-\lambda)\,\partial b\,c,$$ (9.6.2) 

where we have the normal mode decomposition 

$$b(z) = \sum_{n\in\delta-\lambda+\mathbb{Z}} z^{-n-\lambda}\,b_n, \qquad c(z) = \sum_{n\in\delta+\lambda+\mathbb{Z}} z^{-n-1+\lambda}\,c_n,$$ (9.6.3) 

where $\delta = 0$ for NS and $\delta = \tfrac12$ for R boundary conditions. 





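For the $b, c$ system, we have $\lambda = 2$, and for the $\beta, \gamma$ system, we have $\lambda = \tfrac32$. 
The ghost number current can be written as 

$$j(z) = -bc = \sum_n z^{-n-1}j_n, \qquad j_n = \sum_k \epsilon\,c_{n-k}\,b_k,$$ (9.6.4) 

where $\epsilon = +1$ for Fermi statistics and $\epsilon = -1$ for Bose statistics. 
Alternatively, we can form the ghost superfields 

$$B(z) = \beta(z) + \theta\,b(z), \qquad C(z) = c(z) + \theta\,\gamma(z),$$ (9.6.5) 

and the ghost action can be written as 

$$S = \frac{1}{\pi}\int d^2z\,d\theta\,d\bar\theta\;B\,\bar D C + \text{c.c.},$$ (9.6.6) 

and the ghost energy-momentum tensor is 

$$T(z) = -C\,\partial B + \tfrac12\,DC\,DB - \tfrac32\,\partial C\,B,$$ (9.6.7) 

which can be decomposed as 

$$T_F(z) = -c\,\partial\beta - \tfrac32\,\partial c\,\beta + \tfrac12\,\gamma\,b, \qquad T_B(z) = c\,\partial b + 2\,\partial c\,b - \tfrac12\,\gamma\,\partial\beta - \tfrac32\,\partial\gamma\,\beta.$$ (9.6.8) 

The BRST current can be written as 

$$j_{\rm BRST} = DC\,(C\,DB - \tfrac34\,DC\,B).$$ (9.6.9) 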

The problem arises when we try to define the ground state of the theory. For 
example, we have the freedom of defining several possible ground states $|q\rangle$: 

$$b_n|q\rangle = 0, \quad n > \epsilon q - \lambda, \qquad c_n|q\rangle = 0, \quad n \ge -\epsilon q + \lambda,$$ (9.6.10) 

with 

$$j_0|q\rangle = q|q\rangle, \qquad L_0|q\rangle = \tfrac12\,\epsilon\,q(Q+q)|q\rangle,$$ (9.6.11) 

where $Q = \epsilon(1-2\lambda)$ and $\lambda = \tfrac12(1-\epsilon Q)$. For the $b, c$ system, we have 
$\epsilon = 1$, $\lambda = 2$, $Q = -3$, and the central term $c = -26$. For the $\beta, \gamma$ system, 
we have $\epsilon = -1$, $\lambda = \tfrac32$, $Q = 2$, and $c = 11$. This is indeed puzzling 
because, in contrast to the usual Fock space associated with the bosonic string 
oscillators $\alpha_n$, we apparently have the disaster of having an infinite set of 
different vacuums, each labeled by $q$! 
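
The numbers quoted for the two ghost systems follow from $\epsilon$ and $\lambda$ alone. The sketch below is a hedged illustration: it uses $Q = \epsilon(1-2\lambda)$ and the $L_0$ eigenvalue $\tfrac12\epsilon q(q+Q)$ from the text, together with the standard first-order-system formula $c = \epsilon(1-3Q^2)$, which is an assumption brought in here because the text quotes only the resulting values. All function names are hypothetical.

```python
from fractions import Fraction as F

def background_charge(eps, lam):
    """Q = eps * (1 - 2*lambda)."""
    return eps * (1 - 2 * lam)

def central_charge(eps, lam):
    """c = eps * (1 - 3*Q**2), the standard result for a first-order (b, c) system."""
    Q = background_charge(eps, lam)
    return eps * (1 - 3 * Q * Q)

def l0_eigenvalue(eps, lam, q):
    """L_0 eigenvalue of the vacuum |q>: (1/2) * eps * q * (q + Q)."""
    Q = background_charge(eps, lam)
    return F(1, 2) * eps * q * (q + Q)

systems = {"b,c": (1, F(2)), "beta,gamma": (-1, F(3, 2))}
for name, (eps, lam) in systems.items():
    print(name, "Q =", background_charge(eps, lam), "c =", central_charge(eps, lam))

# Sample vacua: |q=1> for the Fermi ghosts and |q=-1> for the Bose ghosts.
print(l0_eigenvalue(1, F(2), 1), l0_eigenvalue(-1, F(3, 2), -1))
```

The two central charges add up to $-15$, which is what is needed to cancel the matter contribution of the ten-dimensional NS-R string.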

For the bosonic string with its Fermi ghosts $b, c$, the situation is actually 
simple to analyze. By multiplying with various monomials in the oscillators, 
it is possible to show that the various $|q\rangle$ vacuums are actually redundant. We 
recall for the bosonic ghost system that we had two zero mode oscillators $c_0$ 
and $b_0$, which meant that the vacuum state was degenerate: 

$$c_0|+\rangle = 0, \qquad b_0|-\rangle = 0, \qquad |+\rangle = c_0|-\rangle.$$ (9.6.12) 

We notice that there are two vacuums $|\pm\rangle$ that are related to each other by the 
multiplication of a monomial. 

Now, we can compare the old vacuums with the new ones. We find 

$$|q=1\rangle = |-\rangle, \qquad |q=2\rangle = |+\rangle.$$ (9.6.13) 

We can calculate the scalar product between these vacuums by noticing that 
we can place $j_0$ between the scalar product of any two states. The eigenvalue 
of $j_0$ changes when it acts on $\langle q|$ states. This is because we have the strange 
identity 

$$j_m^\dagger = -j_{-m} - Q\,\delta_{m,0}.$$ (9.6.14) 

It is now easy to show that the only surviving nonzero scalar product is 

$$\langle -q-Q|q\rangle = 1.$$ (9.6.15) 

This means 

$$\langle 2|1\rangle = 1.$$ (9.6.16) 

This also means that the other vacuums $|q\rangle$ are actually not independent 
vacuums at all. For example, the vacuum $|0\rangle$ can be written as the product of a 
monomial and the “true” vacuum 

$$|0\rangle = b_{-1}|1\rangle, \qquad |0\rangle = b_{-1}b_0|2\rangle.$$ (9.6.17) 

Similarly, other “vacuums” can be written in terms of the vacuums $|\pm\rangle$: 

$$|4\rangle = c_{-1}c_{-2}|2\rangle = c_0c_{-1}c_{-2}|1\rangle, \qquad |3\rangle = c_{-1}|2\rangle = c_0c_{-1}|1\rangle, \qquad |-1\rangle = b_{-1}b_{-2}|1\rangle = b_0b_{-1}b_{-2}|2\rangle.$$ (9.6.18) 

Thus, we can always take the “true vacuum” to be $|-\rangle$, although this choice is 
not unique. 

For the Bose sector, generated by the $\beta, \gamma$ ghosts, the situation is much 
more difficult. We find, in fact, that the $|q\rangle$ are actually linearly independent 
and that no combination of monomials constructed from the oscillators can 
convert one vacuum into another. This means that the $\beta, \gamma$ ghost sector of the 
NS-R superstring has an infinite number of vacuums! 

However, the “vacuums” that come closest to the usual definition of vacuums 
can be defined in the NS sector as $|q=-1\rangle$ and in the R sector as either 
$|q=-\tfrac12\rangle$ or $|q=-\tfrac32\rangle$. This is because, for the NS sector, we have (for 
half-integer $m, n$): 

$$\text{NS:}\quad \beta_m|-1\rangle = 0, \quad m \ge \tfrac12, \qquad \gamma_n|-1\rangle = 0, \quad n \ge \tfrac12.$$ (9.6.19) 

Because these states act on the $|-1\rangle$ vacuum, we will call this the “$-1$ picture.” 

For the R sector, we have (for integral $m, n$): 

$$\text{R:}\quad \beta_m|-\tfrac12\rangle = 0, \quad m \ge 0, \qquad \gamma_n|-\tfrac12\rangle = 0, \quad n \ge 1,$$ (9.6.20) 

and 

$$\text{R:}\quad \beta_m|-\tfrac32\rangle = 0, \quad m \ge 1, \qquad \gamma_n|-\tfrac32\rangle = 0, \quad n \ge 0.$$ (9.6.21) 

Their scalar products are as follows: 

$$\langle -\tfrac32|-\tfrac12\rangle = 1, \qquad \langle -1|-1\rangle = 1.$$ (9.6.22) 

However, we also note that we could equally have taken the “zero picture” for 
the NS states based on the vacuum $|q=0\rangle$. For the zero picture, we have 

$$\text{NS:}\quad \beta_n|0\rangle = 0, \quad n \ge -\tfrac12, \qquad \gamma_n|0\rangle = 0, \quad n \ge \tfrac32.$$ (9.6.23) 

The existence of an infinite number of linearly independent vacuums for 
the Bose $\beta, \gamma$ ghost sector at first seems like a disaster. However, there is 
a simple resolution for all of this. It is possible to show that, on-shell, all 
these vacuums are actually equivalent. We will call these different sectors of 
the theory different pictures, and we can show that, on-shell, all pictures are 
equivalent. 


9.7 Picture Changing 

The simplest way of showing this is to bosonize the $\beta, \gamma$ system. At first, this 
may seem a bit strange, since $\beta, \gamma$ are already bosons, but we can write them 
as the product of two fermions and then bosonize these fermions. This can be 
implemented by the following definitions [25]: 

$$\beta = \partial\xi\,e^{-\phi}, \qquad \gamma = \eta\,e^{\phi},$$ (9.7.1) 

or 

$$\eta = \partial\gamma\,e^{-\phi}, \qquad \partial\xi = \partial\beta\,e^{\phi},$$ (9.7.2) 

where we have written the bosons as the product of two fermions. We can then, 
in turn, bosonize once more 

$$\xi = e^{\chi}, \qquad \eta = e^{-\chi}.$$ (9.7.3) 

At first, this strange bosonization may seem formal and a bit useless. 
But there are several advantages to this bosonization. First, we can use this 
bosonization to interpolate between the NS and R sectors. Second, we can 
use this to create a fermion vertex function. Third, it will enable us to write a 
“picture changing” operator, which can be used to show that the S matrix is 
independent of the picture we use [26]. 

Notice that, in this formalism, 

$$j(z) = \epsilon\,\partial\phi(z)$$ (9.7.4) 

with the operator product expansion 

$$j(z)\,e^{q\phi(w)} \sim \frac{q}{z-w}\,e^{q\phi(w)}$$ (9.7.5) 

and 

$$T(z)\,e^{q\phi(w)} \sim \left[\tfrac12\,\epsilon\,q(q+Q)(z-w)^{-2} + (z-w)^{-1}\partial_w\right]e^{q\phi(w)}.$$ (9.7.6) 

This means that $e^{q\phi}$ has conformal weight given by 

$$\tfrac12\,\epsilon\,q(q+Q).$$ (9.7.7) 

Because the states $|q\rangle$ can be defined via their $L_0$ and $j_0$ eigenvalues and 
because multiplication by $e^{q\phi}$ can change these eigenvalues, we conclude that 
multiplication by $e^{q\phi}$ can actually change the eigenvalue $q$: 

$$e^{q\phi(0)}|0\rangle = |q\rangle.$$ (9.7.8) 

This is a rather remarkable formula. We saw in previous chapters that the NS 
and R sectors were based on distinct Fock spaces. Now, we see that 
multiplication by $e^{q\phi}$ can actually change NS vacuums into R vacuums and vice 
versa. 

Now, we would like to make some of these comments concerning picture 
changing more rigorous. This analysis will be greatly facilitated by the introduction of 
several important operators. First, there is the “picture changing operator,” which 
allows us to go from one picture to another: 

$$X(z) = \{Q, \xi(z)\} = e^{\phi}\,\psi^\mu\,\partial_z X_\mu + c\,\partial_z\xi - \tfrac14\,b\,\partial_z\eta\,e^{2\phi} - \tfrac14\,\partial_z\bigl(b\,\eta\,e^{2\phi}\bigr),$$ (9.7.9) 

which raises the total ghost number by one and has zero conformal weight 
(and hence may be multiplied with any vertex function without changing its 
conformal weight). 

Notice that $X$ is automatically BRST invariant because it is written explicitly 
as a BRST commutator. (However, we should be careful to note that $X$ is not, 
therefore, BRST trivial. Notice that $\xi$ is not part of the usual Fock space of 
operators that we have defined, since Eq. (9.7.1) is defined only with $\partial\xi$. Thus, 
although $X$ is written as a BRST commutator, it is not BRST trivial because 
$\xi$ is part of a “big algebra” and not part of the “small algebra,” which consists 
of the usual Fock space of operators.) 

This operator can also be written as the BRST commutator of a step function 
$\Theta$: 

$$X(z) = \{Q, \Theta[\beta(z)]\}.$$ (9.7.10) 

Once again, we see that $X$ is BRST invariant because it is a BRST 
commutator, but it is not BRST trivial because we have introduced a new operator, 
the step function $\Theta$, which is not part of the usual small algebra of ordinary 
operators. 

There is also the “inverse picture changing operator” 

$$Y(z) = 4c\,\partial_z\xi\,e^{-2\phi},$$ (9.7.11) 

which performs the opposite function of $X(z)$, that is, it reduces the ghost 
number by one and also has zero conformal weight. The inverse picture 
changing operator $Y$ can also be written as a Dirac delta function 

$$Y(z) = -c(z)\,\delta'[\gamma(z)].$$ (9.7.12) 
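
The weight formula $\tfrac12\epsilon q(q+Q)$ makes it easy to tabulate the exponentials that appear in the various pictures. The sketch below assumes the $\beta, \gamma$ values $\epsilon = -1$, $Q = 2$ quoted earlier, and, as further standard assumptions not spelled out in the text, that $c$ has weight $1-\lambda = -1$ and that $\xi$ has weight 0 (so $\partial\xi$ has weight 1). Under these assumptions the operator $c\,\partial\xi\,e^{-2\phi}$ has total weight zero. The names `weight_exp`, `WEIGHT_C`, and `WEIGHT_DXI` are hypothetical.

```python
from fractions import Fraction as F

EPS, Q = -1, 2            # beta-gamma system: Bose statistics, background charge 2

def weight_exp(q):
    """Conformal weight of exp(q * phi): (1/2) * eps * q * (q + Q)."""
    return F(1, 2) * EPS * q * (q + Q)

# Exponentials associated with the -1, -1/2, and -3/2 vacua, and with exp(phi).
for q in (F(-1), F(-1, 2), F(-3, 2), F(1)):
    print("q =", q, "weight =", weight_exp(q))

# Assumed weights of the remaining factors in c * d(xi) * exp(-2 phi).
WEIGHT_C = -1             # c ghost of the b, c system with lambda = 2
WEIGHT_DXI = 1            # derivative of a weight-0 field xi
weight_Y = WEIGHT_C + WEIGHT_DXI + weight_exp(F(-2))
print("weight of c dxi exp(-2 phi):", weight_Y)
```

The same counting shows why $e^{\phi}$, of weight $-\tfrac32$, must be paired with a weight-$\tfrac32$ matter supercurrent to give a weight-zero term.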

Formally, $X$ and $Y$ are inverses because 

$$\lim_{z\to w}X(z)\,Y(w) \sim 1.$$ (9.7.13) 

Extreme care, however, must be exercised whenever using products of these 
operators, since they are potentially ill defined, especially when taken at the 
same point. For example, products of these operators are actually infinite when 
taken at the same point 

$$\lim_{z\to w}X(z)\,X(w) \sim \infty.$$ (9.7.14) 


Products of these picture changing operators are actually only well defined 
on-shell on the physical Fock space, that is, on the cohomology class of Q. 
Products of these operators at the same point still diverge, but the divergence is 
outside the cohomology class of Q. Thus, if we restrict all of our manipulations 
within the cohomology class of Q (the physical subspace), then these operators 
are well defined. 

To see this, we shall introduce the operators 

$${:}X_0{:} = \oint\frac{dz}{2\pi i}\,\frac{X(z)}{z}, \qquad {:}Y_0{:} = \oint\frac{dz}{2\pi i}\,\frac{Y(z)}{z}.$$ (9.7.15) 

The key identity we want is 

$${:}X_0{:}\;{:}Y_0{:} = 1 + [Q, \epsilon],$$ (9.7.16) 

where $\epsilon$ is some operator. Because we restrict all of our comments to the 
cohomology class of $Q$, we can drop the second term (which may be infinite), 
meaning that we have an infinite number of picture changing operators and their 
inverses. On-shell, however, all these pictures are equivalent. To go between 
any of the various pictures (on-shell), one simply multiplies by ${:}X_0{:}$ or 
${:}Y_0{:}$. 

For completeness, we list some more identities involving the ghost operators 

$$\eta(z) = \partial\gamma\,\delta[\gamma(z)], \qquad e^{-\phi} = \delta[\gamma(z)], \qquad e^{\phi} = \delta[\beta(z)],$$ (9.7.17) 

which can be proven by showing that the left and right sides of the equations 
have the same operator product expansions. 


9.8 Superstring Action 


Now, let us apply this technology to the superstring action. Let us first work 
in the “$-1$ picture,” where the $A$ field has ghost number $-\tfrac12$ as usual, and the 
$\Psi$ field has ghost number zero. 

The only complication we have in counting ghost numbers is the vertex 
function. If we bosonize all the ghosts, then there are three ghost contributions, 
given by $\sigma$, $\phi$, and $\chi$. We can proceed as before, calculating their contributions 
to the energy-momentum tensor and then extrapolating back to obtain the final 
action. Repeating the same steps as for the bosonic ghost, we find 


5 


T J d2 $ 8 ab (3 aX 3 bX - 3 a<t> 3 b <t>) 

+ ^[\f d 2 £ R i* + 2 <t>) + f dlk iX + 2 <P) 


(9.8.1) 


From this, knowing that the curvature tensor R is zero except at the midpoint 
of three strings, we can read off the ghost contribution at the midpoint 

e i(p+ix/2 , (9.8.2) 


Including the ghost contribution from $\sigma$, this gives us a total ghost number 
of $\tfrac12$ for a new vertex $*$. This ghost number, however, causes some technical 
problems with regard to the counting. We can, for example, postulate the 
following variation of fields: 

$$\delta A = Q\Lambda + X(A*\Lambda - \Lambda*A) + \Psi*\chi - \chi*\Psi, \qquad \delta\Psi = Q\chi + X(A*\chi - \chi*A + \Psi*\Lambda - \Lambda*\Psi).$$ (9.8.3) 

Notice that, in order to get the ghost numbers correct, we had to insert a new 
operator $X$ at the midpoint. This operator has ghost number $+1$ and conformal 
weight 0. Fortunately, such an operator does exist, and it is the picture changing 
operator we met before. 

Similarly, there is now a problem with getting the ghost number of the free 
fermions to come out correctly. Naively, the net ghost number of $\int\Psi*Q\Psi$ 
equals $+1$, so we have to insert yet another operator with ghost number $-1$ at 
the midpoint. The most likely choice is the inverse picture changing operator 
$Y_{-1}$. The ghost numbers are 


$$\gamma: 1, \quad \beta: -1, \quad A: -\tfrac12, \quad \Psi: 0, \quad *: \tfrac12.$$ (9.8.4) 


With this choice of ghost numbers, the action in the $-1$ picture becomes 
[27]: 

$$S_{-1} = \frac12\int A*QA + \frac13\int X\,(A*A*A) + \frac12\int Y_{-1}\,(\Psi*Q\Psi) + \int A*\Psi*\Psi.$$ (9.8.5) 


It is easy to check that the action is invariant under the postulated 
transformations. 

At first, we seem to have a self-consistent field theory with a new supergauge 
invariance. However, this is not true. There is a fundamental flaw in the above 
action, the Wendt anomaly [28]. 

The problem arises because the picture changing operator X, defined at the 
same point, diverges 


$$\lim_{z\to w}X(z)\,X(w) \sim \infty.$$ (9.8.6) 


This divergence does not appear in the action, or in the gauge variation of 
the action, but it appears everywhere else, that is, in the amplitudes, the closure 
of the algebra, and the variation of the equations of motion [28]. For example, 
when calculating the bosonic four-string interaction, we are led to construct 
propagators such as 


X 



X 


(9.8.7) 


which are inserted between two vertex functions. By commuting the $X$ past 
the propagator, we find that we can cancel the $1/(L_0-1)$ term, in which case 
the two picture changing operators appear on top of each other. Similarly, by 
taking the multiple gauge variation of the fields, we find that the $X$'s appear 
on top of each other. To remedy the situation, we will go to the “0 picture” 
[29, 30], in which case the only changes in the ghost number are 

$$\mathrm{gh}(A) \to +\tfrac12, \qquad \mathrm{gh}(\Lambda) \to -\tfrac12.$$ (9.8.8) 

All other ghost numbers remain the same. In the 0 picture, the new variations 
are 

$$\delta A = Q\Lambda + A*\Lambda - \Lambda*A + X(\Psi*\chi - \chi*\Psi), \qquad \delta\Psi = Q\chi + A*\chi - \chi*A + \Psi*\Lambda - \Lambda*\Psi.$$ (9.8.9) 


Notice that if we take multiple variations of the fields or if we vary the 
equations of motion, we never have the problem with $X^2$. The new action in 
the 0 picture is [29, 30]: 

$$S = \frac12\int Y_{-2}\,A*QA + \frac13\int Y_{-2}\,A*A*A + \frac12\int Y_{-1}\,\Psi*Q\Psi + \int Y_{-1}\,A*\Psi*\Psi.$$ (9.8.10) 

The important point is that the collision of two $X$'s does not occur in the 0 
picture. 

To see this, let us make a gauge variation of the equations of motion to see 
if any divergent operators occur. The equations of motion are 

$$QA + A*A + X(\Psi*\Psi) = 0, \qquad Q\Psi + A*\Psi + \Psi*A = 0.$$ (9.8.11) 

A gauge variation of these equations yields 

$$\delta[QA + A*A + X(\Psi*\Psi)] = [QA + A*A + X(\Psi*\Psi), \Lambda] + X[Q\Psi + A*\Psi + \Psi*A, \chi],$$ 
$$\delta(Q\Psi + A*\Psi + \Psi*A) = [Q\Psi + A*\Psi + \Psi*A, \Lambda] + [QA + A*A + X(\Psi*\Psi), \chi].$$ (9.8.12) 

In fact, the only potentially singular operator identity we have when we iterate 
the gauge transformation is 

$$Y_{-2}X = Y_{-1}.$$ (9.8.13) 


To complete the 0 picture, we need to construct an explicit operator 
expression for $Y_{-2}$, which satisfies the above equation. There are actually several 
possibilities [29, 30]. The simplest uses a trick introduced in Ref. 30, that is, 
we introduce operators defined in the lower half-complex plane: 

$$Y_{-2} = Y(i)\,Y(-i),$$ (9.8.14) 

where $i$ represents the intersection midpoint of the string, and $-i$ symbolically 
represents the point with opposite chirality (i.e., it exists in the lower 
half-plane). Because the point $-i$ never meets the point $i$ (because they are on 
opposite sides of the real axis), there is never any problem with divergent 
operator expansions. 





However, this is not the only possibility. We can, for example, write all 
possible operators that are BRST invariant (but not BRST trivial), have the 
correct ghost number, and obey the previous expression. There are 15 possible 
operators that have the correct ghost and picture constraints. When we 
demand BRST invariance, this list reduces to five possible operators (and linear 
combinations of them), which then satisfy all possible constraints. 

At this point, it appears that we have too many possible candidates for 
the operator $Y_{-2}$ which have all the desired characteristics. Fortunately, one 
can show that all six possibilities of $Y_{-2}^{(i)}$ (and their combinations) are BRST 
equivalent to the original $Y_{-2}$, that is, 

$$Y_{-2}^{(i)} = Y_{-2} + [Q, v^{(i)}]$$ (9.8.15) 

for some operator $v^{(i)}$. 

In summary, we have now constructed a large class of open superstring 
field theories, each represented by a particular choice of $Y_{-2}^{(i)}$, but all of them 
on-shell equivalent. Thus, the on-shell theory, which is the only physically 
relevant theory, is unique. 


9.9 Summary 

The first quantized approach suffers from many problems, including the 
following: 

(1) Millions of possible vacuums are known, but the first quantized theory 
cannot choose which, if any, is the correct vacuum. 

(2) The first quantized theory cannot break supersymmetry, spontaneously 
compactify from 10 to 4 dimensions, or give us a completely successful 
low-energy phenomenology. 

(3) The first quantized approach may not be Borel summable. 

(4) The first quantized approach is not manifestly unitary. 

(5) Last, the first quantized path integral is not well defined, because the sum 
over conformally inequivalent surfaces is a classical, unsolved problem in 
Riemann surface theory. 

Within the last few years, three explicit triangulations of moduli space have 
been discovered, two of them arising from string field theory: 

(1) light cone coordinates, found in the light cone string field theory; 

(2) Harer coordinates, used in Witten’s string field theory; and 

(3) Penner coordinates, which cannot, as yet, be written as a string field theory. 

String field theory is a comprehensive method of formulating string theory 
nonperturbatively (although it still cannot compute the correct nonperturbative 
vacuum). String field theory is based on defining a multilocal field functional 
simultaneously defined at all points along the string: 

$$\Phi(X) = \lim_{N\to\infty}\Phi\bigl[X_\mu(\sigma_1)\cdots X_\mu(\sigma_N)\bigr].$$ (9.9.1) 

The simplest string field theory is the light cone theory, which begins by 
solving the Virasoro constraints 

X' 2 

Pl + —£ = 0, = 0. (9.9.2) 

7C l 

In the light cone gauge, P~ becomes the new Hamiltonian 

+ (9.9.3) 

so that the free action reads 

I DX t dp + 0>; + (X,) (i 4 > p+ (X,). (9.9.4) 

To compute interactions, we first use the conformal map between the upper 
half $z$ plane and the string surface 

$$\rho(z) = \sum_{i=1}^{N}\alpha_i\ln(z - z_i).$$ (9.9.5) 

Given all possible maps that one can define in this fashion, we find that 
the structure of light cone string field theory can be symbolically written as 
follows, for open (closed) string fields represented by $\Phi$ ($\Psi$): 

$$L_{\rm open} = \Phi^3 + \Phi^4 + \Phi^2\Psi + \Psi^3 + \Phi\Psi, \qquad L_{\rm closed} = \Psi^3.$$ (9.9.6) 

By contrast, the covariant BRST open string field theory is based on the 
simple observation that Q is nilpotent. Thus, the action 

S = j DX Db Dc Db Dc (9.9.7) 

where $\Phi$ has the fixed ghost number $-\tfrac12$, is invariant under 

$$\delta\Phi = Q\Lambda.$$ (9.9.8) 

The equation of motion $Q\Phi = 0$ yields the correct Fock space for the string. 

If we take the gauge 

$$b_0|\Phi\rangle = 0,$$ (9.9.9) 

then the Faddeev-Popov ghosts have ghosts. In fact, we have an infinite 
sequence of “ghosts-for-ghosts” whose quantization can best be described by 
the Batalin-Vilkovisky (BV) quantization method. The net effect of all these 
Faddeev-Popov ghosts is simple: we simply relax the ghost number on $\Phi$ to 
include all possible ghost numbers, with the action 

$$L = \langle\Phi(X, b, c)|\,(L_0^{X} + L_0^{\rm gh} - 1)\,|\Phi(X, b, c)\rangle.$$ (9.9.10) 


The fully interacting open string field theory can be written in exact parallel 
with the postulates of ordinary gauge theory: 

(1) the existence of the nilpotent derivative $Q$ such that $Q^2 = 0$; 

(2) the associativity of the $*$ product: 

$$[A*B]*C = A*[B*C];$$ (9.9.11) 

(3) the Leibniz rule: 

$$Q[A*B] = QA*B + (-1)^{|A|}A*QB;$$ (9.9.12) 

(4) the product rule: 

$$\int A*B = (-1)^{|A||B|}\int B*A;$$ (9.9.13) 

(5) the integration rule: 

$$\int QA = 0,$$ (9.9.14) 

where $(-1)^{|A|}$ is $-1$ if $A$ is Grassmann odd and $+1$ if it is Grassmann even. 

We postulate that the field $A$ has the following transformation rule 

$$\delta A = Q\Lambda + A*\Lambda - \Lambda*A.$$ (9.9.15) 

So, the Chern-Simons form is gauge invariant 

$$L = A*QA + \tfrac23\,A*A*A.$$ (9.9.16) 

The $*$ multiplication rule is carried out via the Dirac delta function. The bosonic 
vertex (without ghosts) is given by 

$$\int DX_1\,DX_2\,DX_3\;\Phi(X_1)*\Phi(X_2)*\Phi(X_3) = \int DX_1\,DX_2\,DX_3\;\Phi(X_1)\Phi(X_2)\Phi(X_3)\prod_{r=1}^{3}\ \prod_{0\le\sigma_r\le\pi/2}\delta\bigl[X_{r\mu}(\sigma_r) - X_{r-1,\mu}(\pi-\sigma_{r-1})\bigr].$$ (9.9.17) 

The ghost part of the vertex function is a bit more tricky. The $b, c$ system 
can be bosonized in terms of a scalar field $\sigma$, which has the action 

$$S = \frac{1}{2\pi}\int d^2\xi\,\sqrt{g}\,g^{ab}\,\partial_a\sigma\,\partial_b\sigma + \frac{3}{2\pi}\left[\frac12\int d^2\xi\,\sqrt{g}\,R\,\sigma + \int dl\,k\,\sigma\right].$$ (9.9.18) 



Notice that although the world sheet of the interacting string is flat almost 
everywhere, there are delta function curvature singularities at isolated points 
(the midpoints). This contributes a factor 

$$e^{3\sigma/2}$$ (9.9.19) 

at the midpoint of three strings. Thus, the $*$ multiplication rule must have a 
ghost insertion of $\tfrac32$ ghost number at the midpoint. This, then, gives us the 
ghost numbers for all fields and operators 

$$c: 1, \quad b: -1, \quad Q: 1, \quad A: -\tfrac12, \quad *: \tfrac32, \quad \int: -\tfrac32, \quad \Lambda: -\tfrac32.$$ (9.9.20) 


We can also show that the BRST theory reproduces the four-string scattering 
amplitude 

$$A_4(s,t) = \int_0^\infty d\tau\,\langle V_{125}|\,b_0\,e^{-\tau(L_0-1)}\,|V_{534}\rangle + (s \leftrightarrow t).$$ (9.9.21) 

The conformal map that takes us from the upper half $z$ plane to the $w$ plane 
of the interacting strings is 

$$\frac{dw}{dz} = \frac{N}{2}\,\frac{\sqrt{z^2+\gamma^2}\,\sqrt{z^2+\delta^2}}{(z^2-\alpha^2)(z^2-\beta^2)}.$$ (9.9.22) 


Then, it is straightforward to calculate the Jacobian taking us from $\tau$ defined 
on the world sheet to $x$, the Koba-Nielsen variable. We find 

$$d\tau = \frac{2\pi}{K(\gamma^2)}\,\frac{d\alpha}{\sqrt{1+\alpha^2\gamma^2}\,\sqrt{\alpha^2+\gamma^2}}.$$ (9.9.23) 

Then, the ghost contribution can be written as 

$$A_G = \oint\frac{dz}{2\pi i}\,\frac{dz}{dw}\exp\Bigl[-\sum_{j<k}\langle\phi(j)\phi(k)\rangle + \sum_j\langle\phi(j)\phi_+(z)\rangle\Bigr],$$ (9.9.24) 

so that the final result is 

$$A_s = -2\int d\alpha\,\frac{1-\alpha^4}{\alpha^3}\exp\Bigl[-\sum_{j<k}P_j\cdot P_k\,\langle X(j)X(k)\rangle\Bigr] = -\frac14\int_{1/2}^{1}dx\,x^{2P_1\cdot P_2}(1-x)^{2P_2\cdot P_3},$$ (9.9.25) 

which gives us the Veneziano model when we add the contribution from the 
other channel. 






The superstring field theory, however, is more involved. We start with the
$\beta$, $\gamma$ ghost action, which can be written as

    S = \frac{1}{\pi} \int d^2z\, b\, \bar{\partial} c + \text{c.c.},    (9.9.26)

where the b field has conformal weight $\lambda$ and the c field has conformal weight
$1 - \lambda$. Then, the energy-momentum tensor is

    T(z) = -\lambda\, b\, \partial c + (1 - \lambda)\, \partial b\, c,    (9.9.27)

where we have the normal mode decomposition

    b(z) = \sum_{n \in \delta - \lambda + \mathbb{Z}} z^{-n-\lambda}\, b_n,
    c(z) = \sum_{n} z^{-n-1+\lambda}\, c_n.    (9.9.28)


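For the b, c system, we have $\lambda = 2$, and for the $\beta$, $\gamma$ system, we have $\lambda = 3/2$.

The problem is that there are an infinite number of degenerate vacua for
the $\beta$, $\gamma$ system:

    b_n |q\rangle = 0,    n > \epsilon q - \lambda,
    c_n |q\rangle = 0,    n \ge -\epsilon q + \lambda,    (9.9.29)

where

    j_0 |q\rangle = q |q\rangle,
    L_0 |q\rangle = \tfrac{1}{2} \epsilon q (Q + q) |q\rangle,    (9.9.30)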


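where $Q = \epsilon(1 - 2\lambda)$ and $\lambda = \tfrac{1}{2}(1 - \epsilon Q)$. For example, for the $-1$ picture,
we have

    NS:    \beta_m |-1\rangle = 0,    m \ge \tfrac{1}{2},
           \gamma_n |-1\rangle = 0,    n \ge \tfrac{1}{2}.    (9.9.31)

Because these states act on the $|-1\rangle$ vacuum, we will call this the $-1$ picture.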
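As a quick numerical check of Eqs. (9.9.29) and (9.9.30), the sketch below evaluates the background charge, the $L_0$ eigenvalue, and the mode thresholds for a given vacuum $|q\rangle$. It assumes the usual convention that $\epsilon = +1$ for the anticommuting b, c system and $\epsilon = -1$ for the commuting $\beta$, $\gamma$ system; the function names are illustrative only.

```python
from fractions import Fraction as F

def background_charge(eps, lam):
    # Q = eps * (1 - 2*lambda)
    return eps * (1 - 2 * lam)

def l0_eigenvalue(eps, lam, q):
    # L_0 |q> = (1/2) * eps * q * (Q + q) |q>
    return F(1, 2) * eps * q * (background_charge(eps, lam) + q)

def annihilation_thresholds(eps, lam, q):
    # b_n |q> = 0 for n >  eps*q - lambda
    # c_n |q> = 0 for n >= -eps*q + lambda
    return eps * q - lam, -eps * q + lam

# Assumed conventions: (eps, lambda) = (+1, 2) for b, c and (-1, 3/2) for beta, gamma.
bc = (1, F(2))
beta_gamma = (-1, F(3, 2))

print("Q(b,c)          =", background_charge(*bc))
print("Q(beta,gamma)   =", background_charge(*beta_gamma))
print("L0 on |q=-1>    =", l0_eigenvalue(*beta_gamma, -1))
print("thresholds q=-1 =", annihilation_thresholds(*beta_gamma, -1))
```

With these conventions the $-1$ picture vacuum has $L_0 = 1/2$, and the thresholds $-1/2$ and $+1/2$ reproduce the NS conditions: all half-integer modes with $m, n \ge 1/2$ annihilate $|-1\rangle$.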

We can bosonize these ghosts by a trick. First, we break up each boson into
the product of two fermions, and we bosonize each fermion:

    \beta = e^{-\phi}\, \partial\xi,    \gamma = e^{\phi}\, \eta,    (9.9.32)

or

    \eta = \partial\gamma\, e^{-\phi},    \partial\xi = \partial\beta\, e^{\phi}.    (9.9.33)


This gives us the possibility of writing two new operators, the picture changing
operator X and the inverse picture changing operator Y:

    X(z) = \{Q, \xi(z)\}
         = e^{\phi} T_F + c\, \partial_z \xi - \tfrac{1}{4} b\, \partial_z \eta\, e^{2\phi} - \tfrac{1}{4} \partial_z (b\, \eta\, e^{2\phi})    (9.9.34)

and

    Y(z) = 4 c\, \partial_z \xi\, e^{-2\phi}.    (9.9.35)

In the original $-1$ picture, there was the problem of $X^2 = \infty$ appearing in
the amplitudes, as well as in the iteration of the gauge invariance. To solve this
problem, we go to the 0 picture, where the action is

    S = \tfrac{1}{2} \int Y_{-2}\, A * QA + \tfrac{1}{3} \int Y_{-2}\, A * A * A
      + \tfrac{1}{2} \int Y_{-1}\, \Psi * Q\Psi + \int Y_{-1}\, A * \Psi * \Psi.    (9.9.36)

This hinges upon developing a new operator, $Y_{-2}$, with the property

    Y_{-2} X = Y_{-1}.    (9.9.37)

One candidate for this is

    Y_{-2} = Y(i)\, Y(-i).    (9.9.38)

There are actually many possible solutions to the above equation, but all of 
them are on-shell equivalent. Thus, in the 0 picture, the theory seems to be free 
of any anomalies. 


References 


1. M. Kaku and K. Kikkawa, Phys. Rev. D10, 1110, 1823 (1974).

2. E. Witten, Nucl. Phys. B268, 253 (1986).

3. S. Giddings and S. Wolpert, Comm. Math. Phys. 109, 177 (1987). 

4. J. Harer, Invent. Math. 72, 221 (1982); Ann. of Math. 121, 215 (1985); J. Harer 
and D. Zagier, Invent. Math. 85, 457 (1986). 

5. R. C. Penner, in Mathematical Aspects of String Theory, S. T. Yau, ed., World
Scientific, Singapore (1986); Comm. Math. Phys. 113, 299 (1987).

For reviews, see Refs. 6-10. 

6. M. Kaku, “String Field Theory,” Int. J. Mod. Phys. A2, 1 (1987). 

7. W. Siegel, Introduction to String Field Theory , World Scientific, Singapore (1988). 

8. T. Banks, SLAC-PUB 3996 (1986). 

9. P. West, CERN/TH-4660 (1986). 

10. C. Thorn, Phys. Rep. 175, 1 (1989).

11. S. Mandelstam, Nucl. Phys. B64, 205 (1973); B69, 77 (1974).

12. E. Cremmer and J. L. Gervais, Nucl. Phys. B76, 209 (1974); Nucl. Phys. B90,
410 (1975).

13. W. Siegel, Phys. Lett. 142B, 276 (1984); 151B, 391, 396 (1985). 

14. T. Banks and M. Peskin, Nucl. Phys. B264, 513 (1986). 

15. W. Siegel and B. Zwiebach, Nucl. Phys. B263, 105 (1985). 

16. H. Hata, K. Itoh, T. Kugo, H. Kunitomo, and K. Ogawa, Phys. Lett. 175B, 138 
(1986). 





17. M. D. Freeman and D. I. Olive, Phys. Lett. 175B, 151 (1986). 

18. M. Peskin and C. B. Thorn, Nucl. Phys. B269, 509 (1986).

19. M. Bochicchio, Phys. Lett. 193B, 31 (1987).

20. O. Alvarez, Nucl. Phys. B216, 125 (1983). 

21. S. Giddings, Nucl. Phys. B278, 242 (1986). 

22. S. Giddings and E. Martinec, Nucl. Phys. B278, 91 (1986). 

23. S. Samuel, Phys. Lett. 181B, 249 (1986). 

24. D. Gross and A. Jevicki, Nucl. Phys. B282, 1 (1987). 

25. E. Cremmer, C. B. Thorn, and A. Schwimmer, Phys. Lett. 179B, 57 (1986).

26. D. Friedan, E. Martinec, and S. Shenker, Nucl. Phys. B271, 93 (1986). 

27. E. Witten, Nucl. Phys. B276, 291 (1986). 

28. C. Wendt, Nucl. Phys. B314, 209 (1989). 

29. C. R. Preitschopf, C. B. Thorn, and S. A. Yost, UFIFT-HEP-89-19; I. Ya. Aref’eva,
P. B. Medvedev, and A. P. Zubarev, SMI-10-1989.

30. O. Lechtenfeld and S. Samuel, Nucl. Phys. B310, 254 (1988). 



CHAPTER 10 


Nonpolynomial 
String Field Theory 


10.1 Four-String Interaction 

String field theory, so far, has been relatively clean and simple. For example, 
the light cone string field theory for closed strings [1] was purely cubic, yet it 
successfully reproduced the highly nonlinear theory of Einstein. The covariant 
version of the open string field theory [2] was even simpler, being just a
Chern-Simons term.

The covariant closed string theory, however, is where the real physics 
lies. The heterotic string is necessarily a closed string theory. However, the 
generalization of Ref. 2 to the covariant closed string case has proven to 
be unexpectedly difficult. Several groups have attempted to generalize the 
midpoint-type interactions to the closed string case [3,4]. In Fig. 10.1, we see 
the symmetric three-string vertex (which has the geometry of a cookie cutter). 

This symmetric configuration of three closed strings has caused considerable
confusion in the literature, with several claims that this three-string cubic
interaction is sufficient to yield a successful closed string field theory.
However, this cubic interaction fails to reproduce the amplitudes of string
theory. It is easy to see, using pictures alone, that the cubic interaction is
not gauge invariant and that it does not reproduce the usual Shapiro-Virasoro
amplitude [8, 9]. Higher interactions, in fact, are required to yield a
successful theory. The correct approach is to formulate a nonpolynomial action
for the closed string field theory. The nonpolynomial action was developed by
Kaku [5, 6] and also by the Kyoto/MIT group [7].

The symmetric configuration cannot be gauge invariant because the midpoint
of four or more closed strings is no longer a common point. To see this, let us
return to the case of open strings. If we make successive gauge variations of the
string field, we find that all strings share a common point, the midpoint. The
common boundary between adjacent strings is always $\pi/2$. Products of these
strings are therefore associative. Thus, the transformation $\delta\Phi = \Lambda * \Phi - \Phi * \Lambda$
creates a series of terms that cancel each other, because the minus sign coming
from one graph cancels the plus sign coming from another graph.

FIGURE 10.1.

For N closed strings, however, this is no longer the case. The reason for this
is that the string field must be multiplied by the factor

    P = \int_0^{2\pi} \frac{d\theta}{2\pi}\, e^{i\theta(L_0 - \bar{L}_0)},    (10.1.1)


which rotates the string, guaranteeing that the origin of the closed string is not 
a special point. If we now apply this gauge transformation on the closed string, 
we arrive at terms like 


8L - + (10.1.2) 

where the string in parentheses is rotated by an angle $\theta$ from the other two
strings. Thus, the midpoint is no longer a common point among the four strings, 
and cancellations do not occur. 

In particular, associativity is violated if we insist on keeping the
parametrization length of all closed strings equal to $2\pi$. For example, in the product
$(A \star B) \star C$, we see that the common boundary between strings A and B must
be $\pi$, but that string C may rotate at any angle with respect to the product $A \star B$
and, hence, may have a variable boundary with strings A and B. However, in
the product $A \star (B \star C)$, we find that A and B no longer share a common boundary
of $\pi$, but that strings B and C do. Therefore, the product is not associative,
and it is impossible to get a cancellation between these two terms when we
make a gauge variation of the fields. The beautiful axioms of cohomology in
Eqs. (9.4.1)-(9.4.4) are clearly violated for closed strings.

FIGURE 10.2.

The cubic interaction is also incorrect because it does not properly reproduce
the Shapiro-Virasoro amplitude. To see this, let us return to the case of open
string scattering. Let us imagine that two strings come in from the left, as in
Fig. 10.2(a), and that two other strings come in from the right. This graph is
part of the s-channel interaction. These two sets of cookie cutters collide and
then rescatter back to the left and right. In Fig. 10.2(b), we see the configuration
for t-channel scattering, where two pairs of strings come in from the top and
bottom. The two cookie cutters also collide and rescatter to the top and bottom.

Notice that the s-channel configuration can instantly be deformed into the
t-channel configuration because, at the instant of collision, they have the same
geometry. By adding the s- and t-channel graphs, we find that we can fill up the
entire region of integration for the Veneziano amplitude. Thus, the s-channel
graph occupies the region from 0 to 1/2 in Eq. (9.5.20), the t-channel graph
occupies the region from 1/2 to 1, and the instantaneous deformation of four
strings takes place at the very center, at x = 1/2. (This is in contrast to the light
cone case, shown in Fig. 10.2(c) and 10.2(d), where the t- and u-channel scattering
graphs have a totally different geometry and hence cannot be smoothly
connected without adding a new interaction, the four-string interaction. In fact,
we find a finite region of moduli space along the x axis that is missing if we
only have three-string light cone interactions in the action. This is, in fact, how
the four-string interaction was discovered in Ref. 1.)

FIGURE 10.3.

Notice that for four closed strings, this argument no longer holds [10, 11].
Imagine two closed strings coming in from the left, as in Fig. 10.3(a), meeting
two other closed strings coming in from the right, such that they are at a relative
twist of angle $\theta = 0$. This is the s-channel contribution. Notice that, for this
angle, the four closed strings can instantly rearrange themselves, such that two
closed strings leave in the up direction and two closed strings leave in the down
direction. This is the t-channel contribution, shown in Fig. 10.3(b). Therefore,
at first it appears that the s-channel graph can instantly be deformed into the
t-channel graph, leaving no missing region.

However, this is not true. In Fig. 10.4, we see that the s-channel interaction
can take place where the two cookie cutters are displaced by a nonzero angle
$\theta$ between them. Now, when they collide, it is impossible to instantly deform
the rotated s-channel graph into a t-channel graph. It is impossible to make
this transformation while keeping all string lengths equal.

One problem is that, for closed strings, moduli space for the four-string
interaction is two-dimensional. Thus, the angle $\theta$, which never appears in the
open string case, separates the two sets of closed strings and hence prevents
the instantaneous deformation of s-channel into t-channel graphs.

The resolution of the problem is to recognize that there is a missing region of
moduli space for the four-string scattering amplitude. Fortunately, it is possible
to precisely fill this missing region if we add a new elemental interaction to the
theory, a four-string interaction [10, 11].

FIGURE 10.4.

To understand how a four-string interaction can fill up the missing region, let
us first identify what the missing region looks like [10, 11]. The moduli space
of the Shapiro-Virasoro amplitude, we recall, is simply the entire complex
plane. This two-dimensional plane, in turn, can be stereographically mapped
to the sphere.

Let us analyze Fig. 10.5. In regions I, II, and III, we have the usual s-, t-, and
u-channel scattering amplitudes. Notice that these scattering amplitudes fill up
a two-parameter region of the sphere. This is because, for four-closed-string
scattering, there are two parameters that can describe the interaction: $\tau$, which
is the distance separating the two pairs of closed strings, and $\theta$, the relative
twist angle between these two sets of closed strings. Thus, regions I, II, and
III can each be parametrized by the two moduli $0 \le \theta \le 2\pi$ and $0 \le \tau < \infty$.

Notice, however, that regions I, II, and III do not fill up the sphere, but leave a
large, triangular portion of the northern and southern hemisphere absent. This
is the missing region. To understand this, imagine once again the collision of
two sets of closed strings as in Fig. 10.3, such that the angle $\theta$ separating them
is again set to zero. As we mentioned earlier, only at this point do we have the
ability to instantly deform the s- and t-channel graphs into each other.

On the sphere, these symmetrical points correspond to three points A, B, C
along the equator, which separate the regions I, II, and III. This, in turn, has 
the topology of a Rubik’s cube. Imagine a simplified Rubik’s cube (consisting 
of eight smaller cubes) where we have the ability of rotating the bottom half 
with respect to the top half or of rotating the left half with respect to the right 
half. We have two independent rotations we can make along a horizontal or 
vertical axis, in analogy with Fig. 10.4. Let us say that each rotation must be 
180°. It is easy to see that there are three distinct orientations of the Rubik’s 
cube that we can form by making successive 180° rotations. 

Therefore, we will call these three points A, B, C along the equator the Rubik’s
cube points. As expected, at each Rubik’s cube point, there are four ways
in which we can rotate the Rubik’s cube (either clockwise or counterclockwise 
along a horizontal or vertical axis). These four ways of rotating each of the 
three Rubik’s cube points correspond to the 12 lines that form the boundary of 
regions I, II, and III in Fig. 10.5. 

Notice that, as we perform various rotations away from the Rubik’s cube
points, we are only modifying the graph by a single parameter, given by
the twist angle $\theta$. However, we know that moduli space is actually
two-dimensional; therefore, we are moving from the s-channel to the t-channel
graph by a one-parameter family of rotations, which is not sufficient to fill the
missing region.

To smoothly deform an s-channel amplitude into a t-channel scattering amplitude
requires successive rotations, which are forbidden on the Rubik’s cube.
For example, imagine rotating the Rubik’s cube 45° along the vertical axis and
then rotating it again 45° along the horizontal axis. Clearly, the Rubik’s cube
will break, which means that the rotations that we can perform on the Rubik’s
cube only correspond to the boundary of regions I, II, and III, not the interior.

FIGURE 10.6.

To see in detail how these manipulations of the missing region can occur, let
us begin with a tetrahedron, as in Fig. 10.6, such that each of the four triangular
sides has equal perimeter, given by $2\pi$ [10, 11]. Each of these four sides will
represent one of the four closed strings, which have just collided.

Let the index i label each of the four faces. Let $a_{ij}$ represent the common
distance between the ith and jth triangles. We wish to set the perimeter of each
triangle to be $2\pi$:

    a_{12} + a_{13} + a_{14} = 2\pi,
    a_{21} + a_{23} + a_{24} = 2\pi,
    a_{31} + a_{32} + a_{34} = 2\pi,
    a_{41} + a_{42} + a_{43} = 2\pi.    (10.1.3)


We have six unknowns $a_{ij}$ and four constraints on them, so we have two
degrees of freedom with which to describe the tetrahedron. Notice that this is
also precisely the number of degrees of freedom in Koba-Nielsen space.
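The counting of two degrees of freedom relies on the four perimeter constraints being linearly independent. A minimal sketch of that check, using exact rational arithmetic and a small hand-written rank routine (the helper `rank` is illustrative, not taken from the text):

```python
from fractions import Fraction
from itertools import combinations

def rank(rows):
    """Rank of a matrix over the rationals via Gauss-Jordan elimination."""
    m = [[Fraction(x) for x in r] for r in rows]
    rk = 0
    for col in range(len(m[0])):
        pivot = next((r for r in range(rk, len(m)) if m[r][col] != 0), None)
        if pivot is None:
            continue
        m[rk], m[pivot] = m[pivot], m[rk]
        for r in range(len(m)):
            if r != rk and m[r][col] != 0:
                factor = m[r][col] / m[rk][col]
                m[r] = [a - factor * b for a, b in zip(m[r], m[rk])]
        rk += 1
        if rk == len(m):
            break
    return rk

# The six edges a_ij (i < j) of the tetrahedron with faces 1..4.
edges = list(combinations(range(1, 5), 2))

# One row per face: coefficient 1 for every edge bordering that face.
constraints = [[1 if face in e else 0 for e in edges] for face in range(1, 5)]

n_free = len(edges) - rank(constraints)
print("edges:", len(edges), "independent constraints:", rank(constraints),
      "free parameters:", n_free)
```

Since the four constraints have rank four, six edge lengths minus four constraints leave the two free parameters quoted above.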




Then, the three Rubik’s cube points A, B, C in Fig. 10.5 can be represented by

    A:    a_{12} = \pi,    a_{14} = \pi,
    B:    a_{12} = \pi,    a_{14} = 0,
    C:    a_{12} = 0,      a_{14} = \pi.    (10.1.4)

The three rotations (along one hemisphere) that can take us from one Rubik’s
cube point to another are then represented by

    A \to B:    a_{12} = \pi,    a_{14}: \pi \to 0,
    A \to C:    a_{12}: \pi \to 0,    a_{14} = \pi,
    B \to C:    a_{12} + a_{14} = \pi.    (10.1.5)

To find the missing region (i.e., the region allowed by the constraints on the
tetrahedron), let us take $a_{12}$ and $a_{13}$ as the two independent variables. Then,
solving the constraints, the missing region is given by [10, 11]:

    a_{12},\ a_{13} \le \pi,
    a_{12} + a_{13} \ge \pi.    (10.1.6)

(We have also placed the constraint $a_{ij} \le \pi$, whose meaning we will discuss
later.)

Let us now plot the missing region on a graph. In the $a_{12}$-$a_{13}$ plane, we see
that the missing region, as expected, is nothing but a right triangle. When this
triangle is mapped stereographically onto the sphere, we find that it spreads over
the northern hemisphere, completely filling the missing region. (By permuting
the legs, we also obtain the southern hemisphere.) By a simple analysis of the
tetrahedron graph, we find that we can completely fill the missing region $M_R$
of the Koba-Nielsen plane.
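The triangle can be made concrete with a short sketch. Subtracting pairs of the four perimeter constraints shows that opposite edges of the tetrahedron are equal ($a_{34} = a_{12}$, $a_{24} = a_{13}$, $a_{23} = a_{14}$), so every edge follows from $a_{12}$ and $a_{13}$. The code below assumes this and tests the condition $0 \le a_{ij} \le \pi$ on all six edges; the function names are illustrative.

```python
import math

PI = math.pi

def tetrahedron_edges(a12, a13):
    """All six edge lengths, given the two independent ones.

    Uses a14 = 2*pi - a12 - a13 and the equality of opposite edges,
    both of which follow from the four perimeter constraints.
    """
    a14 = 2 * PI - a12 - a13
    return {"12": a12, "13": a13, "14": a14,
            "34": a12, "24": a13, "23": a14}

def in_missing_region(a12, a13, tol=1e-12):
    """True if every edge satisfies 0 <= a_ij <= pi (within tolerance)."""
    return all(-tol <= a <= PI + tol
               for a in tetrahedron_edges(a12, a13).values())

# Rubik's cube points expressed in the (a12, a13) plane:
# A: a12 = pi, a14 = pi  ->  a13 = 0
# B: a12 = pi, a14 = 0   ->  a13 = pi
# C: a12 = 0,  a14 = pi  ->  a13 = pi
corners = {"A": (PI, 0.0), "B": (PI, PI), "C": (0.0, PI)}
for name, (a12, a13) in corners.items():
    print(name, in_missing_region(a12, a13))

print("interior point:", in_missing_region(0.75 * PI, 0.75 * PI))
print("outside point: ", in_missing_region(0.25 * PI, 0.25 * PI))
```

In this parametrization the three Rubik’s cube points are the corners of the right triangle, with the right angle at B.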

To understand how the boundary of the missing region $\partial M_R$ connects various
Mandelstam channels, let us first analyze the pole structure of the various
amplitudes. For the four-string scattering amplitude, for example, we have the
following pole structure:

    A(s, t, u) = \sum_I A^I_{12} \frac{1}{s_{12} - m_I^2} A^I_{34} + (2 \leftrightarrow 3) + (2 \leftrightarrow 4) + M_R(s, t, u),    (10.1.7)

where I represents an infinite tower of Reggeons, $s_{ij}$ represents the energy
squared of the Reggeon in the channel formed by the ith and jth strings, and
the last term represents the tetrahedron graph, which has no poles.

Let us now capsulize our results. If we let $\tau \to 0$ for the scattering amplitude
for two cookie cutters, we find that the topology of the four colliding cookie
cutters forms a graph, called $G_4$, as in Fig. 10.4. This graph has the topology of
a degenerate tetrahedron, that is, one of its legs, for example, $a_{12}$, has a length
equal to zero

    G_4 = \lim_{a_{12} \to \pi} P_4.    (10.1.8)

Notice that each graph $G_4$ is parametrized by an angle $\theta$, that is, the
angle at which the two cookie cutters collide. Thus, there is a one-to-one
correspondence between graphs $G_4$ and points on the boundary of the missing region

    G_4 \leftrightarrow \partial M_{R_4}.    (10.1.9)

FIGURE 10.7.

Now that we have practiced with the four-string interaction, let us consider
the higher interactions. For five-point scattering, the poles are represented via
the energy variables $s_{ij}$. We have

    A(s_{ij}, s_{kl}) = \sum_{\text{permutations}} \sum_{I,J} A^I_{12} \frac{1}{s_{12} - m_I^2} A^{IJ}_{3} \frac{1}{s_{45} - m_J^2} A^J_{45}
        + \sum_{\text{permutations}} \sum_I A^I_{12} \frac{1}{s_{12} - m_I^2} M_{I345} + M_5(s_{ij}, s_{kl}),    (10.1.10)

where $M_{I345}$ and $M_5(s_{ij}, s_{kl})$ represent the tetrahedron and the prism graphs,
respectively, which have no poles.

Let us work out the detailed structure for N = 5 in Fig. 10.7. The missing
region is given by [6]:

    M_{R_5} = \begin{cases}
        a_{12} \le a_{35} \le a_{12} + a_{24} \le a_{35} + \pi, \\
        a_{13} \le a_{24} \le a_{13} + a_{35} \le a_{24} + \pi, \\
        a_{12} + a_{13} \le \pi.
    \end{cases}    (10.1.11)

Now, let us analyze the missing region. Unfortunately, because the missing region
is four-dimensional, it is impossible to visualize its boundaries. However,
by setting one of the edges to $\pi$, we can reconstruct the three-dimensional
figure corresponding to $\partial M_{R_5}$.

Let us set $a_{24} = \pi$. (Notice that there are 10 permutations we can make on
this choice.) For this particular boundary, we now have

    a_{12} \le a_{35},
    a_{13} + a_{35} \ge \pi,
    a_{12} + a_{13} \le \pi.    (10.1.12)





The Rubik’s cube points for the five-faced prism graph are more complicated
than for the tetrahedron. First, we have the Rubik’s cube points for each of
the tetrahedrons contained within the prism, corresponding to setting $a_{ij} = \pi$.
These have previously been studied in detail. More interesting are the Rubik’s
cube points for the prism graph itself. For a particular permutation of the external
lines, there are 12 ways in which to obtain the Rubik’s cube points

Rubik’s cube points: ( ?‘ 4 = 0,3 “V * 25 = fl23 = °> (10.1.13) 

[12 permutations. v y 

[The total number of Rubik’s cube points for all possible permutations of 
external lines, but excluding the lower-order Rubik’s cube points, is given by 

\{N-m 

The N = 5 case, however, is too simple to reveal the next major
complication, the fact that there is more than one polyhedron at each level.
For example, we have been able to identify at least two distinct polyhedra
at the sixth level, five distinct polyhedra at the seventh level, and 14 distinct
polyhedra at the eighth level.

In Figs. 10.8(a) and 10.8(b), we see how the labeling for the polyhedra is
given for up to N = 6. There are two polyhedra at this level, which we call
$(6)_1$ and $(6)_2$. To solve for the missing region, we need only to set $a_{ij} \le \pi$
and the internal perimeter for three contiguous polygons to be greater than $2\pi$.
The complication in finding the missing region is that there are many hidden
identities among the various legs that one must factor out in order to find the
missing region. If we choose $a_{12}, a_{25}, a_{34}, a_{36}, a_{46}, a_{16}$ as our
set of independent variables, then we find for the missing region $M_{R_{6(1)}}$ of the cube
[6]:

    a_{36} \le a_{12} + a_{25} \le a_{16} + a_{36} + a_{46} \le \pi + a_{12} + a_{25},
    a_{12} + a_{25} \le 2a_{36} + a_{34} + a_{16} + a_{46} - \pi \le \pi + a_{12} + a_{25},
    a_{12} + a_{25} \le a_{46} + a_{34} + a_{36} \le \pi + a_{12} + a_{25},
    a_{25} \le a_{34} + a_{46} + a_{36} + a_{16} - \pi \le \pi + a_{25},
    a_{25} \le a_{36} + a_{16} \le \pi + a_{25},
    a_{25} \le a_{34} + a_{36},
    a_{36} + a_{34} + a_{46} \le 2\pi,
    a_{12} \le a_{46} + a_{36}.

Let us now analyze the structure of this missing region. The missing region,
of course, is six-dimensional and cannot be visualized. However, it is possible
to analyze the boundary of the missing region rather simply. If we let $a_{15}$ go
to zero and $a_{14} = \pi$, then this is equivalent to making the polygon formed by
the first and fourth strings have circumference $2\pi$, that is, the first and fourth
strings form a cookie cutter and hence can be removed from the polygon,
creating a five-faced prism. In this way, we can reduce the missing region of
the cubic polyhedron into the prism graph or lower. We find

    \lim_{a_{15} \to 0;\ a_{14} \to \pi} M_{R_{6(1)}} = M_{R_5}.    (10.1.15)

FIGURE 10.8.

Since we have already analyzed the boundary of the missing region of the
prism graph, we have now decomposed the missing region of the cubic graph.

There is one more limit one can take on the missing region, and that is to
take the perimeter surrounding three faces to be equal to $2\pi$. Then, the cube
splits into two tetrahedrons. For example, we have

    \lim_{P_{125} \to 2\pi} M_{R_{6(1)}} = M_{R_4} \oplus M_{R_4},    (10.1.16)

where $P_{125}$ is the perimeter that encloses the first, second, and fifth strings in
the cube. Thus, once again, we have found that the boundary of the missing
region can always be decomposed into lower-order polyhedra, so there are no
new surprises.

Let us analyze the other, asymmetric six-faced polyhedron shown in Fig.
10.8(b). Again, there are many hidden identities that prevent a simple analysis
of this figure. However, we find that a basic set of independent variables is
given by $a_{16}, a_{13}, a_{56}, a_{26}, a_{34}, a_{46}$ and that the dependent variables are given
in terms of this independent set [6]:

    a_{36} = 2\pi - a_{16} - a_{26} - a_{46} - a_{56},
    a_{23} = -a_{13} - a_{34} + a_{16} + a_{26} + a_{46} + a_{56},
    a_{12} = 2\pi - 2a_{26} + a_{13} + a_{34} - a_{16} - a_{46} - a_{56},
    a_{15} = a_{26} - a_{13} + a_{46},
    a_{14} = -a_{34} + a_{56} + a_{26} - a_{13},
    a_{54} = 2\pi - a_{56} - a_{26} + a_{13} - a_{46}.    (10.1.17)
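The relations in Eq. (10.1.17) can be checked against the requirement that every face has perimeter $2\pi$. The sketch below is only a consistency check under two assumptions: the face adjacency is inferred from which edge labels appear in the equation (it is not stated explicitly in the text), and the $a_{56}$ term in $a_{14}$ is taken with a plus sign, which is the reading that closes face 1.

```python
import math
import random

TWO_PI = 2 * math.pi

# Edges bordering each face, inferred from the labels in Eq. (10.1.17) (assumption).
FACES = {
    1: ("12", "13", "14", "15", "16"),
    2: ("12", "23", "26"),
    3: ("13", "23", "34", "36"),
    4: ("14", "34", "45", "46"),
    5: ("15", "45", "56"),
    6: ("16", "26", "36", "46", "56"),
}

def all_edges(a16, a13, a56, a26, a34, a46):
    """Six independent edges plus the six dependent ones of Eq. (10.1.17)."""
    a = {"16": a16, "13": a13, "56": a56, "26": a26, "34": a34, "46": a46}
    a["36"] = TWO_PI - a16 - a26 - a46 - a56
    a["23"] = -a13 - a34 + a16 + a26 + a46 + a56
    a["12"] = TWO_PI - 2 * a26 + a13 + a34 - a16 - a46 - a56
    a["15"] = a26 - a13 + a46
    a["14"] = -a34 + a56 + a26 - a13   # sign of a56 is an assumed reading
    a["45"] = TWO_PI - a56 - a26 + a13 - a46
    return a

def face_perimeters(edges):
    return {face: sum(edges[e] for e in borders) for face, borders in FACES.items()}

random.seed(0)
sample = [random.uniform(0.0, 1.0) for _ in range(6)]
perimeters = face_perimeters(all_edges(*sample))
print(all(math.isclose(p, TWO_PI) for p in perimeters.values()))
```

The check only tests the linear identities; it does not test whether the sample lies inside the missing region, which needs the additional inequalities on the slice perimeters.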


The restriction that all perimeters that bisect the polyhedron have lengths
greater than or equal to $2\pi$ then serves to determine the entire missing
region $M_{R_{6(2)}}$:

    a_{13} \le a_{26} \le a_{13} + a_{34},
    a_{13} \le a_{26} + a_{46} \le \pi + a_{13},
    a_{13} + a_{34} \le a_{46} + a_{56} + a_{26} \le 2\pi + a_{13},
    a_{13} \le a_{16} + a_{56} + a_{26} + a_{46} \le 2\pi + a_{13},
    a_{34} \le a_{16} + a_{56} + a_{26} \le a_{34} + 2\pi,    (10.1.18)
    \pi \le a_{16} + a_{26} + a_{46} + a_{56} \le 2\pi,
    a_{16} + a_{56} + a_{26} + a_{46} \le \pi + a_{13} + a_{34},
    a_{26} + a_{16} + a_{46} + a_{56} \le 2\pi + a_{13} + a_{34} - a_{26}
        \le \pi + a_{26} + a_{16} + a_{46} + a_{56}.

It is now straightforward to analyze the boundary of the missing region for
the $(6)_2$ polyhedron and show that we can also reduce it to lower polyhedra by
taking specific values of the legs. For example, by taking $a_{12}$ to be $\pi$ and $a_{13}$
to be 0, we can substitute these values into the missing region $M_{R_{6(2)}}$ and show
that it reproduces the prism graph, that is,

    \lim_{a_{12} \to \pi;\ a_{13} \to 0} M_{R_{6(2)}} = M_{R_5}.    (10.1.19)

For this configuration, we see that the first string has disappeared, reducing the
six-faced polyhedron down to a five-faced polyhedron.

Similarly, we can show that, by taking the perimeter that surrounds the first,
second, and third polygons to be equal to $2\pi$, the polyhedron $(6)_2$ splits into
two smaller tetrahedrons:

    \lim_{P_{123} \to 2\pi} M_{R_{6(2)}} = M_{R_4} \oplus M_{R_4}.    (10.1.20)

These two reductions of the polyhedron $(6)_2$ into lower polyhedra simply represent
the fact that we are taking clusters of polygons to represent external strings
with length $2\pi$, thus confirming again that the boundary of missing regions
always connects different Mandelstam channels corresponding to midpoint
scattering, that is, scattering of cookie cutters and clusters of cookie cutters.


10.2 N-Sided Polyhedra

Let us generalize some of this to the N-sided polyhedron case. Let N be the
number of faces of this polyhedron. Then, the number of vertices or corners C
and the number of edges E can be written as follows:

    N = \#\ \text{faces},
    E = 3(N - 2) = \#\ \text{edges},
    C = 2(N - 2) = \#\ \text{corners}.    (10.2.1)



Let us label each of these faces by i, which runs from 1 to N. The total
number of variables appearing in our formulas given by $a_{ij}$ is equal to the
number of edges E.

Let us now calculate the number of independent variables within the N-sided
polyhedra. We set the total perimeter of each side (corresponding to an
external closed loop) equal to $2\pi$. This constraint can be easily enforced by
setting

    \sum_{j=1}^{N} a_{ij} = 2\pi.    (10.2.2)

Then, the total number of independent variables is equal to the number of $a_{ij}$,
or edges E, minus the number of constraints, or N. Thus, the total number of
variables (or Koba-Nielsen variables) is equal to

    E - N = 2N - 6 = \#\ \text{Koba-Nielsen variables},    (10.2.3)

which is the correct counting. (For the N-point scattering amplitude, there are
2N Koba-Nielsen variables $z_i$, but six of them can be eliminated by choosing
three of them to be the points 0, 1, $\infty$, leaving the number 2N − 6 for the
independent Koba-Nielsen variables.)
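The counting in Eqs. (10.2.1) and (10.2.3) can be cross-checked against Euler's formula for a polyhedron with the topology of a sphere, corners − edges + faces = 2, together with the fact that three edges meet at each corner, so that 3C = 2E. A minimal sketch (the function name is illustrative):

```python
def polyhedron_counts(n_faces):
    """Edges and corners of an N-faced polyhedron with trivalent corners."""
    edges = 3 * (n_faces - 2)
    corners = 2 * (n_faces - 2)
    return edges, corners

for n in range(4, 9):
    e, c = polyhedron_counts(n)
    assert c - e + n == 2          # Euler's formula on the sphere
    assert 3 * c == 2 * e          # three edges meet at every corner
    assert e - n == 2 * n - 6      # number of Koba-Nielsen variables
    print(f"N={n}: E={e}, C={c}, independent a_ij = {e - n}")
```

For N = 4, 5, 6 this reproduces the tetrahedron (6 edges, 4 corners), the prism (9 edges, 6 corners), and the cube (12 edges, 8 corners), with 2, 4, and 6 independent variables, respectively.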

Now, let us compare this with the number of variables appearing in the
conformal map. The map is [5, 6]:

    \frac{d\rho}{dz} = \mathcal{N}\, \frac{\prod_{i=1}^{N-2} [(z - w_i)(z - \tilde{w}_i)]^{1/2}}{\prod_{i=1}^{N} (z - \gamma_i)} = \frac{f(z)}{g(z)}.    (10.2.4)

The unknowns coming from $\gamma_i$ are directly related to the Koba-Nielsen
variables. Notice that we have 2N variables contained within the complex $\gamma_i$.
However, we know that three of them can be fixed to be 0, 1, $\infty$, so we really
only have 2N − 6 variables within the $\gamma_i$, which is precisely the number of
Koba-Nielsen variables.

The total number of remaining unknowns is given by the 2 × 2 × (N − 2)
variables contained within the complex $w_i$ and $\tilde{w}_i$, as well as two coming
from $\mathcal{N}$. Thus, the total number of remaining unknowns is equal to 4N − 6.

These remaining unknowns can be fixed by placing external constraints on
the theory. Notice that we must set N external strings to have length $2\pi$, which
can be enforced, as before, as

    2\pi = \lim_{z \to \gamma_i} \frac{f(z)(z - \gamma_i)}{g(z)}.    (10.2.5)

This gives us a total of 2N constraints.

Last, we have the constraints coming from the fact that the collision of N
strings takes place simultaneously at a constant value of $\tau$ in the z plane. Thus,
we wish to set the real parts of all interacting points to be the same:

    \mathrm{Re}\, \rho(w_i) = \mathrm{Re}\, \rho(w_j)    (10.2.6)

for all i and j. Since there are N − 2 pairs of interacting points, this gives us
2 × (N − 2) constraints, minus 2. Altogether, we have 2N + 2(N − 2) − 2
constraints, for a total of 4N − 6 constraints. Notice that this is precisely equal
to the number of unknowns in the mapping.

Let us now summarize how the counting proceeds for the N-sided polyhedra
and also the conformal map. For the Koba-Nielsen variables, we have

    \text{Koba-Nielsen variables} = 2N - 6 = \begin{cases} E - N, \\ \#(\gamma_i) - 6, \end{cases}    (10.2.7)

while for the unknowns and the constraints, we have

    4N - 6\ \text{unknowns:}\quad (\mathcal{N}, w_i, \tilde{w}_i),
    4N - 6\ \text{constraints:}\quad [\rho(\gamma_i),\ \mathrm{Re}\, \rho(w_i)\ \text{the same}].    (10.2.8)


Now that we have determined precisely the relationship between constraints
and unknowns, we must tackle the more difficult constraint of setting limits
on the range of the $a_{ij}$. The key to this is to realize that, for the cases of N = 4
and N = 5, we had the curious constraint $a_{ij} \le \pi$.

To understand this curious constraint, let us introduce the idea of “slicing”
the polyhedron in half, that is, dividing up the N faces into two sets, such that the
faces in each set are contiguous. Let us say that the ways in which we can
partition the polyhedron into two sets are labeled by I. Call the set on the left L,
with elements labeled by i, and the set on the right R, with elements labeled
by j. The number of contiguous faces within L or R must be greater than or equal
to 2. Then, define $P_I$ as the perimeter of the slice:

    P_I = \sum_{i \in L;\ j \in R} a_{ij}.    (10.2.9)


We will now generalize the curious constraint as follows for the arbitrary
N-sided polyhedra [5-7]:

    P_I \ge 2\pi \quad \text{for all } I.    (10.2.10)

If we define $M_N$ to be the region in $a_{ij}$ space defined by the N-sided polyhedron,
then the boundary of $M_N$ corresponds to $P_I = 2\pi$:

    \partial M_N = \lim_{P_I = 2\pi} M_N.    (10.2.11)

For example, for the N = 4 case, there are three ways in which we can slice
the tetrahedron, with two faces in L or R.

Let us take the partition so that the i = 1 and i = 2 faces are within L. But,
demanding that the perimeter $P_I$ of the four-sided figure in L be greater than
$2\pi$ is equivalent to fixing a constraint on the common boundary between these
faces, $a_{12} \le \pi$. Thus, the origin of the constraint $a_{ij} \le \pi$, first found for N = 4,
is simply the constraint that $P_I \ge 2\pi$ for the tetrahedron, that is,

    a_{ij} \le \pi \leftrightarrow P_I \ge 2\pi \quad \text{for } N = 4.    (10.2.12)
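The equivalence for N = 4 can be seen directly. For the slice L = {1, 2}, R = {3, 4}, the perimeter is $P_I = a_{13} + a_{14} + a_{23} + a_{24}$; using the perimeter constraints on faces 1 and 2, this equals $(2\pi - a_{12}) + (2\pi - a_{12}) = 4\pi - 2a_{12}$, so $P_I \ge 2\pi$ exactly when $a_{12} \le \pi$. The sketch below checks this numerically; it assumes the tetrahedron parametrization in which opposite edges are equal (a consequence of the four perimeter constraints), and the helper names are illustrative.

```python
import math

PI = math.pi

def tetrahedron_edges(a12, a13):
    """Edge lengths keyed by unordered face pairs, from the two free ones."""
    a14 = 2 * PI - a12 - a13
    pairs = {(1, 2): a12, (3, 4): a12,
             (1, 3): a13, (2, 4): a13,
             (1, 4): a14, (2, 3): a14}
    return {frozenset(k): v for k, v in pairs.items()}

def slice_perimeter(edges, left, right):
    """P_I = sum of a_ij over i in L and j in R (absent edges count as zero)."""
    return sum(edges.get(frozenset((i, j)), 0.0) for i in left for j in right)

for a12 in (0.6 * PI, 1.0 * PI, 1.2 * PI):
    edges = tetrahedron_edges(a12, 0.5 * PI)
    p = slice_perimeter(edges, left=(1, 2), right=(3, 4))
    # P_I should equal 4*pi - 2*a12 for this slice.
    print(round(a12 / PI, 2), math.isclose(p, 4 * PI - 2 * a12),
          p >= 2 * PI - 1e-12, a12 <= PI + 1e-12)
```

The same function applies to larger polyhedra once the edge dictionary is filled in, since faces that do not touch simply contribute nothing to the sum.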





10.3 Nonpolynomial Action 

Let us now write the nonpolynomial action that obeys all these constraints 
[5-7]. There are, however, several major complications in addition to those 
found for the open string field theory. 

First, we must deal with the ghost counting problem. Since the Fock space
of the closed string field $\Psi(X)$ is composed of products of the left-moving
and right-moving states of the open string field $\Phi(X)$, the ghost number of the
closed string field must be integral. By carefully examining the cohomology
of $\Psi$, it can be shown that it contains two complete transverse closed string
Fock spaces, with ghost numbers 0 or −1. For either case, however, the naive
closed string action must vanish because the ghost numbers do not sum to zero:

    \langle \Psi | Q \Psi \rangle = 0.    (10.3.1)

If the field has ghost number 0 (−1), then the naive action has ghost number
+1 (−1), so the action vanishes in either case.

We have two ways in which to get the ghost numbers to match. We can insert
the operators $b_0 - \bar{b}_0$ or $c_0 - \bar{c}_0$ into the action if $\Psi$ has ghost number 0 or
−1, respectively. The action is then nonzero. But, there is the second complication,
which is that we must somehow obtain the constraint

    (L_0 - \bar{L}_0) |\Psi\rangle = 0,    (10.3.2)

which states that the string field should have no dependence on the origin of
the coordinate axis. Since this constraint does not emerge from the action, it
must be imposed from the outside, as an additional constraint (although attempts
have been made to derive this from an additional ghost constraint). [To
deal with these questions, the philosophy we take in this chapter is that the
nonpolynomial action is inherently a gauge fixed action. Because
reparametrization invariance lies at the heart of string theory and since the nonpolynomial
action breaks reparametrization invariance (e.g., because the string has fixed
parametrization length $2\pi$), we will treat the action as the by-product of gauge
fixing a higher action, so that the imposition of Eq. (10.3.2) from the outside
poses no problem. If certain rules for the ghost modes seem a bit artificial,
it is because the nonpolynomial action is the gauge fixed by-product of a
reparametrization invariant action.]

There are, therefore, several ways in which to construct the closed string 
action at the free level that lead eventually to the same cohomology and hence 
identical transverse states. We thus make the following choices. We define the 
string field as follows: 

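$$|\Psi\rangle = c_0^-|\phi\rangle + c_0^- c_0^+|\psi\rangle + |\chi\rangle + c_0^+|\eta\rangle, \qquad (10.3.3)$$

where the physical transverse states reside in the lowest excitation of
$|\phi\rangle$ and where $c_0^+ = \tfrac{1}{2}(c_0 + \bar c_0)$,
$c_0^- = c_0 - \bar c_0$, $b_0^+ = b_0 + \bar b_0$, and
$b_0^- = \tfrac{1}{2}(b_0 - \bar b_0)$.

So, the action reads

$$L = \langle\Psi|Q\,b_0^-|\Psi\rangle, \qquad (10.3.4)$$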





where we define the string field $\Psi$ to have the factor P of Eq. (10.1.1)
inserted at all times. We will simply abbreviate this action by the usual
notation $\langle\Psi|Q\Psi\rangle$, where it is understood that we apply all
these conventions. (In general, we will omit the ghost insertions for
convenience. They can be easily reinserted.)

When we generalize this action to the interacting case, we find yet another
complication, which is that there are many polyhedra at the Nth level that
satisfy all the constraints. Let us label by the index i the various polyhedra at
the Nth level that satisfy all constraints.

For example, for N = 4 and N = 5, we have only one polyhedron that
satisfies these constraints. However, for the N = 6 case, we have two polyhedra.
At the N = 7 case, we have five distinct polyhedra, and at the N = 8 level,
we have 17 different polyhedra.

From now on, we will label the ith polyhedron with n faces as $(n)_i$. As we did
in the light cone and covariant open string cases, we can define the following
vertex function (without ghosts), which satisfies the constraints of the $(n)_i$
polyhedron


n 

i=l 


7 = 1 


where 


and 


Vv 


XjiOTj) 

|V(„),.> = 0, 

(10.3.5) 

&i j )^ (Q j &i ) 

(10.3.6) 

if dij ^ 0, 

otherwise. 

(10.3.7) 


If the ith and jth polygons share no boundary, then $a_{ij} = 0$, and Eq. (10.3.6)
is equal to 0. Also, b t j and for the (w) f polyhedra are defined to be the 
common boundary between the adjacent polyhedra and are a rather obvious 
generalization of the three-string vertex in Eq. (9.2.20) in the light cone case. 

Then, let us define, for the $(n)_i$ polyhedra, the following string functional:

$$\langle\Psi^n\rangle_i = \langle\Psi_1|\langle\Psi_2|\cdots\langle\Psi_n|V_{(n)_i}\rangle. \qquad (10.3.8)$$

Now, let us define the field functional for the n-sided polyhedron by summing
over the index i:

$$\langle\Psi^n\rangle = \sum_i c(n)_i\,\langle\Psi^n\rangle_i, \qquad (10.3.9)$$

where the coefficients $c(n)_i$ tell us how to weight the ith polyhedron with n
faces. Finally, we write the action for the nonpolynomial theory

$$L = \langle\Psi|Q\Psi\rangle + \sum_{n=3}^{\infty}\alpha_n\langle\Psi^n\rangle, \qquad (10.3.10)$$





which we demand is invariant under

$$\delta|\Psi\rangle = Q|\Lambda\rangle + \sum_{n=1}^{\infty}\beta_n|\Psi^n\Lambda\rangle. \qquad (10.3.11)$$

Our goal is to find explicit values for $\alpha_n$, $\beta_n$, and especially
$c(n)_i$ by demanding that the action be invariant under the variation. One
complication is that external strings are always defined such that their length
is $2\pi$, while lines appearing within the vertex function may have lengths
different from $2\pi$. Thus, to differentiate between these two cases, we will
use the double bars $\|$ whenever the contraction is over states with length
$2\pi$.

The variation of the vertex function is thus

$$\delta\langle\Psi^n\rangle = n\,\langle\Psi^{n-1}\|\delta\Psi\rangle. \qquad (10.3.12)$$

We have used double bars here, because the length of the string
$\delta|\Psi\rangle$ is $2\pi$. Now, we can take the variation of the entire
action:

$$\delta L = 2\langle Q\Psi\|\delta\Psi\rangle + \sum_{n=3}^{\infty} n\,\alpha_n\langle\Psi^{n-1}\|\delta\Psi\rangle$$
$$\quad = 2\Big\langle Q\Psi\Big\|\sum_{n=1}^{\infty}\beta_n\Psi^n\Lambda\Big\rangle + \sum_{n=3}^{\infty} n\,\alpha_n\langle\Psi^{n-1}\|Q\Lambda\rangle + \sum_{n=3,\,m=1}^{\infty} n\,\alpha_n\beta_m\langle\Psi^{n-1}\|\Psi^m\Lambda\rangle. \qquad (10.3.13)$$


This variation is equal to zero if we have

$$2\langle Q\Psi\|\beta_{n-1}\Psi^{n-1}\Lambda\rangle + (n+1)\,\alpha_{n+1}\langle\Psi^n\|Q\Lambda\rangle + \sum_{p=1}^{n-2}(n-p+1)\,\alpha_{n-p+1}\,\beta_p\,\langle\Psi^{n-p}\|\Psi^p\Lambda\rangle = 0. \qquad (10.3.14)$$

Expressed in this way, the formula may not be that transparent, so let us write
it in the form

$$(-1)^n\langle\Psi^n\|Q\Lambda\rangle + n\,\langle Q\Psi\|\Psi^{n-1}\Lambda\rangle + \sum_{p=1}^{n-2} C^n_p\,\langle\Psi^{n-p}\|\Psi^p\Lambda\rangle = 0, \qquad (10.3.15)$$

where the unknowns are now encoded within $C^n_p$ and $c(n)_i$.

We now notice another complication not found in the open string case. We
realize that the vertex function in Eq. (10.3.14) cannot be BRST invariant:

$$\sum_{i=1}^{n} Q_i\,|V_{(n)_i}\rangle \neq 0. \qquad (10.3.16)$$

This means, of course, that the theory cannot be written with the standard
cohomological axioms [Eqs. (9.4.1)-(9.4.4)] that made the open string theory
so elegant and simple. Any naive attempt to fit the closed string field theory
into the cohomological framework is doomed to fail because of the failure of
BRST invariance of the vertex function.

From Eq. (10.3.14), we can immediately write the first equalities 


fin — ^( — + l)(n + 2)a„ +2 , 

C n _ (n-P + tyn-p+lfip (10.3.17) 

p (n + I)a „+1 

Let us, for the moment, keep only the C" as unknowns. Then, we can define 


so that 



fin 
c ; 


2 n ~ l g n ~ 2 

nn + 1! 

(-1) H ~ 1 2> + 1 )g n 

V+\\ 

+ 1 ) 

n — p\p + 1! 


(10.3.18) 


(10.3.19) 


where g is a one-parameter degree of freedom within the constraints that 
corresponds to the coupling constant. 

Now comes the more difficult part, actually calculating the various
coefficients $c(n)_i$ that appear within the expansion. The calculation is long
and arduous, so we will only mention the important aspects of the calculation.
Let us define $s(n)_i$ as the number of ways that the $(n)_i$ polyhedron can be
rotated into itself. For example, for the lower polyhedra, it is easy to see
(Fig. 10.9):

$$s(4) = 12,$$
$$s(5) = 6,$$
$$s(6)_1 = 24, \quad s(6)_2 = 2, \qquad (10.3.20)$$
$$s(7)_1 = 10, \quad s(7)_2 = 3, \quad s(7)_3 = 3,$$
$$s(7)_4 = s(7)_5 = 2.$$
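Several of these symmetry numbers can be reproduced by brute force. The sketch below is an illustration only: it assumes that the most symmetric polyhedra at each level can be modeled by a regular tetrahedron (4 faces), a triangular prism (5 faces), a cube (6 faces), and a pentagonal prism (7 faces), and it counts the proper rotations that map the vertex set of each solid onto itself. The helper names (`rotation_count`, `prism`) are hypothetical.

```python
from itertools import permutations, product
from math import cos, sin, pi

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def det3(a, b, c):
    return (a[0] * (b[1] * c[2] - b[2] * c[1])
            - a[1] * (b[0] * c[2] - b[2] * c[0])
            + a[2] * (b[0] * c[1] - b[1] * c[0]))

def rotation_count(verts, tol=1e-9):
    """Count proper rotations about the centroid (assumed to sit at the origin)
    that map the vertex set onto itself."""
    # reference triple of linearly independent vertices
    ref = next(t for t in permutations(verts, 3) if abs(det3(*t)) > tol)
    count = 0
    for img in permutations(verts, 3):
        # a rotation preserves orientation (the determinant) ...
        if abs(det3(*img) - det3(*ref)) > tol:
            continue
        # ... and all inner products among the reference vertices
        if any(abs(dot(img[i], img[j]) - dot(ref[i], ref[j])) > tol
               for i in range(3) for j in range(3)):
            continue
        # the rotation taking ref to img must send every vertex to a vertex
        if all(any(all(abs(dot(w, img[k]) - dot(v, ref[k])) <= tol
                       for k in range(3))
                   for w in verts)
               for v in verts):
            count += 1
    return count

def prism(n, half_height=0.5):
    """Right prism over a regular n-gon; it has n + 2 faces."""
    return [(cos(2 * pi * k / n), sin(2 * pi * k / n), s * half_height)
            for k in range(n) for s in (1, -1)]

tetrahedron = [(1, 1, 1), (1, -1, -1), (-1, 1, -1), (-1, -1, 1)]
cube = list(product((-1, 1), repeat=3))

print(rotation_count(tetrahedron))   # 4 faces
print(rotation_count(cube))          # 6 faces
print(rotation_count(prism(3)))      # 5 faces
print(rotation_count(prism(5)))      # 7 faces
```

Under these modeling assumptions the four counts are 12, 24, 6, and 10, in agreement with $s(4)$, $s(6)_1$, $s(5)$, and $s(7)_1$ in Eq. (10.3.20). The less symmetric polyhedra would need their explicit edge data, which is not attempted here.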

It turns out that the key to the entire calculation lies in the identity

$$s(n)_i\,c(n)_i = s(n)_j\,c(n)_j \qquad (10.3.21)$$

for any i and j. Then, we can write everything in terms of $s(n)_1$ and $c(n)_1$:

a n = 2 n -V -2 /5(n)ic(/i)i, 

5(n)i = 2 (n — 2). 


(10.3.22) 










If we factor out $s(n)_1$ in the vertex, then we can write a new vertex, called
the symmetrized vertex

$$\langle\Psi^n\rangle_{\rm SYM} = \sum_i \frac{\langle\Psi^n\rangle_i}{s(n)_i}. \qquad (10.3.23)$$

Then, the final result is

$$L = \tfrac{1}{2}\langle\Psi|Q\Psi\rangle + \sum_{n=3}^{\infty} 2^{n-2} g^{n-2}\,\langle\Psi^n\rangle_{\rm SYM}. \qquad (10.3.24)$$

This action can be written in a slightly more transparent fashion. Set
$g = \tfrac{1}{2}$ and write the interaction explicitly in terms of the $(n)_j$
polyhedra. Then, the action becomes


$$L = \tfrac{1}{2}\langle\Psi|Q\Psi\rangle + \sum_{n=3}^{\infty}\frac{\langle\Psi^n\rangle_j}{s(n)_j}. \qquad (10.3.25)$$

This is our final form for the action. It seems rather surprising that the overall
coefficient of the interaction corresponding to the polyhedra $(n)_j$, modulo
rotations, is exactly equal to 1!


10.4 Conformal Maps 

For the scattering of four closed strings (with equal circumference) at the tree
and one-loop level, the conformal map is easy to write. For the tree amplitude,
the Schwarz-Christoffel transformation gives us the following map:

$$\frac{d\rho}{dz} = N\,\frac{\prod_i (z - v_i)^{1/2}}{\prod_j (z - \gamma_j)}, \qquad (10.4.1)$$





where the points $\gamma_i$ are mapped to infinity and represent external
strings, while Riemann cuts connecting the various $v_i$ represent the
interaction, that is, the line along which two closed strings merge into a third.

To obtain the one-loop, four-puncture scattering amplitude, one simply
replaces the various factors $(z - z_i)$ appearing in the above map with
$\theta_1(z - z_i)$. However, for higher loops, replacing $\theta_1$ with the
generalized $\Theta$ function fails to yield the conformal map for the genus g
surface. The generalization to the arbitrary case is nontrivial.

As a result, we will derive the conformal map $d\rho = \omega_z\,dz$ for the
g-handle, p-puncture graph by carefully analyzing uniqueness arguments.

We wish to have:

(1) a conformal map whose square transforms as a quadratic differential;

(2) a map whose only singularities are double poles at p points $\gamma_i$, and
whose only zeros are $2p + 4g - 4$ points $v_i$; and

(3) a periodic function defined on the surface so that the point z going around
any a cycle or b cycle will return back to the same point.

Our task is to use these restrictions to find the unique conformal map taking 
us from a Riemann surface with g handles and p punctures to the flat two- 
dimensional world sheet describing a Feynman-like diagram with g internal 
loops and p external legs. 

To find the unique conformal map, it will be useful to first review some
essentials concerning holomorphic and meromorphic functions defined on a
Riemann surface. On a genus g Riemann surface, we can define g first Abelian
differentials $\omega_i$ on the surface that have no singularities.

We can now define $\Omega$ as the period matrix, which obeys

$$\Omega_{ij} = \oint_{b_i}\omega_j, \qquad \delta_{ij} = \oint_{a_i}\omega_j. \qquad (10.4.2)$$

The period matrix is conformally invariant. To each distinct period matrix there
is a distinct Riemann surface.

Now, we wish to define periodic functions on this Riemann surface. Let us 
review how this was done for the single-loop torus. For that case, we know 
that we can deform the a cycle and b cycle so that they intersect at a common 
point on the surface. If we cut the torus along the a cycle and b cycle and then 
unravel the surface, we find a parallelogram, as in Chapter 4, whose opposite 
sides are identified. If we copy this parallelogram an infinite number of times 
on the complex plane, then we obtain a lattice. Then, it is straightforward to 
define a $\theta$ function on this surface that has the correct periodicity properties.
This periodicity is achieved by summing a function over the infinite lattice, so 
that displacements along the lattice leave the function invariant. 

Let us repeat these same steps for the genus g Riemann surface. There are
now 2g cycles. Now, take a point on this surface and deform the cycles so that
they all intersect this point once. Cut along these 2g cycles that intersect this
point. Unravel the surface and find a polygon whose sides are identified. By
traveling across any side of this polygon, we return to the polygon, but from
another side. Now, extend this polygon periodically in all directions, so that
we have a lattice of polygons.

Let us label the sites of the lattice. Let boldface indices $\mathbf{n}$ be
g-component vectors with integer entries. Then, the generalized $\Theta$ function
can be defined on the lattice. Its periodicity property arises because it is
summed over all lattice sites. It can be defined as [12-15]:

$$\Theta(\mathbf{z}|\Omega) = \sum_{\mathbf{n}\in\mathbb{Z}^g} \exp\left(i\pi\,\mathbf{n}^T\Omega\,\mathbf{n} + 2\pi i\,\mathbf{n}^T\mathbf{z}\right). \qquad (10.4.3)$$
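The lattice sum in Eq. (10.4.3) converges quickly and is easy to evaluate numerically. The sketch below is a minimal illustration, not a production implementation: the truncation parameter `cutoff`, the genus-2 period matrix, and the test point are arbitrary choices made here, and the period matrix is assumed to be symmetric with positive-definite imaginary part. It checks exact periodicity under integer shifts and quasi-periodicity under shifts by $\Omega\cdot\mathbf{k}$.

```python
import cmath
from itertools import product
from math import pi, gamma

def riemann_theta(z, omega, cutoff=8):
    """Truncated lattice sum for Theta(z|Omega), Eq. (10.4.3).

    z      : sequence of g complex numbers
    omega  : g x g symmetric matrix (nested lists), Im(omega) positive definite
    cutoff : each integer n_i runs over -cutoff..cutoff (illustrative truncation)
    """
    g = len(z)
    total = 0j
    for n in product(range(-cutoff, cutoff + 1), repeat=g):
        quad = sum(n[i] * omega[i][j] * n[j] for i in range(g) for j in range(g))
        lin = sum(n[i] * z[i] for i in range(g))
        total += cmath.exp(1j * pi * quad + 2j * pi * lin)
    return total

# genus-2 example with an arbitrary, illustrative period matrix
omega = [[1.0j, 0.3 + 0.2j],
         [0.3 + 0.2j, 1.5j]]
z = [0.17 + 0.05j, -0.31 + 0.11j]

# shift by an integer vector m: exactly periodic
m = [1, -2]
z_m = [z[i] + m[i] for i in range(2)]
assert abs(riemann_theta(z_m, omega) - riemann_theta(z, omega)) < 1e-10

# shift by Omega.k: periodic up to an explicit exponential factor,
# obtained by completing the square in the exponent
k = [1, 0]
z_k = [z[i] + sum(omega[i][j] * k[j] for j in range(2)) for i in range(2)]
kOk = sum(k[i] * omega[i][j] * k[j] for i in range(2) for j in range(2))
factor = cmath.exp(-1j * pi * kOk - 2j * pi * sum(k[i] * z[i] for i in range(2)))
assert abs(riemann_theta(z_k, omega) - factor * riemann_theta(z, omega)) < 1e-10

# genus-1 sanity check against the closed form theta_3(0|i) = pi^(1/4) / Gamma(3/4)
assert abs(riemann_theta([0.0], [[1j]]) - pi ** 0.25 / gamma(0.75)) < 1e-12
```

A plain nested-list representation is used so that the block needs only the standard library; for larger genus a vectorized sum would be the natural replacement.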


The $\Theta$ function is no longer defined as a simple function of the complex
parameter z. Instead, it is defined in terms of a vector $\mathbf{z}$ on the
lattice

$$\mathbf{z} = \int_{p_0}^{z}\boldsymbol{\omega}, \qquad (10.4.4)$$

where $p_0$ is an arbitrary point (which will disappear when we form the
conformal maps).

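As we know from our discussion of the single-loop torus, we can also define
a spin structure on the surface, depending on whether we have periodic or
antiperiodic boundary conditions when we move completely around a cycle.
There are four possible spin structures for the torus. Likewise, we can define
a generalized $\Theta$ function on the lattice with a spin structure

$$\Theta\begin{bmatrix}\alpha\\ \beta\end{bmatrix}(\mathbf{z}|\Omega) = \sum_{\mathbf{r}-\alpha\in\mathbb{Z}^g} \exp\left[i\pi\,\mathbf{r}^T\Omega\,\mathbf{r} + 2\pi i\,\mathbf{r}^T(\mathbf{z}+\beta)\right], \qquad (10.4.5)$$

where the spin structures $[\alpha]$ and $[\beta]$ are two component spinors with
g entries.

There are $2^{2g}$ possible spin structures, corresponding to the different ways
in which we can transport a two-dimensional spinor across the various boundaries
or cycles of the polygon, picking up factors of $+1$ or $-1$ in the process. Under
a shift in the lattice

$$\mathbf{z} \to \mathbf{z} + \Omega\cdot\mathbf{n} + \mathbf{m}, \qquad (10.4.6)$$

we find that the generalized $\Theta$ function transforms as

$$\Theta\begin{bmatrix}\alpha\\ \beta\end{bmatrix}(\mathbf{z} + \Omega\cdot\mathbf{n} + \mathbf{m}|\Omega) = \exp\left[-i\pi\,\mathbf{n}^T\Omega\,\mathbf{n} - 2\pi i\,\mathbf{n}^T\mathbf{z} + 2\pi i\,(\alpha^T\mathbf{m} - \beta^T\mathbf{n})\right]\Theta\begin{bmatrix}\alpha\\ \beta\end{bmatrix}(\mathbf{z}|\Omega). \qquad (10.4.7)$$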
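For genus one the four spin structures can be checked directly. The sketch below is an illustration under stated assumptions: it specializes the sum over $\mathbf{r} = \mathbf{n} + \alpha$ to g = 1, takes the characteristics $\alpha, \beta \in \{0, \tfrac{1}{2}\}$, and tests the quasi-periodicity factor that follows from completing the square in that sum. The function name `theta_char` and the sample values of $\tau$ and z are choices made here.

```python
import cmath
from math import pi

def theta_char(z, tau, a, b, cutoff=30):
    """Genus-1 theta function with characteristics (a, b):
    sum over r = n + a, n integer, of exp(i*pi*r^2*tau + 2*pi*i*r*(z + b))."""
    total = 0j
    for n in range(-cutoff, cutoff + 1):
        r = n + a
        total += cmath.exp(1j * pi * r * r * tau + 2j * pi * r * (z + b))
    return total

tau = 0.2 + 0.9j          # illustrative modular parameter, Im(tau) > 0
z = 0.13 - 0.07j          # illustrative point

# 2^(2g) = 4 spin structures for g = 1
spin_structures = [(a, b) for a in (0.0, 0.5) for b in (0.0, 0.5)]
assert len(spin_structures) == 4

for a, b in spin_structures:
    for n, m in [(1, 0), (0, 1), (1, -1)]:
        lhs = theta_char(z + tau * n + m, tau, a, b)
        phase = cmath.exp(-1j * pi * n * n * tau - 2j * pi * n * z
                          + 2j * pi * (a * m - b * n))
        rhs = phase * theta_char(z, tau, a, b)
        assert abs(lhs - rhs) < 1e-9 * max(1.0, abs(lhs))

# the odd spin structure (1/2, 1/2) gives a function that vanishes at z = 0,
# because the terms r and -r cancel pairwise
assert abs(theta_char(0.0, tau, 0.5, 0.5)) < 1e-12
```

The vanishing of the odd theta function at the origin is what makes it the natural building block for a function that behaves like $z_i - z_j$ at short distance.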


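We can also define the function that generalizes the function $z_i - z_j$ on the
complex plane. It is called the “prime form,” and is represented by $\Theta$
functions as follows

$$E(z, w) = \frac{\Theta\begin{bmatrix}\alpha\\ \beta\end{bmatrix}\left(\int_w^z\boldsymbol{\omega}\,\Big|\,\Omega\right)}{h\begin{bmatrix}\alpha\\ \beta\end{bmatrix}(z)\;h\begin{bmatrix}\alpha\\ \beta\end{bmatrix}(w)}, \qquad (10.4.8)$$

where:

$$\left(h\begin{bmatrix}\alpha\\ \beta\end{bmatrix}(z)\right)^2 = \sum_{i=1}^{g}\frac{\partial\Theta\begin{bmatrix}\alpha\\ \beta\end{bmatrix}(0|\Omega)}{\partial u_i}\,\omega_i(z), \qquad (10.4.9)$$

where $\alpha$ and $\beta$ label the spin structure on the Riemann surface. (The
prime form’s dependence on the spin structure will drop out.)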

We will also make use of the celebrated Riemann vanishing theorem [12,
13], which allows us to compute the zeros of the $\Theta$ function. It states
that the function


0 



(10.4.10) 


either vanishes identically or has g zeros, which are located at the points
$z_i$, which satisfy



+ A Zo + n + £2m, 


where 


(Ajo); 



(10.4.11) 


(10.4.12) 


Unfortunately, with $\Theta$ and $E(z, z')$ alone, we cannot satisfy constraints
(1), (2), and (3). We need yet one more function defined on the lattice, given by
the $\sigma(z)$ function of Refs. 13 and 15:


a(z) = exp 



In E(z', z) 


(10.4.13) 


In Ref. 16, these three functions were used to construct the conformal map for 
the multiloop open string case. We will generalize this map for the genus g 
closed string case. 

Let us now analyze the singularity and conformal properties of these three
functions. We note that the prime form $E(z', z)$ transforms as a $-\tfrac{1}{2}$
differential in z, that $\Theta$ is locally a zero differential, and that
$\sigma$ transforms as a g/2 differential.

$E(z', z)$ has a zero at $z' = z$ but no poles, $\sigma$ has no zeros or poles,
and $\Theta$, by the Riemann vanishing theorem, can have g zeros [12,13]. Then,
there is one unique combination of $\Theta$, prime form, and the $\sigma$
function that transforms as a quadratic differential and has the correct zero
and pole structure [16, 17]:



x 


n-4 + +f 4 £(^ift) 

W P j= \ E(z, y,|£2) 1 2 


kOOl 3 - 


(10.4.14) 


By counting the differential order of each factor, we see that the conformal
map has the correct order of 2:

$$-\tfrac{1}{2}(2p + 3g - 4) + p + \tfrac{3g}{2} = 2. \qquad (10.4.15)$$

The conformal map is then found by taking the integral of the square root of
the map.
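The order counting can be verified mechanically for any genus and number of punctures. The sketch below is illustrative only; the factor content it assumes ($2p + 3g - 4$ prime forms in the numerator, two prime forms per puncture in the denominator, one $\Theta$ function, and three powers of $\sigma$) is read off from the counting in the text rather than from an explicit formula, and the function names are hypothetical.

```python
from fractions import Fraction

def differential_order(g, p):
    """Total conformal weight of the combination, under the assumed factor content."""
    w_prime_form = Fraction(-1, 2)   # E is a -1/2 differential
    w_theta = Fraction(0)            # Theta is locally a 0 differential
    w_sigma = Fraction(g, 2)         # sigma is a g/2 differential
    numerator = (2 * p + 3 * g - 4) * w_prime_form + w_theta
    denominator = 2 * p * w_prime_form
    return numerator - denominator + 3 * w_sigma

def zero_count(g, p):
    """Zeros: g from Theta (Riemann vanishing theorem) plus one per prime form."""
    return g + (2 * p + 3 * g - 4)

for g in range(0, 6):
    for p in range(1, 8):
        if 2 * p + 3 * g - 4 < 0:
            continue  # no sensible factor content for this (g, p)
        assert differential_order(g, p) == 2
        assert zero_count(g, p) == 2 * p + 4 * g - 4
```

Exact rational arithmetic is used so that the half-integer weights do not pick up rounding error.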

Let us now relate the singularity of the map with the world sheet of the
closed string scattering amplitudes. Because of the prime forms $E(z', z)$ in the
denominator, the double poles, corresponding to the p punctures, are located
at $\gamma_i$. Furthermore, the $2p + 4g - 4$ zeros at $v_i$ come from the
$\Theta$ function and the prime forms in the numerator. The double poles, after
taking the square root, then correspond to the p external lines of the string
scattering world sheet. Each pair of zeros of the map, in turn, corresponds to
the merger of three closed strings. (At first, the map seems to be unsymmetrical
with respect to the various $v_i$, since some of them are to be found within the
$\Theta$ function and others within the prime form. However, by simply redefining
the constant A, we can interchange the various $v_i$ and show that the function
is really symmetrical in all $v_i$.)

Now that we have explicitly constructed the conformal map for the
nonpolynomial theory for arbitrary genus g and puncture p, let us count modular
parameters and verify that we have the correct counting. In general, we want
the total number of unknowns, contained within the complex parameters in the
map, to be equal to the sum of the dimension of moduli space ($6g - 6 + 2p$)
plus the number of constraints we place on the complex parameters to fix the
overall shape of the conformal surface, that is,

# unknowns = # constraints + # moduli. (10.4.16)

This equation is easily checked. The total number of unknowns in the
conformal map is given by the complex variables N, $\gamma_i$, and $v_i$, which,
respectively, total $2 + 2p + 2(2p + 4g - 4) = 6p + 8g - 6$ unknowns.
Furthermore, the dimension of moduli space is equal to $6g - 6 + 2p$. The number
of constraints can be broken down as follows:

(1) six come from fixing the overall projective transformations on the complex
plane;

(2) 2p come from fixing the residue of the pole at each $\gamma_i$ to be a real
number $2\pi$, which fixes the circumference of each cylinder;

(3) $2(p + g - 2)$ come from fixing the real and imaginary parts of the various
Riemann cuts to conform to the geometry of a closed string scattering
amplitude.

If we pair off the points $v_i$, then

$$\mathrm{Re}\,\rho(v_i) = \mathrm{Re}\,\rho(v_j) \qquad (10.4.17)$$

for all i and j in a pair. This places the Riemann cut vertically in the complex
$\rho$ plane. Then, we also have

$$\mathrm{Im}\,\rho(v_i) - \mathrm{Im}\,\rho(v_j) = \pm\pi, \qquad (10.4.18)$$

which fixes the overlap between the ith and jth string to be $\pi$.

Last, we must subtract 2 from the number of constraints. This is because the
system is actually overconstrained. The sum of the residues at $\gamma_i$ plus
the line integral around the Riemann cuts equals zero. [If we take a line
integral around an infinitesimally small circle in the complex plane, the residue
is zero. Now, expand this small circle until it engulfs all pairs of Riemann cuts
and extends out to infinity. Because the line integral is still zero, this means
that the sum of the residues at $\gamma_i$ does not vanish (as in the light cone
case), but cancels against the sum of the line integrals around the Riemann
cuts.]

Putting everything together, we now have $4p + 2g$ constraints, so the total
number of unknowns $6p + 8g - 6$ equals the sum of the number of moduli
plus the number of constraints, as expected.
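The bookkeeping of Eq. (10.4.16) can be tabulated in a few lines. This sketch simply transcribes the counts quoted above (the function names are illustrative) and checks the balance for a range of genera and punctures.

```python
def unknowns(g, p):
    # N (1 complex), gamma_i (p complex), v_i (2p + 4g - 4 complex);
    # each complex variable contributes two real unknowns
    return 2 + 2 * p + 2 * (2 * p + 4 * g - 4)

def constraints(g, p):
    projective = 6               # overall projective transformations
    residues = 2 * p             # residue at each puncture fixed to 2*pi
    cuts = 2 * (p + g - 2)       # real and imaginary parts of the Riemann cuts
    overcounting = 2             # residues and cut integrals are not independent
    return projective + residues + cuts - overcounting

def moduli(g, p):
    return 6 * g - 6 + 2 * p

for g in range(0, 5):
    for p in range(0, 7):
        assert unknowns(g, p) == 6 * p + 8 * g - 6
        assert constraints(g, p) == 4 * p + 2 * g
        assert unknowns(g, p) == constraints(g, p) + moduli(g, p)
```

For example, the one-loop tadpole (g = 1, p = 1) has 8 unknowns, 6 constraints, and 2 moduli, which are the loop length and twist angle.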

The conformal map also accommodates the possibility of arbitrary polyhedra
occurring within a loop amplitude. Polyhedra occur when the real parts of
several pairs of Riemann cuts coincide and the difference of their imaginary
parts no longer equals $\pi$. Since each $v_i$ is mapped onto a vertex of a
polyhedron, the various edges within the polyhedron can have varying lengths,
corresponding to varying differences between the imaginary parts of $\rho(v_i)$.

For example, the conformal map can create a polyhedron with M vertices
when M/2 pairs of Riemann cuts all have the same real part
$\mathrm{Re}\,\rho(v_i)$. By varying the differences within their imaginary
parts, one can vary the lengths of the edges of the polyhedron. In this way, by
clustering pairs of Riemann cuts located at $v_i$ into different groups, we can
create polyhedra of arbitrary complexity within a loop diagram.

(As an aside, notice that the map is so general that we can also accommodate
all possible light cone configurations as well by changing boundary conditions.
By letting the external strings have arbitrary circumferences and by collapsing
all pairs of Riemann cuts into single points, we find that the sum of the residues
of the poles now sums to zero, thus reproducing the boundary conditions of
the light cone theory. Furthermore, it is commonly thought that one cannot
smoothly distort a theory where strings interact at their midpoints into a theory
where strings interact at their endpoints, as in the light cone theory, that is, the
light cone limit is singular. However, this is incorrect. By explicit computation,
one can show that one can smoothly take the limit in the conformal map and
reach the light cone theory, as predicted in geometric string field theory [6].)


10.5 Tadpoles 


The conformal map [Eq. (10.4.14)] is general enough to include all possible
multiloop graphs, including the one-loop tadpole, where the residues of the
poles do not have to sum to zero. Unfortunately, we will show that these
tadpoles violate modular invariance (giving us an infinite overcounting of moduli
space) and hence pose problems for our original action [Eq. (10.3.24)].
Originally, it was thought that the violation of modular invariance for the
tadpole was sufficient to kill all possible closed string field theories. We will
see that there is a simple resolution to this puzzle, which is that Eq. (10.3.24)
must be treated as a classical action and that quantum corrections must be added
to it to restore modular invariance. (These tadpoles do not occur in the light
cone field theory because the string length is proportional to the momentum
$p^+$. Since momentum is conserved, so is string length across a three-string
vertex, so tadpole graphs are forbidden. However, in the nonpolynomial theory,
because string lengths are fixed and therefore independent of the momentum,
tadpole graphs must be considered.)

Let us specialize to the case of g = 1. Then 

a(z)=l, = (10.5.1) 

and the prime form reduces to 


E(z, w) - 


2niy/zw®\\\\ 


In (z/w) 


2tz i 




0[j](°|fi) 


Then, the map for the g = 1, p-puncture diagram is 


(10.5.2) 


$$\frac{d\rho}{dz} = N\,\frac{\sqrt{\prod_{i=1}^{p}\theta_1(z - u_i)\,\theta_1(z - v_i)}}{\prod_{i=1}^{p}\theta_1(z - \gamma_i)}, \qquad (10.5.3)$$

where $u_i$ and $v_i$ are the splitting points, $\gamma_i$ the punctures, and

N = — ~ Yi) _ . (10.5.4) 

VTE.1 0i(Ki - h)0i(Ti - Vi) 

In this form, the poles and zeros of the map are manifest. However, from the
theory of conformal maps, we also know that Riemann surfaces with genus
$g \le 2$ can be written as hyperelliptic functions (without using $\Theta$
functions) that are formed by gluing conformal planes together across several
Riemann cuts. Let us check, therefore, that our map can be written as a
hyperelliptic function for the one-loop, one-puncture tadpole, which can be
written as [19]:


d p _ >u 

dz s/z(z — l)(z —x)’ 


(10.5.5) 


where we join the two surfaces along cuts that go from 0 to 1 and from x to
$\infty$ (see Fig. 10.10).

To show the equivalence between Eqs. (10.5.3) and (10.5.5), let us first make
a series of changes of variables. We define $z = t^2$ and


O^Oiiy) 

02 W 


_ r‘ _Jr_ 

/X ~Jo V(1 -r 2 )(l -kh 2 ) 

0i = 0 ,( 0 ). 


(10.5.6) 





Using standard theta function relations, one finds 

0l(v)0l , ,. 2.2 d l( v ) e l 


i 2 ^2' 

= e!WY 

Also, define 


6&(v) 


o* ele 2 {y)e^v) ^ 

dt = — ——;- dv. 


02 eliv) 


y = 


03 01 (vO 


or' 
oJ . 


e 3 e 4 (v' + x/2) 


.02 04 (v')j le 2 9 x {V + x/2)_ 

Putting everything together, we find 

dp #,'(0) *J—0\(v + v' + x/2)6\(v — v' — r/2) 


dv 9\{v' + x/2) 


0i(v) 


(10.5.7) 

(10.5.8) 

(10.5.9) 


which is identical to Eq. (10.5.3), as desired. Because Eq. (10.5.5) is a
hyperelliptic map, it is formed by sewing two Riemann sheets together across two
cuts.

To conform with the usual string world sheet, we will parametrize the surface
by two variables, t, which represents the circumference of the loop, and
$\theta$, which represents the twist angle within the loop. We now place the
constraints ($T = t + i\theta$):

lo dZ> Jz( 1 -z)(* -z)’ 


(10.5.10) 


where $t \in [0, \infty]$ is the length of the loop, while
$\theta \in [-\pi/2, \pi/2]$ is the twist angle.

To study the modular properties of this map [Eq. (10.5.5)], we must
determine the region in $\tau$ space that corresponds to the tadpole moduli space
given by

$$0 < \theta < 2\pi, \qquad 0 < T < \infty. \qquad (10.5.11)$$

The problem arises in the dangerous region $T \to 0$, which corresponds to
$\tau \to 0$. We will be interested in the regions $x \to 1$ and $y \to 1$. By a
direct power expansion of Eq. (10.5.10) for small x, we can show

T (x) ~ -y Yn[x-l) + 0 [(x ~ 1)ln( * " 1)] ‘ (10-5.12) 

Now, let us expand $\tau$ as a power of x. We know that the period matrix $\tau$
can be written in terms of the integrals over the first Abelian differentials, as
in Eq. (10.4.2). The hyperelliptic surface is created by gluing two sheets
together along the cuts from 0 to 1 and from x to $\infty$. The holomorphic
differential is then dz/y. By taking the integrals over the cycles, we find

$$\tau = \frac{\omega_2}{\omega_1}, \qquad \omega_1 = \oint_a \frac{dz}{y}, \quad \omega_2 = \oint_b \frac{dz}{y}, \qquad (10.5.13)$$

where $y^2 = z(z - 1)(z - x)$ as in Eq. (10.5.5).

We can write the relation between x and $\tau$ explicitly

$$x(\tau) = \frac{\theta_3^4(0|\tau)}{\theta_2^4(0|\tau)}. \qquad (10.5.14)$$

Power expanding in small x, we find


™ ~- ln[(,'-l/16] + - 1)/ln<1 - »1 (1 °- 515) 

Comparing Eqs. (10.5.12) and (10.5.15), we find that the final result linking
T and $\tau$ is [19]:

$$\tau \sim \frac{2i}{\pi}\,T \qquad (10.5.16)$$

for small T. This is bad for modular invariance. Because moduli space [Eq.
(10.5.11)] includes a finite region around $T \sim 0$, this also means that it
includes a finite region around $\tau \sim 0$. But the origin of the $\tau$ plane
includes an infinite number of copies of the modular region, and hence
Eq. (10.5.11) (which comes from string field theory) maps into an infinite number
of copies of the single-loop amplitude.
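The behavior of the hyperelliptic parameter near $\tau \to 0$ can be illustrated numerically. The sketch below rests on an assumption that should be stated plainly: it takes the relation between x and $\tau$ to be $x = \theta_3^4(0|\tau)/\theta_2^4(0|\tau)$, which is the standard form for which $x \to 1$ as $\tau \to 0$. With that reading, Jacobi's identity and the modular transformation of the theta constants give $x - 1 \approx 16\,e^{-i\pi/\tau}$, so that $\tau \approx -i\pi/\ln[(x-1)/16]$. The function names and the sample value of $\tau$ are illustrative.

```python
import cmath
from math import pi

def theta_constants(tau, terms=60):
    """Jacobi theta constants theta_2, theta_3, theta_4 at z = 0,
    summed directly from their series in exp(i*pi*tau)."""
    th2 = 2 * sum(cmath.exp(1j * pi * tau * (n + 0.5) ** 2) for n in range(terms))
    th3 = 1 + 2 * sum(cmath.exp(1j * pi * tau * n * n) for n in range(1, terms))
    th4 = 1 + 2 * sum((-1) ** n * cmath.exp(1j * pi * tau * n * n)
                      for n in range(1, terms))
    return th2, th3, th4

def x_of_tau(tau):
    """Assumed form of the x(tau) relation: x = theta_3^4 / theta_2^4."""
    th2, th3, _ = theta_constants(tau)
    return (th3 / th2) ** 4

tau = 0.25j                                   # a point close to tau = 0
th2, th3, th4 = theta_constants(tau)
assert abs(th3 ** 4 - th2 ** 4 - th4 ** 4) < 1e-9   # Jacobi's identity

x = x_of_tau(tau)
tau_estimate = -1j * pi / cmath.log((x - 1) / 16)   # leading small-tau behavior
print(x, tau_estimate)
assert abs(tau_estimate - tau) < 1e-4
```

The logarithm shows why a finite neighborhood of $x = 1$ is squeezed into the neighborhood of $\tau = 0$, where infinitely many images of the fundamental modular region accumulate.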

This unfortunate situation is not just particular to the one-puncture graph. 
For example, for the map of Eq. (10.4.14), we can also take the case of g = 1 
and p = 2. By changing variables to hyperelliptic coordinates, we find that 
the map is given by [18]: 


dp _ v y(z-yXz-y) 

dz (z + y 2 Wz(z - l)(z -x) 


(10.5.17) 


For different parameters, this map will correspond to the three different
diagrams that one can write for a one-loop tadpole with two punctures. The
circumference of the loop, the length of the stem of the tadpole, the twist
angle, etc., can be formed by fixing the various line integrals along the cut. A
careful analysis of this graph shows that when the length of the neck of the
tadpole graph becomes infinite, we reproduce the original tadpole map with one
puncture, which is known to violate modular invariance. Thus, this violation
of modular invariance by tadpoles is a persistent feature of the action.

In sum, we find that our original nonpolynomial action successfully
reproduces all tree graphs for the closed string theory, but fails at the loop
level. This means that the action is really a classical action and that a fully
quantized action must be modified by a second series of nonpolynomial graphs,
which contain loop contributions that kill the tadpole divergences.

Furthermore, we can also show that the functional measure $D\Phi$ is not
invariant under the gauge variation [Eq. (10.3.11)]. In other words, the original
action [Eq. (10.3.24)] suffers from two problems, the lack of modular invariant
multiloop graphs and the presence of anomalies in the variation of the
functional measure. Actually, this problem is really a blessing in disguise. By
adding a second nonpolynomial series of terms into the action, we can kill both
problems [21].

At first, this may seem strange. The presence of anomalies in the functional
measure is a local property of the field theory, while the lack of modular
invariance is a global property of the modular group. It seems strange that, by
adding a second series of nonpolynomial terms, we can kill both contributions.

The reason is that the breakdown of modular invariance creates
an infinite number of copies of the loop graphs, which in turn is responsible for
the ultraviolet divergences of the theory. However, as shown by Shapiro, one
can simply take one cover of moduli space and thus eliminate the ultraviolet
divergences. Thus, the lack of ultraviolet divergences in string theory lies in
modular invariance. Similarly, the presence of anomalies in the functional
measure also arises from ultraviolet divergences. We find, therefore, that both
the presence of anomalies and the lack of modular invariance have a common
origin, the ultraviolet behavior of string theory. Thus, it is not surprising that
a second nonpolynomial set of terms is required to make the action complete
and that it can solve both problems simultaneously.


10.6 Summary 

The ease and elegance with which the covariant open string field theory could 
be written has to be balanced against the difficulty and frustration faced in 
attempts to construct the covariant closed string field theory. However, since 
the heterotic string is a closed string theory, it is important to solve this pressing 
problem. The origin of this problem lies deep within a classical mathematical 
problem, the triangulation of moduli space. After a century of unsuccessful 
attempts to find specific coordinates that could cover moduli space once and 
only once, only three triangulations have been discovered, and two of them 
arise naturally from string field theory. 

The old light cone field theory, for example, successfully triangulated moduli 
space with only cubic actions. However, attempts to repeat the success of the 
open string field theory (with fixed parametrization length) failed. Specifically, 
the four-point scattering amplitudes created by gluing three symmetric closed 
string vertices together failed to cover the Koba-Nielsen space. 

The missing region can be seen to have the topology of a tetrahedron. If
$a_{ij}$ is the length that adjoins the ith and jth legs of this tetrahedron,
then we have six unknown $a_{ij}$ and four constraints (because the
circumference of each closed string equals $2\pi$). The perimeter of this missing
region is represented by the motions one can execute on a Rubik’s cube, that is,
by making rotations around the horizontal x or vertical z axis


Cl\2 ~ ft, Cl 14 — 71 ~ 

an —ft 0, ai4 = 7r, 

#12 + #14 = ft • 


( 10 . 6 . 1 ) 


( 10 . 6 . 2 ) 


The missing region is the triangular region contained within these boundaries 

i #12> #13 — ft's /i a /^\ 

a 12 + a 13 >;r. (10 - 6 ' 2) 

It is easy to see that the missing region smoothly connects the s-, t-, and
u-Mandelstam channels together, in the same way that the open four-string graph
smoothly connected the t- and u-channel graphs of the Veneziano amplitude
in the light cone gauge.

By explicit calculation, we can then divide the Veneziano amplitude into 
several pieces, corresponding to scattering in the various Mandelstam channels 
and the missing region (which has no poles) 


A(s, t, u) = ^ A 22 


r AL + (2 3) + (2 4* 4) + M R (s, t , u). 


(10.6.3) 

Not surprisingly, this missing region persists for the higher point functions.
The five-point function has the missing region

$$a_{12} \le a_{35} \le a_{12} + a_{24} \le a_{35} + \pi,$$
$$a_{13} \le a_{24} \le a_{13} + a_{35} \le a_{24} + \pi, \qquad (10.6.4)$$
$$a_{12} + a_{13} \le \pi.$$

For higher point graphs, we must add one more constraint. If we slice the
polyhedron in half into two smaller polygons, then $P_I$, the perimeter of each
polygon, must satisfy

$$P_I = \sum_{i\in L,\; j\in R} a_{ij} \ge 2\pi. \qquad (10.6.5)$$

(We need this constraint in order for the missing region to adjoin various
Mandelstam channels, since each channel contains a $P_I$ equal to $2\pi$.) This
missing region also shows up explicitly in the conformal map we use to map
the complex plane to the world sheet of the closed string scattering amplitudes

$$\frac{d\rho}{dz} = N\,\frac{\prod_{i=1}^{N-2}\left[(z - z_i)(z - \tilde z_i)\right]^{1/2}}{\prod_{j=1}^{N}(z - \gamma_j)}. \qquad (10.6.6)$$

This map contains square roots, so the Riemann cuts correspond to the
overlap between three closed strings. The counting of Koba-Nielsen variables
for the tree diagrams comes out correct

$$\text{Koba-Nielsen variables} = 2N - 6 = \sum_{i=1}^{N}\#(\gamma_i) - 6, \qquad (10.6.7)$$






while for the unknowns/constraints, we have 

4 N — 6 unknowns: (N, Zi\ 

r , (10.6.8) 

4 N — 6 constraints: [/>(/;), Re p(zt) the samej. 

The action for the closed string field theory differs from the open string case
in several ways:

(1) Ghost counting comes out incorrectly, which necessitates inserting a ghost
factor into the free action.

(2) The constraint $(L_0 - \bar L_0)|\Psi\rangle = 0$ must be added in by hand (or
the projection operator must be inserted everywhere in the action).

(3) There is more than one polyhedron at each level.

(4) The vertex function is not BRST invariant, so we cannot use the
cohomology axioms found for the open string field theory.


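As a first guess, let us postulate the action

$$L = \langle\Psi|Q\Psi\rangle + \sum_{n=3}^{\infty}\alpha_n\langle\Psi^n\rangle, \tag{10.6.9}$$

which we demand is invariant under

$$\delta|\Psi\rangle = |Q\Lambda\rangle + \sum_{n=1}^{\infty}\beta_n|\Psi^n\Lambda\rangle. \tag{10.6.10}$$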

Inserting the variation into the action, we find that the resulting terms must all
vanish, which reduces to the following:

$$(-1)^n\langle\Psi^n\|Q\Lambda\rangle + n\langle Q\Psi\|\Psi^{n-1}\Lambda\rangle + \sum_{p=1}^{n-2}C_p^n\langle\Psi^{n-p}\|\Psi^p\Lambda\rangle = 0, \tag{10.6.11}$$

where the unknowns are now encoded within C n and c(n)i. 

After a lot of hard work, we find that the final action is quite simple 


oo 

L = i(vl'|(2vi/) + ^ 

n =3 


in 

s(n)j ’ 


( 10 . 6 . 12 ) 


that is, the overall coefficient of each graph (modulo rotations) is equal to 1! 

To generalize these results to loops, we want a conformal map that satisfies 
the following properties. We want: 

(1) a conformal map whose square transforms as a quadratic differential; 

(2) a map whose only singularities are double poles at $p$ points $\gamma_i$, and whose
only zeros are $2p + 4g - 4$ points $v_i$; and

(3) a function defined on the Picard torus, so that the point z going around any 
a cycle or b cycle will return back to the same point. 





Fortunately, we can construct periodic and quasi-periodic functions on the
genus $g$ Riemann surface using $\Theta$ functions, which are defined as

$$\Theta(z|\Omega) = \sum_{n \in \mathbb{Z}^g}\exp\left(i\pi n^T\Omega n + 2\pi i\,n^T z\right), \tag{10.6.13}$$

where $n$ labels an integer-valued $g$-dimensional vector defined on the lattice
formed by unraveling the Riemann surface by cutting along the various cycles.
In terms of the prime form $E(z, z')$, the $\sigma$ function, and the $\Theta$ function, we can
write the complete conformal map:



x 


FI ]t + X A E^vAQ.) 

uu y j\ Q ? 


k(z)| 3 . 


(10.6.14) 
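The quasi-periodicity of the $\Theta$ function in Eq. (10.6.13) can be checked numerically in the simplest case. The sketch below assumes genus $g = 1$, so that the period matrix $\Omega$ reduces to a single complex number `tau` with positive imaginary part, and truncates the lattice sum at a hypothetical `cutoff`; the function name and the sample point are illustrative choices, not taken from the text.

```python
import cmath
import math


def theta_genus1(z: complex, tau: complex, cutoff: int = 30) -> complex:
    """Truncated genus-1 theta series: sum_n exp(i*pi*n^2*tau + 2*pi*i*n*z)."""
    if tau.imag <= 0:
        raise ValueError("tau must lie in the upper half plane for convergence")
    total = 0j
    for n in range(-cutoff, cutoff + 1):
        total += cmath.exp(1j * math.pi * n * n * tau + 2j * math.pi * n * z)
    return total


# Illustrative sample point (an assumption, any z and tau with Im tau > 0 will do).
z, tau = 0.3 + 0.1j, 0.2 + 1.1j

base = theta_genus1(z, tau)
shift_a = theta_genus1(z + 1, tau)      # going once around the a cycle
shift_b = theta_genus1(z + tau, tau)    # going once around the b cycle

# Around the b cycle the series picks up a known multiplicative factor.
expected_b = cmath.exp(-1j * math.pi * tau - 2j * math.pi * z) * base
```

The series is strictly periodic under $z \to z + 1$ and only quasi-periodic under $z \to z + \tau$, which is why ratios and products of such functions are needed to build a map that returns to itself around every cycle.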


There are problems, however, at the one-loop level. Tadpole graphs (which 
do not appear in the light cone theory because of momentum conservation) 
appear in the nonpolynomial theory. By taking g = 1 and p = 1 in the 
conformal map, we have 


dp _ y/z-y 
dz v/z(z — l)(z - x) ’ 


N — - 

iv 2 . 


(10.6.15) 


Unfortunately, this map shows that modular invariance is violated. If we define 
T to be the circumference of the loop in the tadpole graph, then we find 


z ~ —T(r). (10.6.16) 

7t 

Since the region around complex T is part of the parameter space of the field
theory, this means that the field theory includes a finite region around $\tau \sim 0$,
which is known to have an infinite number of copies of moduli space.

This means that our original action is actually only a classical action and 
that a second set of nonpolynomial terms must be added that can serve two 
functions. This second set: 


(1) kills the anomalies that arise when we take the gauge variation of the
functional measure $D\Psi$; and

(2) eliminates the overcounting due to tadpoles. 

In summary, we now have a successful closed string field theory that is
the generalization of Einstein’s equations when power expanded around the
graviton. The nonpolynomial action reproduces all tree graphs by filling up
all missing regions of moduli space, but it has to be supplemented by a second
set of nonpolynomial terms that can kill all unwanted divergences and
overcountings.

The difficulty of writing the nonpolynomial theory, however, leads us to
suspect that it is the gauge fixed version of a higher theory. Since
reparametrizations lie at the heart of string theory and since the nonpolynomial
theory breaks reparametrization invariance (since the parametrization length of
the string is $2\pi$), we are led in the direction of trying to gauge
reparametrizations.


References 


1. M. Kaku and K. Kikkawa, Phys. Rev. D10, 1110, 1823 (1974).

2. E. Witten, Nucl. Phys. B268, 253 (1986). 

3. J. Lykken and S. Raby, Nucl. Phys. B278, 256 (1986). 

4. A. Strominger, Phys. Rev. Lett. 58, 629 (1987); Nucl. Phys. B294, 93 (1987). 

5. M. Kaku, in Functional Integration, Geometry, and Strings, 25th Karpacz Winter
School, Feb. 20-Mar. 5, 1989, Z. Haba and J. Sobczyk, eds., Birkhäuser, Basel
(1989).

6. M. Kaku, Phys. Rev. D41, 3734 (1990); Osaka preprint OU-HET 121 (1989). 

7. T. Kugo, H. Kunitomo, and K. Suehiro, Phys. Lett. 226B, 48 (1989); T. Kugo and 
K. Suehiro, KUNS 988 HE(TH) 89/08 (1989); M. Saadi and B. Zwiebach, Ann. 
Phys. 192, 213 (1989). 

8. M. A. Virasoro, Phys. Rev. 177, 2309 (1969).

9. J. Shapiro, Phys. Lett. 33B, 361 (1970). 

10. M. Kaku and J. Lykken, Phys. Rev. D38, 3067 (1988); [The missing region was 
first conjectured in: S. Giddings and E. Martinec, Nucl. Phys. B278, 256 (1986).] 

11. M. Kaku, Phys. Rev. D38, 3052 (1988). 

12. D. Mumford, Tata Lectures on Theta, Birkhäuser, Basel (1983).

13. J. Fay, Theta Functions on Riemann Surfaces, Lecture Notes in Mathematics,
Vol. 352, Springer-Verlag, Berlin (1973).

14. L. Alvarez-Gaume, G. Moore, and C. Vafa, Comm. Math. Phys. 106, 1 (1986). 

15. E. Verlinde and H. Verlinde, Nucl. Phys. B288, 357 (1987). 

16. S. Samuel, CCNY preprint (1989). 

17. L. Hua and M. Kaku, Phys. Rev. D41, 3748 (1987). 

18. L. Hua and M. Kaku, Phys. Lett. 250B, 56 (1990). 

19. G. Zemba and B. Zwiebach, J. Math. Phys. 30, 2388 (1989); H. Sonoda and B. 
Zwiebach, Nucl. Phys. B331, 592 (1990); B. Zwiebach, Phys. Lett. 241B, 343 
(1990); B. Zwiebach, MIT-CTP 1909, 1910, 1911, 1912. 

20. M. Saadi, Mod. Phys. Lett. A5, 551 (1990). 

21. M. Kaku, Phys. Lett. 250B, 64 (1990). 



CHAPTER 11 


2D Gravity and Matrix Models 


11.1 Exactly Solvable Strings 

To any finite order in perturbation theory, one does not see any of the
interesting nonperturbative properties of gauge theories, such as confinement,
tunneling, formation of strings, etc. As a consequence, two approximations
have been developed, large N methods and lattice gauge theory, to analyze
gauge theories in the nonperturbative regime. However, both approaches are
still in their infancy, and neither has given us definitive results.

The same situation may eventually apply to string theory. Continuum methods,
such as string field theory, are still much too difficult, with too many
degrees of freedom, to analyze nonperturbative string phenomena. However,
lattice models have emerged as a surprisingly simple way in which to extract
nonperturbative information from strings, using a discrete approximation to
the Riemann surfaces of string theory.

The idea behind this nonperturbative approach is simple, and we can proceed 
in at least two ways. The first approach is to place the original Polyakov action 
on a lattice, where we discretize the two-dimensional world sheet. Since the 
lattice is two dimensional, it can be analyzed either analytically or by computer. 

The second approach is to use matrix models [1-10]. The essential breakthrough
in matrix models is that there is a well-defined limit in which a certain
class of solvable point-particle gauge theories can approximate the dual string
theory for dimensions less than or equal to one. In fact, not only do matrix
models allow us to use ordinary point-particle Feynman diagrams to reproduce
string theory amplitudes, they also allow us to calculate all Green’s functions
to all orders in perturbation theory.

To see how point-particle gauge theory can miraculously reproduce string 
theory, we begin by studying a class of Feynman diagrams called fishnets [11]. 






Given a point-particle scalar $\phi^n$ theory, the fishnet diagram has the topology
of a two-dimensional lattice, where the lines within the lattice are given by
Feynman propagators $\Delta$. The fishnet consists of a simple product of Feynman
propagators connecting the points of a lattice, which can be exponentiated:

$$\prod_{i,j}\Delta[(x_i^\mu - x_j^\mu)^2] = \exp\sum_{i,j}\ln\Delta[(x_i^\mu - x_j^\mu)^2]. \tag{11.1.1}$$

For a fishnet, the points $x_i$ and $x_j$ appearing in the product are neighboring
points, so we can power expand

$$(x_i^\mu - x_j^\mu)^2 \sim \epsilon^2\left(\frac{dx^\mu}{d\sigma}\right)^2 \quad\text{or}\quad \epsilon^2\left(\frac{dx^\mu}{d\tau}\right)^2, \tag{11.1.2}$$

where $\epsilon$ is a small parameter measuring the distance between neighboring
points. Notice that we have made the transition from $i$ and $j$, which are
discontinuous coordinates which label the neighboring points of the fishnet, to
continuous coordinates $\sigma$ and $\tau$, which parametrize the two-dimensional world
sheet.

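Now assume that we can factor out the light cone singularity $\Delta(0)$ which
appears in the Feynman propagator when we power expand the logarithm:

$$\ln\Delta[(x_i - x_j)^2] \sim \ln\left\{\Delta(0)\left[1 + (x_i - x_j)^2 + \cdots\right]\right\}. \tag{11.1.3}$$

With this important assumption, we can write the product over Feynman
propagators as a surface integral over the two-dimensional world sheet:

$$\int\prod_k dx_k^\mu\prod_{i,j}\Delta \to \int Dx^\mu\exp\left\{-k\int d\sigma\,d\tau\left[\left(\frac{dx^\mu}{d\sigma}\right)^2 + \left(\frac{dx^\mu}{d\tau}\right)^2\right]\right\}, \tag{11.1.4}$$

where $\epsilon^2 \sim d\sigma\,d\tau$ and $k$ is a constant.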

Thus, assuming that we can remove the light cone singularity, the fishnet 
diagram smoothly approaches the familiar functional integral over a Riemann 
surface. The key step was replacing the Feynman propagators defined over a 
fishnet with a Gaussian, which is only possible if we can throw away the light 
cone singularity. 

By itself, however, this observation, while interesting, is essentially useless, 
since fishnet diagrams do not necessarily dominate the S matrix of field theory. 
Thus, although fishnet diagrams may have interesting properties, in general 
they have nothing to do with the final scattering amplitudes, which are the 
only physically relevant quantities. 

However, the important exception to this is gauge theory, where it is
possible to make a power expansion in some parameter in which these fishnet
diagrams do, indeed, dominate the S matrix. As noticed by 't Hooft, this
important parameter is $1/N^2$ appearing in SU(N) gauge theory [12].

For example, in SU(N) gauge theory coupled to quarks, the parameter N in
a Feynman diagram only appears when we contract a series of delta functions
onto themselves in a loop, that is, $\sum_i\delta_{ii} = N$. Since the vertex functions of
gauge theory consist of a series of delta functions, the parameter N appears
whenever we contract these delta functions around a loop.

Thus, a large fishnet diagram is proportional to

$$A \sim (g)^{V_3 + 2V_4}N^{I}, \tag{11.1.5}$$

where $g$ is the coupling constant, $V_i$ is the number of $i$-point vertices appearing
in the Feynman diagram, and $I$ is the number of gauge loops.

We now use a classical result due to Euler, who showed that a polyhedron
with $P$ edges, $V$ corners, $F$ faces, and $H$ holes satisfies the following
topological relation:

$$V - P + F = 2 - 2H. \tag{11.1.6}$$

We now treat a fishnet Feynman diagram as a large polyhedron. In the
language of Feynman diagrams, $P$ becomes the number of propagators, $V$
the number of vertices, $F$ the number of loops, $H$ the number of holes in the
fishnet, and $F = L + I$, where $L$ is the number of quark loops in the Feynman
diagram and $I$ the number of gauge loops. Using the relations $2P = \sum_n nV_n$
and $V = \sum_n V_n$, we then have

$$A \sim (g)^{2P - 2V}N^{I} = (g^2N)^{\frac{1}{2}V_3 + V_4}\,(N)^{2 - 2H - L}. \tag{11.1.7}$$

Now take the limit $N \to \infty$, $g \to 0$, and $g^2N = \text{const}$. We find that the
leading diagrams which survive have $H = 0$ and $L = 1$, that is, they are planar
diagrams with the quark line surrounding the edge of the diagram.
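The bookkeeping behind Eqs. (11.1.5) to (11.1.7) can be sketched in a few lines. The function below is only an illustration: its name and the sample vertex counts are hypothetical, and it assumes a diagram built solely from three- and four-point vertices, as in Eq. (11.1.5). It returns the exponents $a$ and $b$ in $(g^2N)^a N^b$.

```python
from fractions import Fraction


def thooft_weight(v3: int, v4: int, holes: int, quark_loops: int):
    """Exponents (a, b) such that the diagram scales as (g^2 N)^a * N^b."""
    vertices = v3 + v4                                  # V = sum_n V_n
    propagators = Fraction(3 * v3 + 4 * v4, 2)          # 2P = sum_n n V_n
    # Euler: V - P + F = 2 - 2H with F = L + I, solved for the gauge loops I.
    gauge_loops = 2 - 2 * holes - vertices + propagators - quark_loops
    coupling_power = Fraction(v3, 2) + v4               # exponent of g^2 N
    n_power = gauge_loops - coupling_power              # equals 2 - 2H - L
    return coupling_power, n_power


# Hypothetical counts: four cubic vertices, one quark loop.
planar = thooft_weight(v3=4, v4=0, holes=0, quark_loops=1)
one_hole = thooft_weight(v3=4, v4=0, holes=1, quark_loops=1)
```

With $g^2N$ held fixed, only the second exponent matters: each additional hole costs a factor of $1/N^2$ and each additional quark loop a factor of $1/N$, which is why the planar diagrams with a single quark loop dominate.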

Furthermore, if we take the limit of small but finite $1/N^2$, then we find that
a power expansion in $1/N^2$ is equivalent to a power expansion in the number
of loops in the fishnet diagram. The usefulness of the $1/N^2$ power expansion,
therefore, is that it converts a point-particle perturbation expansion into an
expansion over the genus of a Riemann surface found in string theory.

In actual practice, however, Yang-Mills theory is quite complicated, so we
will find it more convenient to solve a simpler problem, a theory where the
fundamental field is a Hermitian $N \times N$ matrix called $M$. These are called
matrix models.

The advantage of matrix models is that they are so simple that they can be
solved exactly in the large N limit and shown to be equivalent to string theory
for dimensions less than or equal to one. (More precisely, by $D \le 1$ string
theory, we actually mean two-dimensional gravity in zero dimensions, without
the presence of any string variable, coupled to $c < 1$ conformal matter.)

Although the large N power expansion is a perturbation series in the genus
$g$ of the surface, certain recursion relations exist that allow us to find exact
solutions for the models, giving us, for the first time, nonperturbative
information about the theory. Thus, these models are so simple that the transition
from a perturbation series in two-dimensional surfaces to a fully nonperturbative
theory is not such a great barrier.





We will see that matrix models in the “double scaling limit” are exactly
solvable for $c < 1$. This limit means taking $N \to \infty$ and carefully adjusting
the cosmological constant (related to the area of the surface) to be at a critical
point of the theory. These critical points, in turn, are indexed by an integer
$k$. We will find that, for $k = 2$, the matrix model approximates ordinary
two-dimensional gravity. For $k = 3, 4, \ldots$, we find that matrix models approximate
two-dimensional gravity coupled to nonunitary, conformal matter. For the general
case of multimatrix models, we can reproduce two-dimensional gravity
coupled to the (BPZ) minimal series of $(p, q)$ conformal matter.

The attractive feature, then, of the lattice and matrix model approaches is 
that they provide nonperturbative information about string theory using old, 
well-established methods. For the first time, nonperturbative features of string 
theory are emerging (such as the instability, that is, non-Borel summability, of 
perturbation theory). 

The disadvantage of these models, as we have pointed out, is that they cannot
realistically describe string theories in 26, 10, or 4 dimensions. There are certain
mathematical barriers preventing analysis beyond one dimension. In fact, we
will see that these models can only be solved for low, unrealistic values of the
dimension. Beyond the $c = 1$ barrier, many of our approximations break down
(e.g., real constants become complex and potentials become unphysical).

In addition, matrix models only have a finite number of degrees of freedom,
and hence we can usually find exact results. For $c > 1$, the system has an
infinite number of degrees of freedom, and we do not expect the system to be
exactly solvable. Thus, in some sense the price we pay for exact solubility of
the string theory is low dimensionality and finite degrees of freedom, which do
not describe the real world. However, there is hope that these theories, being the
first to provide nonperturbative information about strings, will give us valuable
insight into this previously forbidden yet crucially important region.


11.2 2D Gravity and KPZ 

Usually, string theory is defined at the critical dimension of 10 or 26, where the
theory is scale invariant and the metric $g_{ab}$ can be set to $\delta_{ab}$. For values of the
dimension other than 26 or 10, we have the noncritical string theory of Polyakov,
where the string variable $X^\mu(\sigma, \tau)$ interacts with the two-dimensional metric
tensor $g_{ab}$. For noncritical string theory, there is a conformal anomaly, so the
three fields within the metric cannot be eliminated totally. The metric reduces
to just one field, the Liouville field $\phi$, where $g_{ab} = e^{\phi}h_{ab}$.

Usually, for critical strings in 26 or 10 dimensions, we have the freedom to
eliminate the Liouville field completely. However, in these low-dimensional
matrix models, which are noncritical, we must leave the Liouville mode intact.
Thus, in this off-critical picture, the two-dimensional gravitational field does
not vanish but becomes a key player, and the string field actually reduces to
matter fields coupled to two-dimensional (Liouville) gravity.

We begin by isolating the contribution of the conformal anomaly, which
we will calculate by taking the variation of the functional measure under a
scale transformation. First, the $X^\mu$ integration gives us the determinant of the
Laplacian:

$$\int DX\,\exp\left(-\frac{1}{2\pi}\int d^2z\,\sqrt{g}\,g^{ab}\,\partial_a X^\mu\,\partial_b X_\mu\right) = \left[\det(-\nabla^2)\right]^{-D/2}. \tag{11.2.1}$$


We must calculate how this term in the functional measure transforms under a
scale transformation of the metric in order to isolate the conformal anomaly.
We can use heat kernel methods to calculate the change in the determinant of
the Laplacian under a scale transformation. The result is

$$\begin{aligned}
D_{e^{\sigma}g}X &= e^{(D/48\pi)S_L(\sigma,g)}\,D_g X, \\
S_L(\sigma,g) &= \int d^2z\,\sqrt{g}\left(\tfrac{1}{2}g^{ab}\,\partial_a\sigma\,\partial_b\sigma + R\sigma + \mu e^{\sigma}\right).
\end{aligned} \tag{11.2.2}$$

The action $S_L$ is called the “Liouville action.” The classical equations of motion
for the $\sigma$ field are of the form $\partial_a\partial^a\sigma \sim e^{\sigma}$ and are quite difficult to solve. Under
this scale transformation, we also find that the $b$, $c$ ghost system is not invariant,
but transforms nontrivially.

We will define

$$\int D_g b\,D_g c = \int\Delta_{FP}\,Db\,Dc, \tag{11.2.3}$$

where $\Delta_{FP}$ includes the metric-dependent and $b$, $c$-dependent factors, that is,
the Faddeev-Popov terms. Using heat kernel methods, we find that this term
transforms as follows under a scale transformation:

$$D_{e^{\sigma}g}b\,D_{e^{\sigma}g}c = e^{(-26/48\pi)S_L(\sigma,g)}\,D_g b\,D_g c. \tag{11.2.4}$$


When we multiply the two contributions, Eqs. (11.2.2) and (11.2.4), together,
we notice that the action $S_L$ occurs with the coefficient $D - 26$:

$$\left[(D - 26)/48\pi\right]S_L(\sigma, g), \tag{11.2.5}$$

which vanishes when $D = 26$. This is the choice usually taken in string theory,
which is called critical string theory, where the two-dimensional metric can
be eliminated entirely. Thus, in 26 dimensions, we never have to worry about
the Liouville degree of freedom contained within the two-dimensional metric
tensor.

However, since $D = 26$ string theory is too difficult to solve, we will be
interested in solving simpler theories, such as the case of low dimensions. In
this chapter, we will explore ways to solve this problem exactly. From the
perspective of conformal field theory, we are interested in studying the case
of two-dimensional gravity coupled to $D$ identical copies of conformal matter
with weight 0 (i.e., the string $X^\mu$).

One complication, however, is that in lower dimensions, the conformal
anomaly, Eq. (11.2.5), does not disappear. This means that the difficult
Liouville mode $\sigma$ must be carefully included in all our discussions. However, it
is still possible to extract information about the theory exactly. For example,
from the work of Knizhnik, Polyakov, and Zamolodchikov (KPZ) [13, 14],
we know the asymptotic form of the partition function for two-dimensional
gravity. If the area $A$ of the two-dimensional world sheet is defined as

$$A = \int d^2\xi\,\sqrt{\det g_{ab}}, \tag{11.2.6}$$

then the partition function is

$$Z(A) = \int DX(\xi)\int Dg_{ab}(\xi)\,\delta\left(\int d^2\xi\,\sqrt{g} - A\right)\exp\left(-\int d^2\xi\,\sqrt{g}\,g^{ab}\,\partial_a X^\mu\,\partial_b X_\mu\right). \tag{11.2.7}$$

Using light-cone quantization of the two-dimensional theory, KPZ found
that the partition function behaves asymptotically as follows as the area $A$
approaches infinity:

$$Z(A) \sim A^{-3+\gamma}\exp(\kappa A), \tag{11.2.8}$$

where $\gamma$, the string susceptibility, can be shown to equal [13, 14]:

$$\gamma = \frac{1}{12}\left[D - 1 - \sqrt{(D-1)(D-25)}\right]. \tag{11.2.9}$$

Although this form of the string susceptibility was originally derived in the
light cone gauge, it can also be derived in the conformal gauge [15, 16].

To see this, we first assume that after regularization the final result for the
Jacobian after a rescaling is equal to

$$S(\phi, \hat g) = \frac{1}{8\pi}\int d^2z\,\sqrt{\hat g}\left(\hat g^{ab}\,\partial_a\phi\,\partial_b\phi - Q\hat R\phi + \mu_1 e^{\alpha\phi}\right), \tag{11.2.10}$$

where we have rescaled $g = e^{\alpha\phi}\hat g$. We have chosen this expression, involving
the undetermined $Q$, $\alpha$, $\mu_1$, because it is the most general form of the
regularized action consistent with the symmetries of the system.

Actions of this type (with the crucial factor containing $Q$) often occur when
we bosonize a fermionic system, as in the Feigin-Fuchs free field formalism
in Eq. (7.1.2). It is easy therefore to calculate the contribution to the anomaly
of the $\phi$ field, which is

$$c_\phi = 1 + 3Q^2. \tag{11.2.11}$$



Imposing the fact that the entire system has zero anomaly, we find that the
anomaly is the sum of three terms, all of which add up to zero:

$$c = c_\phi + D - 26 = 0, \tag{11.2.12}$$

or

$$Q = \sqrt{(25 - D)/3}. \tag{11.2.13}$$

The next step is to calculate the value of $\alpha$. This is easy because we demand
that $g = e^{\alpha\phi}\hat g$ be invariant, which means that $e^{\alpha\phi}$ has conformal weight 1.
Again, from ordinary conformal field theory, we also know how to compute
the conformal weight of operators such as $e^{\alpha\phi}$. If the energy-momentum tensor
is given by

$$T_{zz} = -\tfrac{1}{2}\left(\partial\phi\,\partial\phi + Q\,\partial^2\phi\right), \tag{11.2.14}$$

then the conformal weight of $e^{q\phi}$ is

$$\text{weight}(e^{q\phi}) = -\tfrac{1}{2}q(Q + q). \tag{11.2.15}$$

Setting this equal to 1, we then find

$$\alpha = -\frac{1}{2\sqrt{3}}\left[\sqrt{25 - D} \pm \sqrt{1 - D}\right]. \tag{11.2.16}$$

Using the explicit values for $Q$ and $\alpha$, let us now calculate the behavior of
$Z(A)$ under a rescaling. If we shift by a constant value

$$\phi \to \phi + \rho/\alpha, \tag{11.2.17}$$

then the action of Eq. (11.2.10) shifts by the integral of the curvature tensor $R$
over the Riemann surface:

$$S \to S - Q(1 - h)\rho/\alpha, \tag{11.2.18}$$

where $h$ is the genus of the surface. The shift in the $\delta$ function is given by

$$\delta\left(\int e^{\alpha\phi}\sqrt{\hat g}\,d^2z - A\right) \to e^{-\rho}\,\delta\left(\int e^{\alpha\phi}\sqrt{\hat g}\,d^2z - e^{-\rho}A\right). \tag{11.2.19}$$

Putting everything together, we find that the partition function scales as

$$Z(A) \sim A^{Q(1-h)/\alpha - 1}, \tag{11.2.20}$$

so that the susceptibility given in Eq. (11.2.8) must therefore be [16]:

$$\gamma(h) = \frac{1}{12}(1 - h)\left(D - 25 \pm \sqrt{(25 - D)(1 - D)}\right) + 2. \tag{11.2.21}$$

Setting $h = 0$, we retrieve the original result of Eq. (11.2.9).

Notice that the string susceptibility becomes complex for the dimension D 
between 1 and 25, meaning that attempts to naively extend the matrix model 
approach beyond D = 1 will inevitably have severe problems. This is, in 
fact, perhaps the most important roadblock facing this formalism, preventing 
a realistic, nonperturbative formulation of string theory. 
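The chain from Eq. (11.2.13) to Eq. (11.2.21) is short enough to check numerically. The following sketch uses complex arithmetic so that the region $1 < D < 25$ can be probed directly; the function names and the `branch` argument (which selects the sign in front of the square root) are illustrative assumptions, not notation from the text.

```python
import cmath


def liouville_parameters(D: float, branch: int = -1):
    """Q from Eq. (11.2.13) and alpha from Eq. (11.2.16); branch is +1 or -1."""
    Q = cmath.sqrt((25 - D) / 3)
    alpha = -(cmath.sqrt(25 - D) + branch * cmath.sqrt(1 - D)) / (2 * cmath.sqrt(3))
    return Q, alpha


def weight(q: complex, Q: complex) -> complex:
    """Conformal weight of exp(q*phi), Eq. (11.2.15)."""
    return -0.5 * q * (Q + q)


def gamma_scaling(D: float, h: int, branch: int = -1) -> complex:
    """Exponent read off from Z(A) ~ A^(gamma - 3) with Z(A) ~ A^(Q(1-h)/alpha - 1)."""
    Q, alpha = liouville_parameters(D, branch)
    return 2 + (1 - h) * Q / alpha


def gamma_closed_form(D: float, h: int, branch: int = -1) -> complex:
    """Closed form of Eq. (11.2.21) with the same sign choice."""
    root = cmath.sqrt((25 - D) * (1 - D))
    return (1 - h) * (D - 25 + branch * root) / 12 + 2


Q0, alpha0 = liouville_parameters(0)
gamma_pure_gravity = gamma_scaling(0, 0)     # D = 0, genus 0
gamma_inside_barrier = gamma_scaling(10, 0)  # a sample point with 1 < D < 25
```

For $D = 0$ and genus zero the lower sign gives $\gamma = -1/2$, while any sample value with $1 < D < 25$ returns a nonzero imaginary part, which is the numerical face of the barrier described above.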






However, assuming that this low-dimensional barrier can be eventually
surmounted, these matrix models may make the transition from being simplistic
“toy” models to being relatively realistic nonperturbative descriptions of string
theory. The most optimistic outcome would be that string theory might be
defined to be the large N double scaling limit of matrix models, thereby replacing
the Riemann surface description.

To begin our discussion, we note that there are two ways in which to proceed. 
First, we can work directly with the Polyakov functional, either analytically or 
by computer. For example, on computer, we can approximate the functional as 



( 11 . 2 . 22 ) 


where we sum over all triangulations $S$ of the surface, $i$ and $j$ label the vertices
on the surface, $\beta$ is the inverse Ising temperature, and $h$ is the magnetic field.
However, we will explore the second approach, matrix models, which yields
the exact analytic solution to the problem for $c < 1$.


11.3 Matrix Models 

The second way to proceed is to make the connection between the
two-dimensional gravitational world sheet and the topology swept out by the
Feynman diagrams of a matrix model. The correctness of this approach will be
evident when we calculate the string susceptibility $\gamma$ and independently check
Eq. (11.2.21). In this spirit, let us analyze the simplest description of these
matrix models, defined in terms of a field $M$, which is an $N \times N$ matrix.

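We start with the action

$$L = \sum_{\mu=1}^{D}\mathrm{Tr}\left(\partial_\mu M\,\partial_\mu M^\dagger\right) + \mathrm{Tr}\left(MM^\dagger\right) + \cdots. \tag{11.3.1}$$

We can set $M^\dagger = M$ and also generalize to an interaction with arbitrary powers
of the matrix $M$:

$$U = \sum_{i=3}^{\infty}g_i\,\mathrm{Tr}\,M^i = g_3\,\mathrm{Tr}\,M^3 + g_4\,\mathrm{Tr}\,M^4 + \cdots, \tag{11.3.2}$$

so the generating functional is given by (for $D = 0$):

$$Z(\beta) = \int DM\,\exp\left[-\beta\,\mathrm{Tr}\,U(M)\right]. \tag{11.3.3}$$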

Let us now analyze the Feynman diagrams emerging from this Lagrangian.
In general, a diagram will have $P$ propagators, $V$ vertices, and $I$ closed loops.
As before, we have the Euler relation $V - P + I = 2 - 2H$, where $H$ is
the number of holes in the surface on which the polyhedron is drawn. Notice
that this number is a topological invariant, dependent only on the topological
nature of the surface upon which we draw the polyhedron.

For the Feynman diagrams arising from a Hermitian matrix model, for fixed
$\beta/N$, each vertex, propagator, and loop contributes the following factors:

$$\begin{aligned}
\text{vertex} &\to N, \\
\text{propagator} &\to 1/N, \\
\text{loop} &\to N.
\end{aligned} \tag{11.3.4}$$

Using Eq. (11.3.3), by multiplying these factors in a Feynman expansion, we
arrive at

$$\ln Z \sim N^{2(1-H)}(N/\beta)^{A}, \tag{11.3.5}$$

where $A$ is the area of the random surface.

Then, we see that the vacuum energy graph, for example, divided by $N^2$ has a
finite limit in the planar ($H = 0$) limit. We see that the overall factor is damped
by a factor $1/N^{2H}$, so that a perturbation in $1/N^2$ is actually a perturbation in
the number of holes in the surface. This gives us the justification for comparing
the matrix model in the large N limit with the string theory perturbation series.
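The power counting behind this statement can be written out in one line. Using only the factors listed in Eq. (11.3.4) and the Euler relation $V - P + I = 2 - 2H$, a vacuum diagram with $V$ vertices, $P$ propagators, and $I$ loops scales as follows (a worked restatement, not an additional result):

```latex
N^{V}\left(\frac{1}{N}\right)^{P} N^{I} = N^{V - P + I} = N^{2 - 2H},
\qquad
\frac{1}{N^{2}}\,N^{2 - 2H} = N^{-2H}.
```

Dividing by $N^2$ therefore leaves a finite planar contribution, with each additional hole suppressed by $1/N^2$.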
The interesting features appear when we take the double scaling limit

$$N \to \infty, \qquad \beta/N \to \text{const}. \tag{11.3.6}$$

In this limit, we find that the free energy is independent of many of the details
of the nature of the potential $U$, that is, we find a universal scaling behavior

$$\ln Z(\beta) \to -F(t), \tag{11.3.7}$$

where the scaling variable is defined to be

$$t = (\beta - N)\,\beta^{-1/(2k+1)}. \tag{11.3.8}$$

We will find that there are $k$ parameters in the potential $U(\phi)$ that we can
adjust, giving us the multicritical behavior for the specific heat, defined by

$$f(t) = d^2F(t)/dt^2. \tag{11.3.9}$$

Remarkably, we will be able to derive an exact expression for $f(t)$ in terms
of a differential equation, independent of the perturbation expansion. However,
we can check the correctness of our results against the perturbation expansion,
where the coupling constant is

$$g_{\text{string}}^2 = t^{-(2+1/k)}. \tag{11.3.10}$$

To make this discussion concrete, we now use some tricks pioneered by Brezin,
Itzykson, Parisi, and Zuber [17-20] to solve the matrix model.





If we are, for example, interested in the vacuum energy, we wish to find a
solution for the following path integral (in zero dimensions):

$$\exp\left[-N^2E^{(0)}(g)\right] = \lim_{N\to\infty}\int\prod_{i,j}dM_{ij}\,\exp\left[-\left(\tfrac{1}{2}\mathrm{Tr}\,M^2 + \frac{g}{N}\mathrm{Tr}\,M^4\right)\right]. \tag{11.3.11}$$

Notice that we have dropped the kinetic term entirely, meaning that we are
only analyzing the simplified case of $D = 0$.

One crucial observation is that the functional measure over the matrix $M$:

$$\prod_{i,j}dM_{ij} = \prod_i dM_{ii}\prod_{i<j}d(\mathrm{Re}\,M_{ij})\,d(\mathrm{Im}\,M_{ij}) \tag{11.3.12}$$

can be simplified by diagonalizing the matrix $M$. Then, we are left with an
integration over the eigenvalues $\lambda_i$ of $M$ and a matrix $U$. This means that we
can write the measure in terms of the eigenvalues and the matrix $U$:

$$\prod_{i,j}dM_{ij} = \prod_i d\lambda_i\prod_{i<j}(\lambda_i - \lambda_j)^2\,dU_{ij}. \tag{11.3.13}$$

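The advantage of this approach is that the action is written entirely in terms
of traces over products of $M$, so that the dependence on the diagonalizing
matrix $U$ completely disappears. This means that we can trivially integrate over
$U$, leaving only an integration over the eigenvalues $\lambda_i$:

$$\exp\left[-N^2E^{(0)}(g)\right] = \lim_{N\to\infty}\int\prod_i d\lambda_i\prod_{i<j}(\lambda_i - \lambda_j)^2\exp\left[-\sum_i\left(\tfrac{1}{2}\lambda_i^2 + \frac{g}{N}\lambda_i^4\right)\right]. \tag{11.3.14}$$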

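We evaluate this integral by the method of steepest descent. The energy $E^{(0)}$
can now be explicitly calculated in terms of the classical variable $\lambda_i$. In terms
of this classical variable, we find

$$\begin{aligned}
E^{(0)}(g) &= \lim_{N\to\infty}\frac{1}{N^2}\left[\sum_i\left(\tfrac{1}{2}\lambda_i^2 + \frac{g}{N}\lambda_i^4\right) - {\sum_{i,j}}'\ln|\lambda_i - \lambda_j|\right], \\
\tfrac{1}{2}\lambda_i + \frac{2g}{N}\lambda_i^3 &= {\sum_j}'\frac{1}{\lambda_i - \lambda_j},
\end{aligned} \tag{11.3.15}$$

where the prime means we do not sum over $i = j$.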

Let us now take the large N limit. We can replace the classical $\lambda_i$ with a
new, continuous variable $\lambda(x)$, defined by

$$\lambda_i = \sqrt{N}\,\lambda(i/N). \tag{11.3.16}$$




In the large N limit, we can then write the vacuum energy as

$$E^{(0)}(g) = \int_0^1 dx\left[\tfrac{1}{2}\lambda^2(x) + g\lambda^4(x)\right] - \int_0^1\!\!\int_0^1 dx\,dy\,\ln|\lambda(x) - \lambda(y)|, \tag{11.3.17}$$

and $\lambda(x)$ obeys the constraint

$$\tfrac{1}{2}\lambda(x) + 2g\lambda^3(x) = P\int_0^1\frac{dy}{\lambda(x) - \lambda(y)}, \tag{11.3.18}$$

where $P$ represents taking the principal part of the integral. This last constraint
is actually sufficient to determine the function $\lambda(x)$, given some assumptions
on its analytic behavior.

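To solve for $\lambda(x)$, we introduce two new functions, $u(\lambda)$ and $F(\lambda)$, defined
via

$$\frac{dx}{d\lambda} = u(\lambda), \qquad \int_{-2a}^{2a}d\lambda\,u(\lambda) = 1, \qquad F(\lambda) = \int_{-2a}^{2a}d\mu\,\frac{u(\mu)}{\lambda - \mu}. \tag{11.3.19}$$

(We will assume that $u(\lambda)$ vanishes outside some support $[-2a, 2a]$. We can
determine $F(\lambda)$ by analytical arguments. We can show that $F(\lambda)$ is analytic in the
complex $\lambda$ plane cut along this interval, behaves as $1/\lambda$ when $|\lambda|$ goes to $\infty$, is
real for real $\lambda$ outside this interval, and, when $\lambda$ approaches this interval,
satisfies $F(\lambda \pm i\epsilon) = \lambda/2 + 2g\lambda^3 \mp i\pi u(\lambda)$.)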

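Using the analytic properties of these functions along the interval
$[-2a, +2a]$ and their behavior at infinity, we can obtain the solution [17]:

$$\begin{aligned}
F(\lambda) &= \tfrac{1}{2}\lambda + 2g\lambda^3 - \left(\tfrac{1}{2} + 4ga^2 + 2g\lambda^2\right)\sqrt{\lambda^2 - 4a^2}, \\
u(\lambda) &= \frac{1}{\pi}\left(\tfrac{1}{2} + 4ga^2 + 2g\lambda^2\right)\sqrt{4a^2 - \lambda^2}.
\end{aligned} \tag{11.3.20}$$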
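As a sanity check on the density $u(\lambda)$ of Eq. (11.3.20), one can confirm numerically that it integrates to 1 over $[-2a, 2a]$, as required by Eq. (11.3.19), once $a^2$ is fixed by the relation quoted in Eq. (11.3.22). The sketch below uses a plain midpoint rule after the substitution $\lambda = 2a\sin\theta$; the function names and the number of steps are illustrative assumptions.

```python
import math


def edge_squared(g: float) -> float:
    """a^2 from Eq. (11.3.22), i.e. the root of 12*g*a^4 + a^2 - 1 = 0."""
    if g == 0:
        return 1.0  # limit g -> 0: the semicircle of radius 2
    return (math.sqrt(1 + 48 * g) - 1) / (24 * g)


def density(lam: float, g: float) -> float:
    """Eigenvalue density u(lambda) of Eq. (11.3.20); zero outside [-2a, 2a]."""
    a2 = edge_squared(g)
    inside = 4 * a2 - lam * lam
    if inside <= 0:
        return 0.0
    return (0.5 + 4 * g * a2 + 2 * g * lam * lam) * math.sqrt(inside) / math.pi


def normalization(g: float, steps: int = 2000) -> float:
    """Midpoint rule for the integral of u, with lambda = 2a*sin(theta)."""
    a = math.sqrt(edge_squared(g))
    d_theta = math.pi / steps
    total = 0.0
    for k in range(steps):
        theta = -math.pi / 2 + (k + 0.5) * d_theta
        lam = 2 * a * math.sin(theta)
        total += density(lam, g) * 2 * a * math.cos(theta) * d_theta
    return total
```

Analytically the integral equals $a^2 + 12ga^4$, so the normalization condition is the same statement as the quadratic equation that fixes $a^2$.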

Plugging these expressions back into the formula for the vacuum energy, we
find

$$\begin{aligned}
E^{(0)}(g) - E^{(0)}(0) &= \int_0^{2a}d\lambda\,u(\lambda)\left(\tfrac{1}{2}\lambda^2 + g\lambda^4 - 2\ln\lambda\right) - (g = 0) \\
&= \frac{1}{24}(a^2 - 1)(9 - a^2) - \frac{1}{2}\ln a^2,
\end{aligned} \tag{11.3.21}$$

where

$$a^2 = \frac{1}{24g}\left[(1 + 48g)^{1/2} - 1\right]. \tag{11.3.22}$$

Inserting the value of $a$ as a power expansion in $g$, we now have

$$\begin{aligned}
E^{(0)}(g) - E^{(0)}(0) &= -\sum_{p=1}^{\infty}(-12g)^p\,\frac{(2p - 1)!}{p!\,(p + 2)!} \\
&= 2g - 18g^2 + 288g^3 - 6048g^4 + \cdots.
\end{aligned} \tag{11.3.23}$$

Remarkably, the simplicity of the $D = 0$ matrix model gives us a simple
expression for the energy.
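The agreement between the closed form of Eqs. (11.3.21) and (11.3.22) and the series of Eq. (11.3.23) is easy to confirm numerically. The sketch below is only a check under stated assumptions: the function names, the truncation order, and the sample coupling are illustrative, and the comparison is made at a small positive $g$, well inside the region where the series converges.

```python
from fractions import Fraction
from math import factorial, log, sqrt


def series_coefficient(p: int) -> Fraction:
    """Coefficient of g^p in Eq. (11.3.23): -(-12)^p (2p-1)! / (p! (p+2)!)."""
    return -Fraction((-12) ** p * factorial(2 * p - 1),
                     factorial(p) * factorial(p + 2))


def planar_energy_series(g: float, order: int = 40) -> float:
    """Partial sum of the planar vacuum energy series up to the given order."""
    return sum(float(series_coefficient(p)) * g ** p for p in range(1, order + 1))


def planar_energy_closed(g: float) -> float:
    """Closed form (1/24)(a^2 - 1)(9 - a^2) - (1/2) ln a^2, valid for g != 0."""
    a2 = (sqrt(1 + 48 * g) - 1) / (24 * g)
    return (a2 - 1) * (9 - a2) / 24 - 0.5 * log(a2)


first_four = [series_coefficient(p) for p in range(1, 5)]
sample_g = 0.001  # hypothetical sample coupling
difference = planar_energy_series(sample_g) - planar_energy_closed(sample_g)
```

The coefficients grow roughly like $48^p$, so the series has a finite radius of convergence in $g$; the singular point on the negative $g$ axis, where the square root in Eq. (11.3.22) vanishes, is the critical point used in the discussion that follows.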

Now, let us find the critical point for this expansion and compare our results
with the values given by KPZ. At the critical value, we can show that the
vacuum energy goes as $(\beta - \beta_c)^{5/2}$ in the limit of infinite N (i.e., genus $h$ of the
surface is zero). In fact, by repeating the previous steps, a careful expansion of
the vacuum energy as a function of $1/N^2$ shows that the next few terms in the
Taylor expansion behave as $\ln(\beta - \beta_c)$ and $(\beta - \beta_c)^{-5/2}$ for $h = 1$ and $h = 2$,
respectively.

Our goal is to calculate the string susceptibility. We define the susceptibility
as the second derivative of the free energy $F(\beta)$:

$$\chi(\beta) = F''(\beta), \tag{11.3.24}$$

and the exponent of this susceptibility at the critical point is given by $\gamma$:

$$\chi \sim (\beta - \beta_c)^{-\gamma}. \tag{11.3.25}$$

By inserting the behavior of $F(\beta)$ into this expression, we find that

$$\gamma = 2 - \tfrac{5}{2}(1 - h), \qquad h = 0, 1, 2. \tag{11.3.26}$$

Now, compare the above expression with the exact result for the exponent
of susceptibility [Eq. (11.2.21)] appearing in the KPZ formalism. We find a
precise agreement for $D = 0$, which gives us great confidence in the power of
the matrix models to approximate two-dimensional gravity.
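The comparison can be made explicit. Setting $D = 0$ in Eq. (11.2.21) and taking the lower sign (the branch that reduces to Eq. (11.2.9) at $h = 0$) gives the following, shown here as a worked check rather than a new result:

```latex
\gamma(h)\Big|_{D=0}
  = \frac{1}{12}(1-h)\left(-25 - \sqrt{25}\right) + 2
  = 2 - \frac{5}{2}(1-h),
\qquad
\gamma(0) = -\frac{1}{2},\quad \gamma(1) = 2,\quad \gamma(2) = \frac{9}{2}.
```

These are the exponents obtained by differentiating twice the behaviors $(\beta - \beta_c)^{5/2}$, $\ln(\beta - \beta_c)$, and $(\beta - \beta_c)^{-5/2}$ of the vacuum energy at genus 0, 1, and 2.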


11.4 Recursion Relations 

At this point, these results have been interesting, but largely perturbative. If 
this were all that one could do with matrix models, then it would be very 
disappointing. The key, of course, is to extend our perturbative results to 
the nonperturbative regime. To accomplish this nontrivial feat, we will use 
the method of recursion relations, which will allow us to write differential 
equations that extend our analysis to a fully nonperturbative theory. 

To do this, we will introduce the method of orthogonal polynomials. As
before, we notice that we can reduce any of these matrix models to an
integration over the eigenvalues $\lambda_i$, with a measure proportional to the Vandermonde
determinant appearing in Eq. (11.3.13):

$$\Delta_N(\lambda) = \prod_{i<j}^{N}(\lambda_i - \lambda_j), \tag{11.4.1}$$

so that the partition function can be expressed entirely in terms of the
eigenvalues:

$$Z(\beta) = \int\prod_{i=1}^{N}d\lambda_i\,\Delta_N^2(\lambda)\,\exp\left[-\beta\sum_{i=1}^{N}U(\lambda_i)\right], \tag{11.4.2}$$

where $\beta^{-1}$ represents the “temperature” of the Coulomb gas with a potential
given by $U(\lambda)$.

Now let us use the key step that will make the matrix models solvable.
We will introduce polynomials $P_n(\lambda)$ [18, 19], which are orthogonal with
respect to the weight $d\mu(\lambda) = \exp[-\beta U(\lambda)]\, d\lambda$, that is, these polynomials
are functions of the potential $U(\lambda)$. By construction, these polynomials obey
the relations

$$\int d\lambda \exp[-\beta U(\lambda)] P_n(\lambda) P_m(\lambda) = \delta_{n,m} h_n. \tag{11.4.3}$$

For example, it is possible to write an explicit representation of these
polynomials in terms of the potential $U$ as

$$P_n(\lambda) = Z_n^{-1} \int \prod_{i=1}^{n} d\lambda_i \exp\Big[-\beta \sum_{i=1}^{n} U(\lambda_i)\Big] \prod_{i=1}^{n} (\lambda - \lambda_i)\, \Delta_n^2,$$
$$Z_n = \int \prod_{i=1}^{n} d\lambda_i \exp\Big[-\beta \sum_{i=1}^{n} U(\lambda_i)\Big] \Delta_n^2. \tag{11.4.4}$$

(Notice that we obtain the full generating functional if we set $n = N$: $Z_N(\beta) = Z(\beta)$.)

It is possible to construct these polynomials so that they satisfy a recursion
relation for any potential $U(\lambda)$:

$$\lambda P_n(\lambda) = P_{n+1}(\lambda) + S_n P_n(\lambda) + R_n P_{n-1}(\lambda), \tag{11.4.5}$$

where the coefficients $h_n$, $R_n$, and $S_n$ will be determined shortly. ($S_n$ can be
set equal to zero because we assume that $U(\phi) = U(-\phi)$.)

Now we wish to show that $Z_n$ can be written entirely in terms of $h_i$.

We first notice that $P_n(\lambda) = \lambda^n$ plus lower powers of $\lambda$. Next, we notice that
the integral of $P_n$ times $\lambda^i$ over $d\mu(\lambda)$ is equal to zero if $i < n$. This means that
we can always power expand $P_n$ and integrate over each power of $\lambda^i$, where
most integrals actually vanish.

Using (11.4.4), we now observe that

$$\begin{aligned}
h_n &= \int d\mu(\lambda) P_n^2(\lambda) = \int d\mu(\lambda) P_n(\lambda) \lambda^n \\
&= Z_n^{-1} \int d\mu(\lambda) \prod_{i=1}^{n} d\mu(\lambda_i)\, \Delta_n^2 \prod_{i=1}^{n} (\lambda - \lambda_i)\, \lambda^n \\
&= [(n+1) Z_n]^{-1} \int \prod_{i=1}^{n+1} d\mu(\lambda_i)\, \Delta_{n+1}^2 \\
&= \frac{Z_{n+1}}{(n+1) Z_n},
\end{aligned} \tag{11.4.6}$$

where we have set $\lambda = \lambda_{n+1}$ and have written $\Delta_{n+1}^2$ in terms of $\Delta_n$.

Using the recursion relation Eq. (11.4.5), we can also show

$$\begin{aligned}
h_{n+1} &= \int d\lambda \exp[-\beta U(\lambda)] P_{n+1} \lambda P_n \\
&= \int d\lambda \exp[-\beta U(\lambda)] (P_{n+2} + S_{n+1} P_{n+1} + R_{n+1} P_n) P_n \\
&= R_{n+1} h_n.
\end{aligned} \tag{11.4.7}$$

Using Eq. (11.4.6), we can now write the partition function entirely in terms
of the functions $h_i$:

$$Z_N = N! \prod_{i=0}^{N-1} h_i. \tag{11.4.8}$$

This is the desired expression.

It shows that all the information concerning the system is encoded within
the $h_n$, or within the $R_n$ and $S_n$. The precise form of the potential $U$ determines
these coefficients, which in turn determine the partition function exactly.
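To make the orthogonal-polynomial machinery concrete, the sketch below builds the monic polynomials by Gram-Schmidt for one illustrative case that is not taken from the text: the Gaussian potential $U = \lambda^2/2$ with $\beta = 1$, with the weight normalized so that its zeroth moment is 1. The function names (`moment`, `monic_orthogonal`, `check_recursion`) are hypothetical helpers. It checks that $R_n = h_n/h_{n-1}$, as in Eq. (11.4.7), and the three-term recursion of Eq. (11.4.5) with $S_n = 0$.

```python
from fractions import Fraction

def moment(k):
    """k-th moment of exp(-x^2/2)/sqrt(2*pi): (k-1)!! for even k, 0 for odd k."""
    if k % 2:
        return Fraction(0)
    result = Fraction(1)
    for j in range(1, k, 2):
        result *= j
    return result

def inner(p, q):
    """Inner product of two polynomials given as coefficient lists (lowest power first)."""
    return sum(a * b * moment(i + j) for i, a in enumerate(p) for j, b in enumerate(q))

def monic_orthogonal(n_max):
    """Gram-Schmidt on 1, x, x^2, ...; returns the monic P_n and their norms h_n."""
    polys, norms = [], []
    for n in range(n_max + 1):
        p = [Fraction(0)] * n + [Fraction(1)]           # start from x^n
        for q, h in zip(polys, norms):
            c = inner(p, q) / h                          # projection onto an earlier P_m
            p = [a - c * (q[i] if i < len(q) else 0) for i, a in enumerate(p)]
        polys.append(p)
        norms.append(inner(p, p))
    return polys, norms

polys, h = monic_orthogonal(6)
R = [None] + [h[n] / h[n - 1] for n in range(1, 7)]      # R_n = h_n / h_{n-1}

def check_recursion(n):
    """Verify x*P_n = P_{n+1} + R_n*P_{n-1} coefficient by coefficient."""
    lhs = [Fraction(0)] + polys[n]                       # multiplication by x shifts powers
    rhs = list(polys[n + 1])
    for i, a in enumerate(polys[n - 1]):
        rhs[i] += R[n] * a
    return lhs == rhs
```

For this Gaussian weight the norms are $h_n = n!$ and the recursion coefficients are $R_n = n$, which is also what the relation $n/\beta = 2 g_1 R_n$ gives for $g_1 = 1/2$, $\beta = 1$ when only the quadratic term of the potential is present.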

From these recursion relations, we can now actually solve for the free energy
of the system. Because $\lambda P_n' \sim n\lambda^n$, we can rewrite the recursion relation as

$$\begin{aligned}
n h_n &= \int d\lambda\, e^{-\beta U} \lambda P_n' P_n = \int d\lambda\, e^{-\beta U} P_n' (P_{n+1} + R_n P_{n-1}) \\
&= R_n \int d\lambda\, e^{-\beta U} P_n' P_{n-1} = R_n \int d\lambda\, e^{-\beta U} \beta U' P_n P_{n-1},
\end{aligned} \tag{11.4.9}$$

where we have integrated by parts. Fortunately, the last integral can be
performed exactly by using the explicit power expansion of the potential $U$:

$$\begin{aligned}
\int d\lambda\, e^{-\beta U} P_n U' P_{n-1} &= \int d\lambda\, e^{-\beta U} P_n \Big[\sum_{p=0} 2(p+1) g_{p+1} \lambda^{2p+1}\Big] P_{n-1} \\
&= h_n \Big[\sum_{p=0} 2(p+1) g_{p+1} \sum_{\text{paths}} R_{\alpha_1} \cdots R_{\alpha_p}\Big],
\end{aligned} \tag{11.4.10}$$

where we have expanded

$$U = \sum_{p=1} g_p \lambda^{2p} \tag{11.4.11}$$

and the sum over paths can be defined as follows.

Imagine a staircase which zigzags up and down, such that it begins at height
$(n-1)$ and ends at height $n$. The staircase has $p+1$ steps that go up, and $p$ steps
that go down, so it has $2p+1$ steps altogether. There are $(2p+1)!/p!(p+1)!$
such possible staircases. For each of these possible staircases, associate a
product of $R$'s. A down step from $k$ to $k-1$ generates a factor of $R_k$, and an
up step generates the number 1. Thus, the sum over paths is the sum over all
possible staircases in Eq. (11.4.10), such that each staircase is associated with
a product of $R$'s that label the down steps from $k$ to $k-1$.

For example, for $p = 1$, the sum can be represented as

$$\sum_{\text{paths}} = R_{n-1} + R_n + R_{n+1}. \tag{11.4.12}$$

For $p = 2$, the sum becomes

$$\begin{aligned}
\sum_{\text{paths}} = {} & R_{n-2}R_{n-1} + R_{n-1}^2 + 2R_{n-1}R_n \\
& + R_{n-1}R_{n+1} + R_n^2 + 2R_nR_{n+1} + R_{n+1}^2 + R_{n+1}R_{n+2}.
\end{aligned} \tag{11.4.13}$$
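The staircase rule is easy to enumerate by machine. The sketch below is a hypothetical helper (`staircase_sum` is not a name from the text) that lists every staircase from height $n-1$ to height $n$ with $p+1$ up steps and $p$ down steps and records, for each, the product of $R$'s as a sorted tuple of offsets $k$ standing for $R_{n+k}$. It assumes $n$ is large enough that no staircase is cut off at height zero.

```python
from collections import Counter
from itertools import combinations
from math import comb

def staircase_sum(p):
    """Sum over staircases from height n-1 to n with p+1 up steps and p down steps.

    Returns a Counter mapping a sorted tuple of offsets k (meaning R_{n+k})
    to the number of staircases that produce that product.
    """
    total = Counter()
    steps = 2 * p + 1
    for downs in combinations(range(steps), p):      # positions of the down steps
        height, factors = -1, []                     # height is measured relative to n
        for s in range(steps):
            if s in downs:
                factors.append(height)               # down step from k to k-1 gives R_k
                height -= 1
            else:
                height += 1                          # up step contributes a factor 1
        total[tuple(sorted(factors))] += 1
    return total

p1 = staircase_sum(1)   # R_{n-1} + R_n + R_{n+1}
p2 = staircase_sum(2)   # the ten terms of the p = 2 sum
```

Here `p2[(-1, 0)] == 2` corresponds to the term $2R_{n-1}R_n$, and the total number of staircases reproduces $(2p+1)!/p!(p+1)!$.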

Inserting the sum over paths into our original recursion relation, Eqs. (11.4.9)
and (11.4.10), we now have

$$\frac{n}{\beta} = R_n \sum_{p=0} 2(p+1) g_{p+1} \sum_{\text{paths}} R_{\alpha_1} \cdots R_{\alpha_p}. \tag{11.4.14}$$

Let us analyze this remarkable set of equations with a few examples.

Example: k = 2

For our first case, let us begin with the simple potential

$$U = g_1 \lambda^2 + g_2 \lambda^4. \tag{11.4.15}$$

We can now perform the sum over the paths explicitly. Using Eqs. (11.4.12)
and (11.4.14), we obtain (for $1 < n < N$ and $x = n/\beta$) [21]:

$$x = 2R_n [g_1 + 2g_2 (R_{n+1} + R_n + R_{n-1})]. \tag{11.4.16}$$

Now, let us take the large $N$ limit (for fixed $x$), in which case we have

$$R_n(x) = R(x) + [x(n-N)/N] R'(x) + \tfrac{1}{2} [x(n-N)/N]^2 R''(x) + \cdots \tag{11.4.17}$$

and Eq. (11.4.14) becomes

$$x = \sum_{p=0} \frac{(2p+2)!}{p!\,(p+1)!}\, g_{p+1} R^{p+1} \equiv w(R). \tag{11.4.18}$$

In the double scaling limit, Eq. (11.3.7), we define the multicritical points
$x_c$ and $R_c$ as the points where the following conditions are satisfied:

$$w'(R_c) = w''(R_c) = \cdots = w^{(k-1)}(R_c) = 0 \tag{11.4.19}$$

for some integer $k$. Let us investigate the simplest case $k = 2$ (which will
correspond to pure two-dimensional gravity). In this case, $w(R) - x_c$ is proportional
to $(R - R_c)^2$.

Now we must calculate corrections to Eq. (11.4.18) to higher orders in
$1/N^2$. By carefully calculating these higher-order corrections in Eq. (11.4.16)
via (11.4.17), we have

$$x = w(R) + N^{-2}\, 4g_2 x^2 R R''(x) + O(N^{-4}). \tag{11.4.20}$$

At the multicritical point, we find that $x - x_c$, $w(R) - x_c$, and $N^{-2}R''$ all have
the same order of magnitude.

We now introduce a function $f$, which is the scaling function. For $R_n$ near
$R_c$, we find that it approaches the following scaling limit

$$R_n - R_c = N^{-2/(2k+1)} f(t), \tag{11.4.21}$$

where $t = (x_c - x) N^{2k/(2k+1)}$.

Plugging all these values into Eq. (11.4.20), we find the following equation
for the scaling function in the multicritical limit

$$t = f^2 + f'', \tag{11.4.22}$$

whose solution is a Painlevé transcendent of the first kind [21-23].
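The perturbative content of the $k = 2$ scaling equation can be extracted mechanically. The sketch below assumes the normalization $t = f^2 + f''$ and the ansatz $f(t) = \sum_j a_j\, t^{(1-5j)/2}$ with $a_0 = 1$ (the planar solution $f = t^{1/2}$); both the ansatz and the helper name `painleve_coefficients` are illustrative assumptions, and other normalizations of the equation rescale the coefficients and change their signs.

```python
from fractions import Fraction

def painleve_coefficients(order):
    """Coefficients a_j of f(t) = sum_j a_j t^{(1-5j)/2} solving t = f^2 + f''."""
    a = [Fraction(1)]                                  # a_0 = 1: planar result f = t^{1/2}
    for j in range(1, order + 1):
        e = Fraction(1 - 5 * (j - 1), 2)               # exponent of the (j-1)-th term
        second_derivative = a[j - 1] * e * (e - 1)     # f'' feeds order j from order j-1
        cross_terms = sum(a[i] * a[j - i] for i in range(1, j))
        a.append(-(cross_terms + second_derivative) / 2)
    return a

coefficients = painleve_coefficients(3)
```

Each successive term is suppressed by $t^{-5/2}$, which plays the role of the squared string coupling in this expansion, and the numerators grow rapidly with the order.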

Example: k = 3

For our next example, we will extend our results to the next multicritical point.
For $k = 3$, the calculation is only a bit more difficult (but for arbitrary $k$, we
will have to use a different method). In this case, we can repeat the same steps
as before, starting from

$$U = g_1 \lambda^2 + g_2 \lambda^4 + g_3 \lambda^6. \tag{11.4.23}$$

This gives us [21]:

$$w(R) = 2g_1 R + 12 g_2 R^2 + 60 g_3 R^3. \tag{11.4.24}$$

The tricritical point ($w' = w'' = 0$) gives us

$$R_c = -g_2/15g_3. \tag{11.4.25}$$

In the large $N$ limit, a Taylor expansion gives us

$$\begin{aligned}
x = {} & w(R) + N^{-2} x^2 w''(R) R R''/6 + 30 N^{-2} x^2 g_3 R R'^2 \\
& + N^{-4} x^4 R R'''' (g_2 + 33 g_3 R)/3 + O(N^{-6}),
\end{aligned} \tag{11.4.26}$$

where only the terms that survive in the scaling limit have been displayed.
At the tricritical point, the above equation becomes the following equation for
the scaling function [21-23]:

$$t = f^3 + f f'' + (f')^2/2 + f''''/10. \tag{11.4.27}$$

By comparing this equation and its critical exponents with those found for
two-dimensional gravity coupled to (nonunitary) conformal matter, we can
determine that the $k = 3$ case corresponds to the Yang-Lee edge singularity.
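The tricritical conditions of the $k = 3$ example can be checked with exact arithmetic. The sketch below uses the coefficients $(2k)!/k!(k-1)!$ that multiply $g_k R^k$ in $w(R)$ and one illustrative choice of couplings, $g = (3/2, -1/4, 1/60)$, picked here (it is not from the text) so that $w(R) = 1 - (1-R)^3$. The helper names are hypothetical.

```python
from fractions import Fraction
from math import factorial

def w_coefficient(k):
    """Coefficient multiplying g_k R^k in w(R): (2k)! / (k! (k-1)!)."""
    return Fraction(factorial(2 * k), factorial(k) * factorial(k - 1))

def w_derivatives(g, R):
    """Return (w, w', w'') at R for couplings g = [g_1, g_2, ...]."""
    terms = [(k, w_coefficient(k) * gk) for k, gk in enumerate(g, start=1)]
    w0 = sum(c * R**k for k, c in terms)
    w1 = sum(k * c * R**(k - 1) for k, c in terms)
    w2 = sum(k * (k - 1) * c * R**(k - 2) for k, c in terms if k >= 2)
    return w0, w1, w2

# Illustrative couplings (an assumption, chosen so that w(R) = 1 - (1 - R)^3)
g = [Fraction(3, 2), Fraction(-1, 4), Fraction(1, 60)]
R_c = -g[1] / (15 * g[2])            # tricritical point R_c = -g_2 / (15 g_3)
values = w_derivatives(g, R_c)
```

With these couplings the critical point sits at $R_c = 1$, where $w = 1$ and both $w'$ and $w''$ vanish, as required for $k = 3$.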



For general values of $k$, we will shortly find that the multicritical matrix
models represent two-dimensional gravity coupled to nonunitary conformal
matter with

$$c = 1 - [6(p-q)^2/pq], \tag{11.4.28}$$

where $(p, q) = (2k-1, 2)$, or

$$c = 1 - \frac{3(2k-3)^2}{2k-1}. \tag{11.4.29}$$


Using the method of orthogonal polynomials, we see that we have been able 
to find exact solutions for the scaling functions in the large N limit. 

However, we see that the equation for the scaling function for higher k 
becomes prohibitively difficult because of the sum over the staircases in Eq. 
(11.4.14) found in the recursion relations for the orthogonal polynomials. We 
will thus need a more powerful formalism. 


11.5 KdV Hierarchy 


Let us now switch to an operator language, which will simplify our calculation
of the $h_n$ and allow us to generalize the previous results for arbitrary values of
$k$ [22-24]. In particular, we will see the Korteweg-de Vries (KdV) hierarchy emerge.

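Let us define the scalar product as

$$\langle A|B\rangle = \int d\lambda \exp[-\beta U(\lambda)] A(\lambda) B(\lambda). \tag{11.5.1}$$

For example, if we define the vector $|n\rangle$ via the polynomial

$$\tilde P_n(\lambda) = P_n(\lambda)/\sqrt{h_n} \rightarrow |n\rangle, \tag{11.5.2}$$

then we have

$$\langle n|m\rangle = \delta_{n,m}. \tag{11.5.3}$$

From Eqs. (11.4.3) and (11.4.5), we have

$$\langle n|\hat\lambda|m\rangle = \delta_{n,m+1}\sqrt{R_n} + \delta_{m,n+1}\sqrt{R_m} + \delta_{n,m} S_m, \tag{11.5.4}$$

where the function $\lambda$ is now written as an operator $\hat\lambda$.

Now consider the following relations:

$$\int d\mu(\lambda)\, \tilde P_n(\lambda) \frac{d}{d\lambda} \tilde P_n = 0, \qquad
\int d\mu(\lambda)\, \tilde P_{n-1} \frac{d}{d\lambda} \tilde P_n = n\sqrt{\frac{h_{n-1}}{h_n}} = \frac{n}{\sqrt{R_n}}, \tag{11.5.5}$$

where $d\mu(\lambda) = \exp[-\beta U(\lambda)]\, d\lambda$. These relations, which follow from the
orthonormality of $\tilde P_n$, can be written as

$$\langle n|U'(\hat\lambda)|n\rangle = 0, \qquad \langle n-1|U'(\hat\lambda)|n\rangle = n/(\beta\sqrt{R_n}), \tag{11.5.6}$$

where we have integrated by parts. The operator $d/d\lambda$ pulls down a factor
of $\beta U'(\lambda)$, which then becomes an operator. These equations contain the same
information as Eq. (11.4.14). We now take the large $N$ limit; this means that we
will replace the discrete variable $n$ with the continuous variable $x = n/\beta$
and replace $R_n$ by $R(x)$.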

To solve the system of equations, Eq. (11.5.6), exactly in this limit, we must
convert the bra vector $\langle n-1|$ into $\langle n|$. This is most easily done by introducing
two conjugate operators $\hat n$ and $\theta$, such that $\langle n|\hat n = \langle n|\, n/\beta$. We can express
these conjugate operators as $\hat n = (-i/\beta)\, d/d\theta$ and $\theta = (i/\beta)\, d/d\hat n$. With
these operators, we can show that $\langle n-1| = \langle n| e^{-i\theta}$. Now we can eliminate
the presence of $\theta$ by taking the large $\beta$ limit, where we can power expand $e^{-i\theta}$.
The only remnant of this operator is a term $d^2/d\hat n^2$, which can be rewritten as
$d^2/dx^2$.

Collecting all terms, we then find that we can rewrite Eq. (11.5.6) as

$$x\beta = \langle x|U'(2 - H)|x\rangle, \qquad
H = -\frac{1}{\beta^2}\frac{d^2}{dx^2} + [1 - R(x)], \tag{11.5.7}$$

where $R_c = 1$ and we have introduced the effective Hamiltonian of the system
$H$, whose derivative term comes from $d^2/d\hat n^2$.

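We will find it convenient to define $U$ as the following:

$$U(\phi) = B(\tfrac{1}{2}, -\tfrac{1}{2} - \nu)(2 - \phi)^{\nu + 1/2} + (\phi \rightarrow -\phi), \tag{11.5.8}$$

where $B$ is the Euler beta function. (There is a certain amount of arbitrariness
in this choice, since most of the details of the potential are washed out when
we approach a critical point.) Written in this form, we can now insert the
expression for $U$ into Eq. (11.5.7), and we find

$$\beta x = -2\nu B(\tfrac{1}{2}, \tfrac{1}{2} - \nu)\langle x|H^{\nu - 1/2}|x\rangle. \tag{11.5.9}$$

Rescaling $x \rightarrow 1 - t\beta^{-2k/(2k+1)}$, we can write

$$\begin{aligned}
t &= -2\nu B(\tfrac{1}{2}, \tfrac{1}{2} - \nu)\, \langle t|H^{\nu - (1/2)}|t\rangle \\
&= -2\nu B(\tfrac{1}{2}, \tfrac{1}{2} - \nu)\, \langle t|[-(d/dt)^2 + f(t)]^{\nu - (1/2)}|t\rangle \\
&= -2\nu B(\tfrac{1}{2}, \tfrac{1}{2} - \nu)\, \frac{\cos \pi\nu}{\pi} \int_0^\infty d\omega\, \omega^{\nu - 1/2} \Big\langle t\Big|\frac{1}{\omega + H}\Big|t\Big\rangle,
\end{aligned} \tag{11.5.10}$$

where the last line uses the standard integral representation of a fractional power
of an operator, understood by analytic continuation in $\nu$.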

Fortunately, at this point we have reduced the problem to solving for the
matrix element of the inverse of the Schrödinger operator $H$, which can be
solved via the methods of Gelfand and Dikii [25]. We are interested in solving
for the matrix element

$$R(t, t'; \xi) = \Big\langle t\Big|\frac{1}{H + \xi}\Big|t'\Big\rangle. \tag{11.5.11}$$

The solution of this equation can be written in terms of the KdV hierarchy. For
example, let us define the following operator:

$$K[f(t), \nabla_t] = -\tfrac{1}{2}\nabla_t^2 + f(t) + \nabla_t^{-1} f(t) \nabla_t. \tag{11.5.12}$$

Then, we can show that the matrix element of the inverse of the Schrödinger
Hamiltonian can be given as

$$\begin{aligned}
\Big\langle t\Big|\frac{1}{H + \xi}\Big|t\Big\rangle &= \sum_{l=0}^{\infty} \frac{R_l[f]}{\xi^{l + 1/2}}, \\
R_l[f] &= \tfrac{1}{2}\big(-\tfrac{1}{2} K[f(t), \nabla_t]\big)^l \cdot 1, \\
R_{l+1}' &= \tfrac{1}{4} R_l''' - u R_l' - \tfrac{1}{2} u' R_l,
\end{aligned} \tag{11.5.13}$$

where $u$ denotes the potential $f(t)$ appearing in $H$.

[To prove this, we note that $R$ satisfies $(H + \xi)_t R = (H + \xi)_{t'} R = \delta(t - t')$.
Therefore, $c(\xi) = R R_{tt'} - R_t R_{t'}$ is a constant independent of $t$ and $t'$. Given
the behavior of $R$ at infinity, we can set $c(\xi)$ to zero. If we then set $t = t'$,
we have the differential equation $-R''' + 4(u + \xi)R' + 2u'R = 0$. Then Eq.
(11.5.13) follows.]

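We now have all the tools necessary to solve for $t$. By inserting the expression
for the resolvent $R(t, t; \xi)$ into Eq. (11.5.10) for $t$, we find [22]:

$$t = \frac{k!}{(2k-1)!!}\, K[f(t), \nabla_t]^k \cdot 1. \tag{11.5.14}$$

It is now straightforward to calculate the values of the various $R_l$ from Eq.
(11.5.13), which can be written as

$$\begin{aligned}
R_0 &= \tfrac{1}{2}, \qquad R_1 = -\tfrac{1}{4} f, \\
R_2 &= \tfrac{1}{16}(3f^2 - f''), \\
R_3 &= -\tfrac{1}{64}[10f^3 - 10ff'' - 5(f')^2 + f''''], \\
R_4 &= \tfrac{1}{256}[35f^4 - 70f(f')^2 - 70f^2f'' \\
&\qquad\quad + 21(f'')^2 + 28f'f''' + 14ff'''' - f''''''].
\end{aligned} \tag{11.5.15}$$

Expanding Eq. (11.5.14), we find

$$\begin{aligned}
k = 1:\quad t &= f, \\
k = 2:\quad t &= f^2 - \tfrac{1}{3}f'', \\
k = 3:\quad t &= f^3 - ff'' - \tfrac{1}{2}(f')^2 + \tfrac{1}{10}f'''', \\
k = 4:\quad t &= f^4 - 2f(f')^2 - 2f^2f'' + \tfrac{3}{5}(f'')^2 \\
&\qquad + \tfrac{4}{5}f'f''' + \tfrac{2}{5}ff^{(4)} - \tfrac{1}{35}f^{(6)}.
\end{aligned} \tag{11.5.16}$$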
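The Gelfand-Dikii recursion can be verified without a computer algebra system. The sketch below implements a deliberately tiny algebra of differential polynomials in $u$ (all function names are hypothetical) and checks that the list $R_0, \ldots, R_4$, written with $u = f$, satisfies $R_{l+1}' = \tfrac14 R_l''' - uR_l' - \tfrac12 u'R_l$. It also checks that multiplying $R_k$ by $(-1)^k 2^{k+1} k!/(2k-1)!!$ normalizes the leading term to $u^k$, which is the normalization assumed here for the string equations with leading term $f^k$.

```python
from fractions import Fraction
from collections import defaultdict
from math import factorial

# A differential polynomial in u is a dict {monomial: coefficient}; a monomial is a
# sorted tuple of derivative orders, e.g. (0, 0, 2) stands for u*u*u''.

def clean(p):
    return {m: c for m, c in p.items() if c != 0}

def add(*terms):
    """Linear combination: add((s1, p1), (s2, p2), ...) = s1*p1 + s2*p2 + ..."""
    out = defaultdict(Fraction)
    for scale, p in terms:
        for m, c in p.items():
            out[m] += scale * c
    return clean(out)

def mul(p, q):
    out = defaultdict(Fraction)
    for m1, c1 in p.items():
        for m2, c2 in q.items():
            out[tuple(sorted(m1 + m2))] += c1 * c2
    return clean(out)

def diff(p):
    """Total derivative d/dt by the Leibniz rule."""
    out = defaultdict(Fraction)
    for m, c in p.items():
        for i in range(len(m)):
            raised = m[:i] + (m[i] + 1,) + m[i + 1:]
            out[tuple(sorted(raised))] += c
    return clean(out)

u = {(0,): Fraction(1)}
du = diff(u)
F = Fraction
R = [
    {(): F(1, 2)},
    {(0,): F(-1, 4)},
    {(0, 0): F(3, 16), (2,): F(-1, 16)},
    {(0, 0, 0): F(-10, 64), (0, 2): F(10, 64), (1, 1): F(5, 64), (4,): F(-1, 64)},
    {(0, 0, 0, 0): F(35, 256), (0, 1, 1): F(-70, 256), (0, 0, 2): F(-70, 256),
     (2, 2): F(21, 256), (1, 3): F(28, 256), (0, 4): F(14, 256), (6,): F(-1, 256)},
]

def recursion_rhs(Rl):
    """Right-hand side (1/4) R_l''' - u R_l' - (1/2) u' R_l."""
    return add((F(1, 4), diff(diff(diff(Rl)))), (-1, mul(u, diff(Rl))), (F(-1, 2), mul(du, Rl)))

def double_factorial(n):
    return 1 if n <= 0 else n * double_factorial(n - 2)

def leading_normalization(k):
    """(-1)^k 2^(k+1) k! / (2k-1)!! times the coefficient of u^k in R_k."""
    norm = F((-1) ** k * 2 ** (k + 1) * factorial(k), double_factorial(2 * k - 1))
    return norm * R[k][(0,) * k]
```

A design note: monomials are stored as sorted tuples so that products of commuting factors such as $u\,u''$ and $u''\,u$ are identified automatically.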


Notice that, for $k = 2$ and $k = 3$, we retrieve the results for Eqs. (11.4.22)
and (11.4.27) (modulo trivial minus signs and rescalings) which were found
by explicitly summing over the lower-order staircases in Eq. (11.4.14). (We
can also check the correctness of these results perturbatively. We know that,
in the planar limit, that is, genus 0, the derivative terms in the expansion of the
Hamiltonian vanish. Setting all primes to zero in the previous expression, we
find that

$$\lim_{g_{\text{string}} \rightarrow 0} f(t) = t^{1/k}, \tag{11.5.17}$$

which is the correct perturbative expression.)

Although we have found the differential equations that define the solution to 
our problem, we find that we need more data before we can uniquely calculate 
their behavior. This is because perturbation theory does not specify the initial 
conditions for these higher-order differential equations in Eq. (11.5.16). These 
initial conditions cannot be determined perturbatively, that is, they arise only 
because we have entered a fully nonperturbative domain of two-dimensional 
gravity. 

Now that we have found explicit relations that the scaling function obeys, it is
also straightforward to calculate the correlation functions of the theory. We
recall that the presence of the potential $U$ in the matrix model Lagrangian created
a new function $w(R)$ in Eq. (11.4.18) that determined the scaling properties of
the free energy.

More precisely, if the potential $U$ was given by

$$U(\phi) = \sum_k U_{2k} \phi^{2k} \tag{11.5.18}$$

for some coefficients $U_{2k}$, then the corresponding $w(R)$ is given by

$$w(R) = \sum_k \frac{(2k)!}{k!\,(k-1)!}\, U_{2k} R^k. \tag{11.5.19}$$

This tight relationship between $U$ and $w(R)$ can also be written in terms of
line integrals

$$w(R) = \oint \frac{dz}{2\pi i}\, U'\Big(z + \frac{R}{z}\Big), \qquad
U(\phi) = \int_0^1 \frac{du}{u}\, w[u(1-u)\phi^2]. \tag{11.5.20}$$
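The two maps between $U$ and $w$ are inverse to each other coefficient by coefficient, which the sketch below checks with exact fractions. It assumes the combinatorial factor $(2k)!/k!(k-1)!$ of Eq. (11.5.19) and uses the Beta integral $\int_0^1 du\, u^{k-1}(1-u)^k = (k-1)!\,k!/(2k)!$ for the inverse map; the helper names and the sample $w(R) = 2R - R^2$ are illustrative choices, not taken from the text.

```python
from fractions import Fraction
from math import comb, factorial

def w_from_U(U):
    """Map {k: U_2k} to {k: coefficient of R^k in w} using (2k)!/(k!(k-1)!)."""
    return {k: Fraction(factorial(2 * k), factorial(k) * factorial(k - 1)) * c
            for k, c in U.items()}

def U_from_w(w):
    """Inverse map, using int_0^1 du u^(k-1) (1-u)^k = (k-1)! k! / (2k)!."""
    return {k: Fraction(factorial(k - 1) * factorial(k), factorial(2 * k)) * c
            for k, c in w.items()}

def contour_coefficient(k):
    """Residue of (2k) (z + R/z)^(2k-1) at z = 0, in units of R^k."""
    return 2 * k * comb(2 * k - 1, k)

w_sample = {1: Fraction(2), 2: Fraction(-1)}     # w(R) = 2R - R^2 (illustrative)
U_sample = U_from_w(w_sample)
```

For this sample, the potential comes out as $U = \phi^2 - \phi^4/12$, whose quartic term is negative, so it is unbounded from below.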


Near the $k$th critical point, the first $k - 1$ derivatives of $w(R)$ vanish. Near this
point, therefore,

$$w(R) = 1 - (R_c - R)^k, \tag{11.5.21}$$

where we can set $R_c = 1$. [At this point, we note that the potential $U_k$
corresponding to $w$ at the critical point is a sum of factors $(-1)^{l-1}\phi^{2l}$, summed
from $l = 1$ to $k$. This means that the pure gravitational case, corresponding to
$k = 2$, has a potential that is not bounded. This case, strictly speaking, is not
well defined. This means that we must modify the potential in this case, that
is, add corrections to the potential that bend the potential upward at infinity,
so that the good features of the model are preserved.]

We now wish to calculate correlation functions among a new set of operators,
called $O_l$, which will have useful scaling properties. Specifically, if we are in the
$k$th critical mode, we demand that, by adding this function $O_l$ to the potential
$U$, we create a corresponding perturbation within $w(R)$ corresponding to the
$l$th critical mode, that is,

$$w_k(R) + \epsilon[w_l(R) - 1] \tag{11.5.22}$$

for small $\epsilon$.

Because of the tight relationship between $U$ and $w(R)$, we can find an
explicit representation of this operator $O_l$ that generates the above change in
$w(R)$:

$$O_l(\phi) \sim \text{Tr} \int_0^1 \frac{du}{u}\, [1 - u(1-u)\phi^2]^l. \tag{11.5.23}$$

The above expression is actually divergent, but its divergent piece vanishes in
the scaling limit.

Let us now find the operator associated with this small perturbation.
Recalling that $\phi \sim 2 - H$, we can write

$$O_l \sim H^{l + 1/2}. \tag{11.5.24}$$

We can take the matrix element of this operator, recalling that the matrix
element of $H$ is related to the KdV operator $K[f(t)]$. We can normalize $O_l$ as
follows:

$$\langle O_l \rangle = \frac{l!}{(2l+1)!!} \int_t^{\infty} dt'\, K[f(t')]^{l+1} \cdot 1. \tag{11.5.25}$$

Notice that this correlation function is exact. No approximations, other than the
double scaling limit, have been made. Now, let us go to the perturbative limit.
As before, to lowest order in the string coupling constant $g_{\text{string}}$, the derivatives
disappear in the Hamiltonian, that is, the primes disappear within the KdV
hierarchy. Thus,

$$K[f]^{l+1} \cdot 1 \rightarrow \frac{(2l+1)!!}{(l+1)!}\, f^{l+1} \tag{11.5.26}$$

and $f(t) = t^{1/k}$. Solving for the value of this and higher correlation functions
in the spherical (genus 0) limit, we find [22]:

$$\begin{aligned}
\langle O_{l_1} \rangle &= \frac{k\, t^{1 + (l_1 + 1)/k}}{(l_1 + 1)(l_1 + k + 1)}, \\
\langle O_{l_1} O_{l_2} \rangle &= \frac{1}{l_1 + l_2 + 1}\, t^{(l_1 + l_2 + 1)/k}, \\
\langle O_{l_1} O_{l_2} O_{l_3} \rangle &= \frac{1}{k}\, t^{-1 + (l_1 + l_2 + l_3 + 1)/k}, \\
\langle O_{l_1} O_{l_2} O_{l_3} O_{l_4} \rangle &= \frac{1}{k^2}(l_1 + l_2 + l_3 + l_4 + 1 - k)\, t^{-2 + (l_1 + l_2 + l_3 + l_4 + 1)/k}.
\end{aligned} \tag{11.5.27}$$

Correlations such as the above will be useful in establishing the equivalence 
between the matrix models in the double scaling limit and topological field 
theory, which will be discussed in Chapter 12. 

Last, before leaving the one-matrix model, let us make some comments
about transitions between multicritical points. In principle, we should examine
the possibility of transitions between different values of $k$ in our formalism.
Consider, for example, the string equation,

$$t = \sum_l (l + \tfrac{1}{2})\, t_l R_l(u), \tag{11.5.28}$$

where we have now generalized Eq. (11.5.14) by summing over many
multicritical points and introducing the variable $t_l$ for each critical point.
(Normalizations can be absorbed into $t_l$.) The variable $u$ is now a function of $t = t_0$
and $t_l$.

Equation (11.5.28) gives us a way in which to analyze the relationship
between many critical points. Surprisingly enough, we find this same equation
appearing in topological field theory, which once again supports the claim that
matrix models and topological field theories are the same.

One way of implementing many critical points is to add new terms to the
original action proportional to $t_l O_l$, so that taking variations of the correlation
functions with respect to $t_l$ brings down factors of $O_l$. This is a convenient
formalism, but we have to physically interpret the meaning of $O_0$. From our
original definition of $O_l$ as the correction to the potential that induces Eq.
(11.5.22), we see that the effect of $O_0$ is to change $w$ by a constant, so that
$t = t_0$ is shifted. But $t_0$ is conjugate to the area of the random surface, so we
can think of $O_0$ as a "puncture operator" $P$ that does nothing but pick out a
marked point on the surface. Symbolically, we can now summarize this by

$$\langle P \rangle = \frac{\partial F}{\partial t_0}. \tag{11.5.29}$$


Since the susceptibility can be written in terms of the second derivative of the
free energy, we also have

$$u = \langle PP \rangle = \frac{\partial^2 F}{\partial t_0^2}. \tag{11.5.30}$$

From Eqs. (11.5.13) and (11.5.25), we also see that the expectation value of
$O_l$ is proportional to the integral of $R_{l+1}$ with respect to $t_0$. With a suitable
normalization, we can write

$$\frac{\partial}{\partial t_0} \langle O_l \rangle = R_{l+1}. \tag{11.5.31}$$

Let us put this all together. Since a derivative with respect to $t_0$ brings down
a $P$, and a derivative with respect to $t_l$ brings down an $O_l$, we now have,
symbolically,

$$\partial/\partial t_0 = P, \qquad \partial/\partial t_l = O_l. \tag{11.5.32}$$

This, in turn, implies the identity

$$\langle O_l PP \rangle = \frac{\partial u}{\partial t_l} = \frac{\partial}{\partial t_0} R_{l+1}. \tag{11.5.33}$$

This last equation shows that the matrix models generate the $l + 1$ flow of
the KdV hierarchy. In summary, when analyzing transitions between critical
points, the fundamental equation governing the behavior of the system is given
by the string equation, Eq. (11.5.28), in terms of functions $R_l$, which obey
the flows of the KdV hierarchy, Eq. (11.5.32) and the recursion relation, Eq.
(11.5.13).


11.6 Multimatrix Models 

So far, we have analyzed the one-matrix model, which successfully reproduced
the behavior of two-dimensional gravity coupled to nonunitary conformal
matter, labeled by an index $k$. However, by generalizing the theory to include
multimatrix models, we should be able to reproduce two-dimensional gravity
coupled to a much wider class of conformal models, such as the minimal $(p, q)$
series.

The generalization to higher matrix models is straightforward. Let $M_i$
represent $q - 1$ distinct $N \times N$ matrices. The $(q - 1)$-matrix model is defined by
the functional integral

$$Z = \ln \int \prod_{i=1}^{q-1} dM_i \exp\Big\{-\text{Tr}\Big[\sum_{i=1}^{q-1} V_i(M_i) - \sum_{i=1}^{q-2} c_i M_i M_{i+1}\Big]\Big\}. \tag{11.6.1}$$

As before, we diagonalize the $M_i$ matrix in terms of its eigenvalues and
integrate over its angular part. Repeating the steps for the one-matrix model, we
find

$$Z = \ln \int \prod_{\substack{i=1,q-1 \\ p=1,N}} d\lambda_i^p\, \Delta(\lambda_1) \Delta(\lambda_{q-1}) \exp\Big\{-\Big[\sum_{i,p} V_i(\lambda_i^p) - \sum_{i,p} c_i \lambda_i^p \lambda_{i+1}^p\Big]\Big\}, \tag{11.6.2}$$

where

$$\Delta(\lambda_i) = \prod_{p_1 < p_2} (\lambda_i^{p_1} - \lambda_i^{p_2}). \tag{11.6.3}$$

Now, define $Q_i$ and $P_i$ to be operators that represent insertions of $\lambda_i$ and
$d/d\lambda_i$, respectively, into the functional integral. Although these operators are
actually quite complicated, they must obey the simple relation

$$[P_i, Q_i] = 1 \tag{11.6.4}$$

because $\lambda$ and $d/d\lambda$ have the same commutation relations. In the double scaling
limit, experience shows that the insertion of an extra $\lambda$ into the integral can be
accomplished via $q$th-order differential operators of the form

$$Q = d^q + \{v_{q-2}(x), d^{q-2}\} + \cdots + 2v_0(x), \tag{11.6.5}$$

where $d = d/dx$ and the $v_i(x)$ are simple functions.

Our strategy is to make this commutation relation, Eq. (11.6.4), the basis for
the generalized KdV hierarchies [26]. We solve this commutation relation by
constructing a new $p$th-order operator out of the operator $Q_i$, using the theory
of pseudodifferential operators. We demand that this new operator satisfies the
same commutation relations as before. We then identify this operator with $P_i$.

The theory of pseudodifferential operators tells us that it is possible to create
a $p$th-order operator from a $q$th-order operator. Given the $q$th-order operator
$Q$, we can define

$$Q^{1/q} = d + \sum_{i=1}^{\infty} \{e_i, d^{-i}\}, \tag{11.6.6}$$

where we define the operator $d^{-1}$ to satisfy

$$d^{-1} f = \sum_{j=0}^{\infty} (-1)^j f^{(j)} d^{-j-1}. \tag{11.6.7}$$

(With this convention, it is easy to check that $d(d^{-1} f) = f$.)
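The parenthetical check can be written out in two lines. The sketch below assumes only the Leibniz rule $d\, g = g\, d + g'$ for a function $g$, applied to each term of the series that defines $d^{-1} f$.

```latex
d\,\bigl(d^{-1} f\bigr)
  = \sum_{j=0}^{\infty} (-1)^j \Bigl[ f^{(j)}\, d^{-j} + f^{(j+1)}\, d^{-j-1} \Bigr]
  = f + \sum_{j=0}^{\infty} \Bigl[ (-1)^{j+1} + (-1)^{j} \Bigr] f^{(j+1)}\, d^{-j-1}
  = f .
```

In the middle step the $j \ge 1$ part of the first sum has been relabeled $j \rightarrow j + 1$, so that it cancels the second sum term by term and only the $j = 0$ piece, $f\, d^0 = f$, survives.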



From the operator $Q^{1/q}$, we can construct the $p$th-order operator $Q^{p/q}$, such
that

$$P = Q_+^{p/q}, \tag{11.6.8}$$

where the plus (+) subscript means we take only the nonnegative powers of $d$
in the expansion. A minus ($-$) subscript means taking negative powers of $d$.
Generally, we define

$$Q = d^q + \sum_{i=0}^{q-2} \{v_i, d^i\}, \qquad
Q_-^{p/q} = \sum_{i=1}^{\infty} \{e_i, d^{-i}\}. \tag{11.6.9}$$

Evaluating the commutator yields

$$[P, Q] = [Q_+^{p/q}, Q] = [Q, Q_-^{p/q}] = \sum_{i=0}^{q-2} \{r_i, d^i\} = 1, \tag{11.6.10}$$

where

$$r_i = q e'_{q-1-i} + \cdots. \tag{11.6.11}$$

Last, imposing that the commutator yields 1 sets all $r_i$ equal to zero, except
for $r_0 = \tfrac{1}{2}$. Let us use this rather abstract formalism to first rederive the results
for the one-matrix model and then to derive the results for the multimatrix
model.

Example: One-Matrix Model

For the one-matrix model, the theory reproduces two-dimensional gravity
coupled to $(p, q)$ conformal matter, where $p = 2l - 1$ and $q = 2$. For $q = 2$, we
take the second-order Hermitian operator for $Q$:

$$Q = d^2 - u(x). \tag{11.6.12}$$

Our strategy is to construct the operator $P = Q_+^{p/q} = Q_+^{l - 1/2}$, where

$$Q^{l - 1/2} = d^{2l-1} - \frac{2l-1}{4} \{u, d^{2l-3}\} + \cdots. \tag{11.6.13}$$

We break this up into two pieces, containing positive and negative orders,

$$Q^{l - 1/2} = Q_+^{l - 1/2} + Q_-^{l - 1/2}, \tag{11.6.14}$$

with

$$Q_-^{l - 1/2} = \{R_l, d^{-1}\} + O(d^{-3}), \tag{11.6.15}$$

where $R_l$ is yet undetermined, and

$$\begin{aligned}
Q_+^{1/2} &= d, \qquad Q_+^{3/2} = d^3 - \tfrac{3}{4}\{u, d\}, \\
Q_+^{5/2} &= d^5 - \tfrac{5}{4}\{u, d^3\} + \tfrac{5}{16}\{(3u^2 + u''), d\}, \\
Q_+^{7/2} &= d^7 - \tfrac{7}{4}\{u, d^5\} + \tfrac{35}{16}\{(u^2 + u''), d^3\} \\
&\qquad - \tfrac{7}{64}\{[13u^{(4)} + 10uu'' + 10u^3 + 25(u')^2], d\}.
\end{aligned} \tag{11.6.16}$$

The $R_l$, which appears in $Q_-^{l - 1/2}$, can be calculated by taking repeated
commutators of the various $Q$'s. By evaluating

$$Q^{l + 1/2} = Q Q^{l - 1/2} = Q^{l - 1/2} Q, \tag{11.6.17}$$

we find

$$Q_+^{l + 1/2} = \tfrac{1}{2}(Q_+^{l - 1/2} Q + Q Q_+^{l - 1/2}) + \{R_l, d\}. \tag{11.6.18}$$

Commuting both sides with $Q$, we find the recursion relation

$$R_{l+1}' = \tfrac{1}{4} R_l''' - u R_l' - \tfrac{1}{2} u' R_l, \tag{11.6.19}$$

which is the same recursion relation satisfied by the $R_l$ found previously in Eq.
(11.5.13). Thus, we have now made contact with the KdV equations.

Last, we can show

$$[Q_+^{l - 1/2}, Q] = 4R_l'. \tag{11.6.20}$$

The right-hand side must equal 1. Integrating that relation to remove the prime,
we now have (after rescaling):

$$(l + \tfrac{1}{2}) R_l(u) = x, \tag{11.6.21}$$

which is the string equation found earlier in Eq. (11.5.28).

It is then straightforward to generalize the method to higher matrix models.

Example: Two-Matrix Model

For the two-matrix model [27, 28], we are interested in the coupling
between two-dimensional gravity and (4, 3) conformal matter, or the critical
Ising model. We find

$$\begin{aligned}
Q &= d^3 - \tfrac{3}{4}\{u, d\} + \tfrac{3}{2} w = K_+^{3/2} + \tfrac{3}{2} w, \\
P &= Q_+^{4/3} = K^2 + \{w, d\} + v,
\end{aligned} \tag{11.6.22}$$

where $v = -8R_2/3$, the operator $K$ is the $Q$ found in the one-matrix model in
Eq. (11.6.12), and $w$ is a breaking field resulting from coupling to a magnetic
field. The commutation relations yield

$$\begin{aligned}
1 = [P, Q] = {} & -\{(4R_2' + \tfrac{3}{2} v'), d^2\} + \tfrac{1}{2}\{(w''' - 3(uw)'), d\} \\
& + 8uR_2' + \tfrac{1}{2} v''' + \tfrac{3}{2} uv' + \tfrac{3}{2}(w^2)'.
\end{aligned} \tag{11.6.23}$$

Solving the equation and then integrating (so the number 1 becomes $x$), we
find

$$x = -8R_3 + \tfrac{3}{2} uv - \tfrac{1}{4} v'' + \tfrac{3}{2} w^2, \tag{11.6.24}$$

which is the solution for the two-matrix model. The higher matrix models can
be done in a similar manner.


Example: Three-Matrix Model

For the three-matrix model [27, 28], where we couple to (5, 4) conformal
matter (tricritical Ising model), we have

$$\begin{aligned}
Q &= K^2 + \{w, d\} + v, \\
P &= Q_+^{5/4} = K_+^{5/2} + \tfrac{5}{4}\{w, d^2\} + \tfrac{5}{8}\{v, d\} - \tfrac{5}{4} uw.
\end{aligned} \tag{11.6.25}$$

We repeat the same steps. The commutation relations yield

$$1 = [P, Q] = \sum_{i=0}^{2} \{r_i, d^i\}, \tag{11.6.26}$$

where

$$\begin{aligned}
0 = r_2 = {} & 4R_3' + \tfrac{5}{8} v''' - \tfrac{5}{4}(uv)' + \tfrac{5}{4}(w^2)', \\
0 = r_1 = {} & -[\tfrac{1}{4} w'''' - \tfrac{5}{8}(uw)'' - \tfrac{5}{8} uw'' - \tfrac{5}{4} vw + \tfrac{5}{8} u^2 w]', \\
1 = 2r_0 = {} & -8R_3' u - \tfrac{1}{4} v^{(5)} + \tfrac{5}{8}(2u'v'' + u''v' + 2vv' + 3u^2 v' + 4uu'v) \\
& - \tfrac{5}{2} ww''' - 5w'w'' + \tfrac{5}{2} w^2 u' + \tfrac{5}{2} uww'.
\end{aligned} \tag{11.6.27}$$

Solving the equation and integrating, we have the final result:

$$\begin{aligned}
x = {} & 8R_4 + \tfrac{1}{16} v^{(4)} + \tfrac{5}{8} v^2 + \tfrac{15}{8} u^2 v - \tfrac{5}{8}(uv'' + u'v' + u''v) \\
& - \tfrac{5}{4} ww'' + \tfrac{5}{4} uw^2 + Tu,
\end{aligned} \tag{11.6.28}$$

where $T$ is an integration constant.

In summary, using the method of pseudodifferential operators, the
fundamental relationships, Eqs. (11.6.4) and (11.6.11), have given us the differential
equations that define the nonperturbative behavior of the scaling functions.


11.7 D = 1 Matrix Models 

Up to now, we have only treated the case of two-dimensional gravity coupled
to conformal matter. Although the exact results involving the KdV hierarchy
are encouraging, we do not expect many of the features to survive when we
enter the region $D > 1$, where the KPZ formalism breaks down. In fact, as
we approach $D = 1$ with $k \rightarrow \infty$, the potential becomes infinitely steep, the
exponents become complex, and the formalism that we have developed breaks
down.

What is remarkable, however, is that the case of $D = 1$ is still solvable and
serves as a testing ground for many of our ideas concerning nonperturbative
strings [29-32]. For example, we expect the perturbation theory to break down
as we pass $D = 1$ because the ground state particle becomes a tachyon.
Evaluating the zero-point contribution to the string, we find that the mass
squared of the ground state is

$$m^2 = \tfrac{1}{12}(1 - D), \tag{11.7.1}$$

showing that the ground state becomes massless at $D = 1$ and tachyonic
beyond that limit ($D - 1$, not $D - 2$, appears in this equation because the
longitudinal mode does not decouple in noncritical string theory). Thus, we
expect the exact solution for the $D = 1$ case should reflect this fact, which
will destabilize the perturbation theory but perhaps allow a nonperturbative
interpretation.

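For the $D = 1$ case, the matrix field $M$ is now one dimensional, that is,
depends on a time variable, and we can write the equivalent Schrödinger equation
in this time variable

$$H\psi = N^2 E^{(0)}(g)\psi, \tag{11.7.2}$$

where

$$\begin{aligned}
H &= -\tfrac{1}{2}\Delta + V, \\
\Delta &= \sum_i \frac{\partial^2}{\partial M_{ii}^2} + \frac{1}{2} \sum_{i<j} \Big[\frac{\partial^2}{\partial\, \text{Re}\, M_{ij}^2} + \frac{\partial^2}{\partial\, \text{Im}\, M_{ij}^2}\Big], \\
V &= \tfrac{1}{2} \text{Tr}\, M^2 + \frac{g}{N} \text{Tr}\, M^4.
\end{aligned} \tag{11.7.3}$$

Now, let us repeat the same trick that we introduced before, decomposing the
matrix $M$ in terms of its eigenvalues $\lambda_i$ and its diagonalizing matrix $U$, which
can be trivially integrated over. We find that the expression for the energy is
given by

$$E^{(0)}(g) = \lim_{N \rightarrow \infty} \frac{1}{N^2} \min_{\psi}
\frac{\int \prod_i d\lambda_i \prod_{i<j} (\lambda_i - \lambda_j)^2 \Big[\tfrac{1}{2} \sum_i (\partial\psi/\partial\lambda_i)^2 + V(\lambda)\psi^2\Big]}
{\int \prod_i d\lambda_i \prod_{i<j} (\lambda_i - \lambda_j)^2\, \psi^2}. \tag{11.7.4}$$

Now, let us make the key observation, which will render this problem almost
trivial. We will define a new wave function $\phi$ as follows:

$$\phi(\lambda_1, \ldots, \lambda_N) = \Big[\prod_{i<j} (\lambda_i - \lambda_j)\Big] \psi(\lambda_1, \ldots, \lambda_N). \tag{11.7.5}$$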


The first remarkable feature now emerges. Notice that the presence of this
new factor has rendered the wave function $\phi$ antisymmetric, as if the
theory were based on fermions rather than bosons. The second remarkable
feature is that the highly coupled Schrödinger equation now reduces to a
noninteracting theory when we eliminate the $U$ matrix for the ground state

$$\sum_i \Big[-\frac{1}{2} \frac{\partial^2}{\partial\lambda_i^2} + \frac{\lambda_i^2}{2} + \frac{g}{N} \lambda_i^4\Big] \phi = N^2 E^{(0)} \phi. \tag{11.7.6}$$

This result shows the power of the substitution. Notice that the theory, which
at first seemed intractable, has now been reduced to an almost trivial problem,
the theory of a Fermi gas interacting via a central potential given by

$$\frac{\lambda^2}{2} + \frac{g}{N} \lambda^4. \tag{11.7.7}$$

This theory can now be solved using standard techniques. For example, we
know that in a Fermi gas, we can let $e_1 < e_2 < \cdots$ represent the energies of
each fermion subject to the Hamiltonian

$$h = -\frac{1}{2} \frac{d^2}{d\lambda^2} + \frac{\lambda^2}{2} + \frac{g}{N} \lambda^4. \tag{11.7.8}$$

Let $\mu_F$ be the Fermi level. Then the total energy is the sum of the individual
energies below the Fermi level or

$$N^2 E^{(0)} = \sum_k e_k\, \theta(\mu_F - e_k), \qquad
N = \sum_k \theta(\mu_F - e_k). \tag{11.7.9}$$


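So far, everything has been exact. Now, let us introduce the large $N$
approximation, so that the eigenvalues $\lambda_i$ become a smooth function $\lambda(x)$. Then,
$e_k$ can be written in terms of $p$, the momentum of the particle. We can then
integrate over $p$:

$$\begin{aligned}
N^2 E^{(0)} &= N\mu_F - \int \frac{d\lambda\, dp}{2\pi} \Big(\mu_F - \frac{p^2}{2} - \frac{\lambda^2}{2} - \frac{g}{N}\lambda^4\Big)\, \theta\Big(\mu_F - \frac{p^2}{2} - \frac{\lambda^2}{2} - \frac{g}{N}\lambda^4\Big), \\
N &= \int \frac{d\lambda\, dp}{2\pi}\, \theta\Big(\mu_F - \frac{p^2}{2} - \frac{\lambda^2}{2} - \frac{g}{N}\lambda^4\Big).
\end{aligned} \tag{11.7.10}$$

Let us now integrate over $p$ and then rescale by $\lambda \rightarrow \sqrt{N}\lambda$ and $\mu_F \rightarrow N\epsilon$.
Then, we have

$$\begin{aligned}
E^{(0)}(g) &= \epsilon - \int \frac{d\lambda}{3\pi} (2\epsilon - \lambda^2 - 2g\lambda^4)^{3/2}\, \theta(2\epsilon - \lambda^2 - 2g\lambda^4), \\
1 &= \int \frac{d\lambda}{\pi} (2\epsilon - \lambda^2 - 2g\lambda^4)^{1/2}\, \theta(2\epsilon - \lambda^2 - 2g\lambda^4).
\end{aligned} \tag{11.7.11}$$

This simple large $N$ approximation has yielded a solution for the energy that,
compared with numerical results of the anharmonic oscillator, yields results
that are only off by at most 12%. For example, asymptotically, the planar
approximation yields [17]:

$$E^{(0)}(g) \sim .58993\, g^{1/3} \tag{11.7.12}$$

while the exact result is

$$E^{(0)}(g) \sim .66799\, g^{1/3}. \tag{11.7.13}$$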
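The planar formulas can be evaluated numerically with very little code. The sketch below assumes the rescaled form $E^{(0)} = \epsilon - \int \frac{d\lambda}{3\pi}(2\epsilon - \lambda^2 - 2g\lambda^4)^{3/2}$ with the normalization condition $\int \frac{d\lambda}{\pi}(2\epsilon - \lambda^2 - 2g\lambda^4)^{1/2} = 1$, fixes $\epsilon$ by bisection, and compares with the quoted large-$g$ coefficient. The closed form in `strong_coupling_coefficient` is derived here by dropping the $\lambda^2$ term at large $g$ and is an assumption of this sketch, as are all function names.

```python
import math

def fermi_integrals(eps, g, n=2000):
    """Midpoint-rule values of the normalization and energy integrals at Fermi level eps."""
    if eps <= 0:
        return 0.0, 0.0
    if g == 0:
        edge = math.sqrt(2 * eps)                         # turning point for g = 0
    else:
        edge = math.sqrt((math.sqrt(1 + 16 * g * eps) - 1) / (4 * g))
    dtheta = (math.pi / 2) / n
    number = energy = 0.0
    for i in range(n):
        theta = (i + 0.5) * dtheta
        x = edge * math.sin(theta)                        # substitution removes the endpoint singularity
        k2 = max(2 * eps - x * x - 2 * g * x ** 4, 0.0)
        jac = 2 * edge * math.cos(theta) * dtheta         # factor 2: integrand is even in x
        number += math.sqrt(k2) * jac / math.pi
        energy += k2 ** 1.5 * jac / (3 * math.pi)
    return number, energy

def planar_energy(g):
    """Solve the normalization condition for eps by bisection, then return E(g)."""
    lo, hi = 0.0, 1.0
    while fermi_integrals(hi, g)[0] < 1.0:
        hi *= 2
    for _ in range(60):
        mid = 0.5 * (lo + hi)
        if fermi_integrals(mid, g)[0] < 1.0:
            lo = mid
        else:
            hi = mid
    eps = 0.5 * (lo + hi)
    return eps - fermi_integrals(eps, g)[1]

def strong_coupling_coefficient():
    """Large-g coefficient of g^(1/3), obtained by dropping the quadratic term (sketch)."""
    ratio = math.sqrt(2) * math.pi * math.gamma(1.75) / (math.gamma(0.25) * math.gamma(1.5))
    return (3 / 7) * ratio ** (4 / 3)
```

At $g = 0$ the routine returns $E^{(0)} = 1/2$, the value expected for $N$ free fermions in a harmonic well, and at large $g$ the ratio $E^{(0)}/g^{1/3}$ approaches the planar coefficient quoted in the text.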


We will now approach the $D = 1$ case from an entirely new direction. We will
exploit its resemblance to a system of harmonic oscillators. This, in turn, will
give us the ability to perform both weak and strong coupling approximations
which will reveal the non-Borel summability of the weak coupling approximation.
In this approximation, we will solve for the exact solution for the ground
state energy. We will find it convenient to introduce $\rho$, the density of states, as
follows:

$$\rho(e) = \frac{1}{\beta} \sum_n \delta(e_n - e). \tag{11.7.14}$$

Then, the ground state energy $E_{gs}$ in Eq. (11.7.9) can be written as

$$E_{gs} = \beta^2 \int_0^{\mu_F} \rho(e)\, e\, de, \qquad g = \frac{N}{\beta} = \int_0^{\mu_F} \rho(e)\, de, \tag{11.7.15}$$

where $\mu_F$ is the highest energy level, the Fermi level.

In the critical limit, we will take the limit $N/\beta = g \rightarrow g_c$. We will adjust the
potential so that $U = \mu_c$ as it approaches the Fermi level from above. Then, we
define the cosmological constants $\Delta$ and $\mu$ as $\Delta = g_c - g$ and $\mu = \mu_c - \mu_F$.
We will find it convenient to work with derivatives of Eq. (11.7.15):

$$\frac{\partial g}{\partial \mu} = -\rho(\mu_F), \qquad
\frac{\partial E_{gs}}{\partial \mu} = -\beta^2 \mu_F\, \rho(\mu_F) = \beta^2 \mu_F \frac{\partial g}{\partial \mu}. \tag{11.7.16}$$

Our strategy will be to solve for $\rho$ and then invert to find the ground state
energy as we perturb in $1/\beta\mu$, which is the weak coupling limit, or $\beta\mu$, which
is the strong coupling limit.

The key observation is that, in the scaling limit, the system resembles an 
inverted harmonic oscillator. 

To see this, we define

$$\rho(\mu_F) = \frac{1}{\pi\beta}\,\mathrm{Im}\,\mathrm{Tr}\,\frac{1}{h - \mu_F - i\epsilon}, \tag{11.7.17}$$

where $h$ is the Hamiltonian. If we expand near criticality, $y \sim x_c - x \sim 0$, so
$h - \mu_F \sim -(1/2\beta^2)\,\partial_y^2 - \mu_F - 2y^2 + O(y^3)$. For small $y$, we can ignore the
cubic and higher terms, so the theory is dominated by the $y^2$ term, that is, it
corresponds to an inverted harmonic oscillator. This is a great simplification,
because we have now reduced a rather complicated system to solving a known
system where we can use well-established methods to find its solutions.

We begin by remarking that the density of states of the normal harmonic
oscillator, of frequency $\omega$, is

$$\rho(E) = \frac{1}{\pi}\,\mathrm{Im}\sum_{n}\frac{1}{[n + \tfrac12]\omega - E - i\epsilon}. \tag{11.7.18}$$

When we continue $\omega$ to imaginary frequency, we have

$$\rho(\mu_F) = \frac{1}{\pi}\,\mathrm{Re}\sum_{n=0}^{\infty}\frac{1}{2n + 1 + i\beta\mu}. \tag{11.7.19}$$

This is singular, but the divergent part is $\beta$ dependent and hence will disappear
in the critical limit. We thus have

$$\rho(\mu_F) = (1/2\pi)\,\mathrm{Re}\{\zeta[1, \tfrac12(1 + i\beta\mu)]\} = (1/2\pi)\{\ln(\beta/2) - \mathrm{Re}\,\psi[(1 + i\beta\mu)/2]\}, \tag{11.7.20}$$


where

$$\zeta(z, q) = \sum_{n=0}^{\infty}\frac{1}{(q + n)^z}, \qquad \psi(z) = \frac{\Gamma'(z)}{\Gamma(z)}. \tag{11.7.21}$$


Let us now take the weak coupling limit (which is an expansion in $1/\beta\mu$).
Power expanding the $\zeta$ function in Eq. (11.7.20), we find

$$\rho(\mu_F) = \frac{1}{\pi\beta\mu}\,\mathrm{Im}\sum_{n=0}^{\infty}\frac{1}{1 - i(2n+1)/\beta\mu}$$

$$= \frac{1}{2\pi}\left[-\ln\mu + 2\sum_{k=1}^{\infty}(-1)^k(\beta\mu)^{-2k}\,(2^{2k-1} - 1)\,\zeta(1 - 2k)\right]$$

$$= \frac{1}{2\pi}\left[-\ln\mu + \sum_{m=1}^{\infty}(2^{2m-1} - 1)\frac{|B_{2m}|}{m(\beta\mu)^{2m}}\right], \tag{11.7.22}$$

where $B_{2m}$ are the Bernoulli numbers.
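
The agreement between the closed form of Eq. (11.7.20) and the Bernoulli series of Eq. (11.7.22) can be checked numerically at large $\beta\mu$. The sketch below is illustrative only: the helper names are invented here, and the digamma function is evaluated with a simple recurrence plus its standard asymptotic series rather than a library routine.

```python
import cmath
import math
from fractions import Fraction
from math import comb

def bernoulli(n_max):
    """Bernoulli numbers B_0..B_n_max (convention B_1 = -1/2) from the standard recurrence."""
    b = [Fraction(1)]
    for m in range(1, n_max + 1):
        b.append(-sum(comb(m + 1, k) * b[k] for k in range(m)) / (m + 1))
    return b

def digamma(z):
    """psi(z) for complex z with positive real part: shift upward, then use the asymptotic series."""
    z = complex(z)
    result = 0j
    while z.real < 15:
        result -= 1 / z          # psi(z) = psi(z + 1) - 1/z
        z += 1
    z2 = 1 / (z * z)
    tail = z2 * (1 / 12 - z2 * (1 / 120 - z2 * (1 / 252 - z2 * (1 / 240 - z2 / 132))))
    return result + cmath.log(z) - 1 / (2 * z) - tail

def rho_exact(beta, mu):
    """Density of states in the digamma form of Eq. (11.7.20)."""
    return (math.log(beta / 2) - digamma(complex(0.5, 0.5 * beta * mu)).real) / (2 * math.pi)

def rho_weak(beta, mu, terms):
    """Weak coupling series of Eq. (11.7.22), truncated after `terms` terms."""
    b = bernoulli(2 * terms)
    x = beta * mu
    series = sum((2 ** (2 * m - 1) - 1) * abs(b[2 * m]) / (m * x ** (2 * m))
                 for m in range(1, terms + 1))
    return (-math.log(mu) + float(series)) / (2 * math.pi)

if __name__ == "__main__":
    for terms in (1, 2, 3):
        print(terms, abs(rho_exact(10.0, 1.0) - rho_weak(10.0, 1.0, terms)))
```

At $\beta\mu = 10$ each additional term reduces the discrepancy, as expected of an asymptotic series used well inside its useful range; at small $\beta\mu$ the coefficients, which grow factorially through $|B_{2m}|$, make the same truncations useless.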

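Now, we invert. We integrate the expression $\partial g/\partial\mu = -\rho(\mu_F)$ in Eq.
(11.7.16) to find

$$\Delta = \frac{\mu}{2\pi}\left[-\ln\mu - \sum_{m=1}^{\infty}(2^{2m-1} - 1)\frac{|B_{2m}|}{m(2m-1)(\beta\mu)^{2m}}\right]. \tag{11.7.23}$$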

Inverting again to find the energy, we find [29]:

$$E_{gs} \sim \frac{1}{g_{st}^2}\left[1 + \sum_{n=1}^{\infty}\sum_{m=1}^{n} e_{n,m}\,g_{st}^{2n}\,(-\ln\Delta)^m\right], \tag{11.7.24}$$

where $e_{n,m}$ are constants, and the string coupling constant is

$$g_{st}^2 = \frac{\ln\Delta}{2\pi\beta^2\Delta^2}. \tag{11.7.25}$$

The important point is that we have logarithmic singularities appearing in the
series, which spoil the perturbation theory. In fact, no matter how small we
make $g_{st}^2$, we still find these singularities.

These logarithmic singularities probably reflect the presence of the massless 
mode in the theory (which becomes tachyonic for D > 1). This is due to the fact 
that we can always attach a tadpole with the massless particle to any Riemann 
surface. Thus, this is an infrared problem. It also means that the nice picture of 
summing over Riemann surfaces breaks down perturbatively. In addition, the 
perturbation theory also suffers from the fact that diagrams diverge as (2n)!, 
meaning that it is not Borel summable in the weak coupling limit. 

Now, let us analyze the strong coupling limit as a power expansion in $\beta\mu$,
where everything is well behaved. We power expand Eq. (11.7.19):

$$\rho(\mu_F) = -\frac{1}{2\pi}\ln\mu + \frac{1}{\pi}\,\mathrm{Re}\sum_{n=0}^{\infty}\frac{1}{(2n+1)(1 + i\beta\mu/(2n+1))}. \tag{11.7.26}$$

We repeat the same steps, inverting a series of equations. We find

$$\frac{\partial\Delta}{\partial\mu} = \rho(\mu_F) = -\frac{1}{2\pi}\ln\mu + \frac{1}{\pi}\sum_{k=1}^{\infty}(-1)^k\left[1 - 2^{-(2k+1)}\right]\zeta(2k+1)\,(\beta\mu)^{2k}. \tag{11.7.27}$$
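
The power series in Eq. (11.7.27) follows from expanding each term of the oscillator sum geometrically in $\beta\mu/(2n+1)$. As a hedged numerical sketch (the function names and truncation sizes are choices made here, not part of the text), one can compare the sum with its divergent $k = 0$ piece subtracted against the series with coefficients $[1 - 2^{-(2k+1)}]\zeta(2k+1)$, written as sums over odd integers:

```python
def lambda_odd(s, n_terms=20000):
    """Sum over odd integers of (2n+1)^(-s); equals (1 - 2^-s) * zeta(s) for s > 1."""
    return sum((2 * n + 1) ** (-s) for n in range(n_terms))

def subtracted_sum(x, n_terms=20000):
    """Re sum 1/(2n+1+ix) with the divergent sum of 1/(2n+1) removed term by term."""
    return sum((2 * n + 1) / ((2 * n + 1) ** 2 + x * x) - 1.0 / (2 * n + 1)
               for n in range(n_terms))

def strong_series(x, k_max=30, n_terms=20000):
    """Series sum_{k>=1} (-1)^k [1 - 2^-(2k+1)] zeta(2k+1) x^(2k), with x standing for beta*mu."""
    return sum((-1) ** k * lambda_odd(2 * k + 1, n_terms) * x ** (2 * k)
               for k in range(1, k_max + 1))

if __name__ == "__main__":
    x = 0.5
    print(subtracted_sum(x), strong_series(x))
```

Because the coefficients tend to 1 as $k$ grows, the series converges for $|\beta\mu| < 1$, in contrast to the factorially growing coefficients of the weak coupling expansion.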


Inverting, we find for $\mu$:
2n A 


M ~ 


In A 


1 + 


00 00 / R 2 A 2 \ n 

n —1 m=n +1 v 7 


(11.7.28) 


which gives us the final answer [29]: 


-gs 


i / 00 oo 


bg S 


n— 1 m—n -\-1 


4" ln " 


(11.7.29) 


where d nm and 6 nm are constants. In contrast to the weak coupling limit, we 
find that the strong coupling limit is well behaved. 

So far, we have been able to solve for the energy of the D = 1 theory because 
of a key observation, that the problem reduces to that of an inverted harmonic 
oscillator. We need to find a more comprehensive formalism, however, if we 
are to understand the physical origin of the curious logarithmic divergences 
and the non-Borel summability. The formalism that we will introduce is string 
field theory [33-35]. 

Certain features which seem obscure in the previous discussion have a natural
explanation from the point of view of string field theory. For example, the
ground state energies found earlier emerge directly as eigenvalues of the string
field Hamiltonian. Also, the curious logarithmic infinities found earlier, which
apparently ruin Borel summability, can be interpreted as the length of one of
the compactified dimensions. The mysterious Liouville mode also has a nice
reinterpretation in string field theory; it appears as the space of eigenvalues of
the matrix $M$, and combines with the single dimension of $D = 1$ string theory
to give an effective $D = 2$ theory. Furthermore, the fact that $D = 1$ is solvable,
which appears obscure in the previous section, appears almost obvious
because the string field theory action is integrable.

We saw earlier that string field theory enables us to describe all the states of
the string in a single field $\Psi$, whose decomposition yields all the states in the
Fock space, and whose products yield the interactions of the theory.

We will introduce string field theory via our earlier observation that the 
D = 1 theory reduces to a fermionic theory of uncoupled particles. We noticed 
that, if we diagonalize our matrix field M into eigenvalues via M = 
then the Hamiltonian reduces to 




where $\Pi_{ij}$ depends on the angular part of the $M$ matrix. Because this $\Pi_{ij}$ factor
is difficult to work with, we will only consider SU(N) singlet states in which
the angular part of the Hamiltonian is exactly zero, so that this term vanishes.
We can further reduce the system by introducing $\psi(\lambda_i) = \Delta(\lambda_i)\phi(\lambda_i)$, which
yields an antisymmetric, fermionic wave function.

In the singlet sector, the matrix model reduces to ordinary quantum mechanics
with $N$ noninteracting fermions moving in a potential $U(\lambda)$, with Planck's
constant $\hbar$ given by $1/\beta \sim 1/N$. In this new basis, the Hamiltonian reduces to

$$h = -\frac{1}{2\beta^2}\frac{d^2}{d\lambda^2} + U(\lambda). \tag{11.7.31}$$

Let us now introduce a fermionic string field $\Psi$, which in turn is a linear
superposition of an infinite number of states $\psi_i$ with energy $e_i$:

xp(X, t) = (11.7.32) 


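In this representation, we can easily rewrite our original Hamiltonian in
terms of this string field

$$H = \int d\lambda\left[\frac{1}{2\beta^2}\frac{\partial\Psi^\dagger}{\partial\lambda}\frac{\partial\Psi}{\partial\lambda} + U(\lambda)\Psi^\dagger\Psi - \mu_F(\Psi^\dagger\Psi - N)\right] \tag{11.7.33}$$

(we adjust the Lagrange multiplier $\mu_F$ to equal the Fermi level).

This Hamiltonian, in turn, can be derived from the following action:

$$S = \int dt\,d\lambda\left[i\Psi^\dagger\dot\Psi - \frac{1}{2\beta^2}\frac{\partial\Psi^\dagger}{\partial\lambda}\frac{\partial\Psi}{\partial\lambda} - U(\lambda)\Psi^\dagger\Psi + \mu_F(\Psi^\dagger\Psi - N)\right]. \tag{11.7.34}$$
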
This string field theory action contains all the information of the D = 1 matrix 
model for the SU(N) singlet sector. Notice that the action appears to be defined 
in two dimensions if we treat X and t as two space-time coordinates. This is 
how the Liouville mode appears in our formulation. 

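Notice that the previous action appeared to be nonrelativistic. However, with
some modification, we can rewrite this in a more relativistic fashion. Let us
define $\Psi_L$ and $\Psi_R$:

$$\Psi(\lambda, t) = \frac{e^{i\mu_F t}}{\sqrt{2v(\lambda)}}\left[\exp\left(-i\beta\int^{\lambda}d\lambda'\,v(\lambda') + i\pi/4\right)\Psi_L(\lambda, t) + \exp\left(i\beta\int^{\lambda}d\lambda'\,v(\lambda') - i\pi/4\right)\Psi_R(\lambda, t)\right], \tag{11.7.35}$$

where $v(\lambda)$ is the velocity of a classical particle at the Fermi level in the potential
$U(\lambda)$, that is, $v(\lambda) = d\lambda/d\tau = \sqrt{2(\mu_F - U(\lambda))}$.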

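In terms of these new variables, the Hamiltonian becomes

$$H = \int_0^{T/2}d\tau\left[i\Psi_R^\dagger\partial_\tau\Psi_R - i\Psi_L^\dagger\partial_\tau\Psi_L + \frac{1}{2\beta v^2}\left(\partial_\tau\Psi_L^\dagger\partial_\tau\Psi_L + \partial_\tau\Psi_R^\dagger\partial_\tau\Psi_R\right) + \frac{1}{4\beta}\left(\Psi_L^\dagger\Psi_L + \Psi_R^\dagger\Psi_R\right)\left(\frac{v''}{v^3} - \frac{5(v')^2}{2v^4}\right)\right]. \tag{11.7.36}$$

The equations of motion can be read off the action

$$i\gamma^\mu\partial_\mu\Psi = K\Psi, \tag{11.7.37}$$

where $\partial_\mu = (\partial_t, \partial_\tau)$ and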


K 


_ ’ 1 „ 1 (v" 5(u') 2 \l ni7 ,ov 

_K0 L r 2 Pv* T 4p \u 3 2u 4 /J (H-7.38) 


One advantage of string field theory is that we can see that the system is 
integrable. Since the field theory is based on free fermionic fields, we can 
construct an infinite number of conserved currents 


/ 'ul = +<*, 0YhT 


exp 


«■ 


Yi K(t, d z )dz' 


)] 


vF(r,0. (11.7.39) 


The existence of this infinite set of conserved currents, in turn, indicates that 
the system is integrable. 

In fact, it also indicates that the system is topological. In the next chapter,
we will see that the Green's functions found in matrix models are identical to
the Green's functions found for topological field theory.


11.8 Summary 

Matrix models provide the first nonperturbative information concerning string 
theory. They serve as a theoretical laboratory in which to test many of our 
ideas about string theory. However, there seems to be a qualitative problem in 
extending the beautiful results of matrix models beyond D = 1. 

Matrix models begin with string theory defined below the critical dimension,
where we must be careful to calculate the contribution of the functional
measure. After a scale transformation, both the string and the ghost parts
contribute to the Liouville action

$$S_L(\sigma, g) = \int d^2z\,\sqrt{g}\left(\tfrac12 g^{ab}\partial_a\sigma\,\partial_b\sigma + R\sigma + \mu e^{\sigma}\right), \tag{11.8.1}$$

which can be eliminated only in 26 dimensions.

By scaling arguments, KPZ showed how to calculate the string susceptibility.
One can show that the partition function diverges as

$$Z(A) \sim A^{-3+\gamma}\exp(\mu A), \tag{11.8.2}$$

where $\gamma$ is the string susceptibility and is given by

$$\gamma(h) = \tfrac{1}{12}(1 - h)\left(D - 25 - \sqrt{(25 - D)(1 - D)}\right) + 2, \tag{11.8.3}$$

where h is the genus number. One of the early successes of matrix models was 
their ability to reproduce this result for the susceptibility. 
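
As a quick illustration of the susceptibility formula, the short sketch below evaluates it for a few dimensions. The function name is a choice made here, and a complex square root is used so that the range $1 < D < 25$, where the expression stops being real, is visible rather than raising an error.

```python
import cmath

def string_susceptibility(D, h=0):
    """gamma(h) = (1/12)(1 - h)(D - 25 - sqrt((25 - D)(1 - D))) + 2, returned as a complex number."""
    root = cmath.sqrt((25 - D) * (1 - D))
    return (1 - h) * (D - 25 - root) / 12 + 2

if __name__ == "__main__":
    for D in (-2, 0, 1, 2):
        print(D, string_susceptibility(D))
```

On the sphere ($h = 0$) this gives $\gamma = -1/2$ for $D = 0$ (pure gravity) and $\gamma = 0$ for $D = 1$, while for $D = 2$ the result acquires an imaginary part, which is one way of seeing the difficulty in going beyond $D = 1$. On the torus ($h = 1$) the result is $\gamma = 2$ for any $D$.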

Matrix models are based on the old assumption that Feynman graphs for the
matrix model, when viewed as a power expansion in $1/N^2$, become planar to
lowest order. Hence, we can view them as approximations to two-dimensional
gravity. For higher powers of $1/N^2$, the power expansion becomes identical to
a power expansion in the genus of the two-dimensional surface.

We will take the action

$$L = \sum_{\mu=1}^{D}\mathrm{Tr}(\partial_\mu M\,\partial^\mu M^\dagger) + \mathrm{Tr}(MM^\dagger) + \frac{g}{N}\,\mathrm{Tr}(MM^\dagger MM^\dagger). \tag{11.8.4}$$

More generally, we can have the interaction

$$U = \sum_{i=3} v_i = g_3\,\mathrm{Tr}\,M^3 + g_4\,\mathrm{Tr}\,M^4 + \cdots. \tag{11.8.5}$$

The generating function is given by

$$Z(\beta) = \int D\Phi\,\exp[-\beta\,\mathrm{Tr}\,U(\Phi)] \tag{11.8.6}$$

with potential $U$. We are interested in the double scaling limit

$$N \to \infty, \qquad \beta/N \to \text{const.}, \tag{11.8.7}$$

where we will find universality, that is, most of the particular properties of the
potential are washed out.



There is an old trick that allows us to solve most of these models exactly,
and that is to decompose $M$ in terms of its eigenvalues and its angular part.
The angular part, in fact, decouples, leaving us with the measure

$$\prod_{ij} dM_{ij} = \prod_i d\lambda_i \prod_{i<j}(\lambda_i - \lambda_j)^2\,d\Omega_{ij}, \tag{11.8.8}$$

where $d\Omega$ can be trivially integrated over for our case.

Next, we evaluate the integration over the eigenvalues $\lambda$ using the method
of orthogonal polynomials, which are defined via

$$\int d\lambda\,\exp[-\beta U(\lambda)]\,P_n(\lambda)P_m(\lambda) = \delta_{n,m}h_n. \tag{11.8.9}$$

It is possible to construct these polynomials so that they satisfy a recursion
relation for any potential $U(\lambda)$:

$$\lambda P_n(\lambda) = P_{n+1}(\lambda) + S_nP_n(\lambda) + R_nP_{n-1}(\lambda). \tag{11.8.10}$$

Using the recursion relation, one can also show

$$h_{n+1} = \int d\lambda\,\exp[-\beta U(\lambda)]\,P_{n+1}\lambda P_n$$

$$= \int d\lambda\,\exp[-\beta U(\lambda)]\,(P_{n+2} + S_{n+1}P_{n+1} + R_{n+1}P_n)P_n$$

$$= R_{n+1}h_n. \tag{11.8.11}$$

The point of using these orthogonal polynomials is that we can write the
partition function simply in terms of them

$$Z_N = N!\prod_{i=0}^{N-1}h_i. \tag{11.8.12}$$
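
The orthogonal polynomial relations above can be made concrete for the simplest case. The sketch below assumes a Gaussian potential $U(\lambda) = \lambda^2/2$ with $\beta = 1$ and a weight normalized by $\sqrt{2\pi}$ (so $Z_N$ is measured in units of $(2\pi)^{N/2}$); these choices, and all function names, are illustrative and not taken from the text. Exact rational arithmetic is used so the identities can be checked without rounding error.

```python
from fractions import Fraction
from math import factorial, prod

def gaussian_moment(k):
    """k-th moment of exp(-x^2/2)/sqrt(2 pi): (k-1)!! for even k, 0 for odd k."""
    if k % 2:
        return Fraction(0)
    result = Fraction(1)
    for j in range(1, k, 2):
        result *= j
    return result

def inner(p, q):
    """Inner product of two polynomials (coefficient lists, lowest power first)."""
    return sum(a * b * gaussian_moment(i + j)
               for i, a in enumerate(p) for j, b in enumerate(q))

def monic_orthogonal(n_max):
    """Monic orthogonal polynomials P_0..P_n_max by Gram-Schmidt on 1, x, x^2, ..."""
    polys = []
    for n in range(n_max + 1):
        mono = [Fraction(0)] * n + [Fraction(1)]
        p = list(mono)
        for q in polys:
            c = inner(mono, q) / inner(q, q)
            for i, a in enumerate(q):
                p[i] -= c * a
        polys.append(p)
    return polys

polys = monic_orthogonal(6)
h = [inner(p, p) for p in polys]
R = [h[n] / h[n - 1] for n in range(1, len(h))]   # R[n-1] holds R_n = h_n / h_{n-1}

def recursion_residual(n):
    """Coefficients of x P_n - P_{n+1} - S_n P_n - R_n P_{n-1}; all zero if the recursion holds."""
    shifted = [Fraction(0)] + polys[n]
    s_n = inner(shifted, polys[n]) / h[n]
    res = list(shifted)
    for i, a in enumerate(polys[n + 1]):
        res[i] -= a
    for i, a in enumerate(polys[n]):
        res[i] -= s_n * a
    for i, a in enumerate(polys[n - 1]):
        res[i] -= R[n - 1] * a
    return res

def partition_function(N):
    """Z_N = N! * prod_{i<N} h_i, in units of (2 pi)^(N/2)."""
    return factorial(N) * prod(h[:N])

if __name__ == "__main__":
    print([int(r) for r in R])
    print(partition_function(3))
```

For this weight the polynomials are the monic Hermite polynomials, so $h_n = n!$ and $R_n = n$; a nontrivial potential changes the moments but not the structure of the check.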

Using these recursion relations, we arrive at the equation

$$\frac{n}{\beta} = 2R_n\sum_{p=0}(p + 1)\,g_{p+1}\sum_{\text{paths}}R_{\alpha_1}\cdots R_{\alpha_p}, \tag{11.8.13}$$

where the sum over paths is rather complicated. To lowest order, we have

$$t = f^2 + f'', \tag{11.8.14}$$

where $f$ is the scaling function found in the free energy. This solution is a
Painlevé transcendent of the first kind.

It is tedious, however, to sum over paths as we go to higher and higher
levels. It is much more convenient to convert to an operator language and treat
the problem from a different perspective. As an operator expression, our basic
recursion relations, in terms of the potential $U$, are

$$\langle n|U'(\lambda)|n\rangle = 0, \qquad \langle n-1|U'(\lambda)|n\rangle = \frac{n}{\beta\sqrt{R_n}}. \tag{11.8.15}$$





We will introduce the KdV hierarchy through the operator 

K[m, v,] = -~ + m + 2-/(ov, 


and 




$ + H 




(11.8.16) 

(11.8.17) 


Then, we can show that the matrix element of the inverse of the Hamiltonian 
can be given as 




W) 

io £ z+1/2 ’ 


Ri(f) = { - V,]}' • I. 


(11.8.18) 


We now have all the tools necessary to solve for $t$. By inserting the expression
for the resolvent $R(t, t; \xi)$ into the expression for $t$, we find that Eq. (11.8.15)
reduces to


' = WW, v ,]|‘■ !. (11.8.19) 

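This is the final answer, which includes the earlier solution. The lowest order
solutions include:

$$k = 1: \quad t = f,$$

$$k = 2: \quad t = f^2 - \tfrac13 f'',$$

$$k = 3: \quad t = f^3 - ff'' - \tfrac12(f')^2 + \tfrac{1}{10}f^{(4)},$$

$$k = 4: \quad t = f^4 - 2f(f')^2 - 2f^2f'' + \tfrac35(f'')^2 + \tfrac45 f'f''' + \tfrac25 ff^{(4)} - \tfrac{1}{35}f^{(6)}. \tag{11.8.20}$$
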


By comparing our results with the KPZ equation, we can find which
two-dimensional theory the matrix model is approximating. For $k = 2$, we have
the case of pure two-dimensional gravity (i.e., zero-dimensional strings). For
$k = 3$, although initially suspected to be the coupling of two-dimensional
gravity with the unitary minimal series, it is now known to approximate
two-dimensional gravity coupled to nonunitary conformal matter.

To see how to get two-dimensional gravity coupled to the unitary minimal
series and other forms of $(p, q)$ conformal matter, we must now generalize
our approach to the multimatrix approach, where we introduce several types
of matrices $M_i$ obeying

$$Z = \ln\int\prod_{i=1}^{q-1}dM_i\,\exp\left[-\mathrm{Tr}\left(\sum_{i=1}^{q-1}V_i(M_i) - \sum_{i=1}^{q-2}c_iM_iM_{i+1}\right)\right]. \tag{11.8.21}$$

These equations can be solved via the theory of pseudodifferential
operators.


Define $Q_i$ and $P_i$ to be operators that represent insertions of $\lambda_i$ and $d/d\lambda_i$,
respectively, into the functional integral. Although these operators are actually
quite complicated, they must obey the simple relation

$$[P_i, Q_i] = 1 \tag{11.8.22}$$

because $\lambda$ and $d/d\lambda$ have the same commutation relations.

A solution to these equations, in terms of $Q_i$, can be found once one
introduces the definition of $Q^{1/n}$ via pseudodifferential operators. We find the
solution

$$[Q_+^{p/q}, Q] = 1, \tag{11.8.23}$$

where the plus (+) subscript means we take only the nonnegative powers of $d$
in the expansion. A minus (−) subscript means taking negative powers of $d$.

By solving these equations, we can reproduce the one-matrix result. For
higher matrix models, we find the coupling of two-dimensional gravity to the
minimal unitary series, as well as $(p, q)$ conformal matter. Our goal, however,
is to model string interactions in higher dimensions. Let us analyze the $D = 1$
case, which may be the limit for matrix models.

We begin with the density function for states in the matrix model

$$\rho(e) = \frac{1}{\beta}\sum_n\delta(e_n - e). \tag{11.8.24}$$

Then, the ground state energy $E_{gs}$ can be written as

$$g = \frac{N}{\beta} = \int_0^{\mu_F}\rho(e)\,de, \qquad E_{gs} = \beta^2\int_0^{\mu_F}\rho(e)\,e\,de, \tag{11.8.25}$$

where $\mu_F$ is the highest energy level, the Fermi level, and

$$\rho(\mu_F) = (1/2\pi)\,\mathrm{Re}\{\zeta[1, \tfrac12(1 + i\beta\mu)]\} = (1/2\pi)\{\ln(\beta/2) - \mathrm{Re}\,\psi[(1 + i\beta\mu)/2]\}. \tag{11.8.26}$$

Perturbing in the weak coupling limit (expanding in $1/\beta\mu$), we find that the
energy is not well behaved

$$E_{gs} \sim \frac{1}{g_{st}^2}\left[1 + \sum_{n=1}^{\infty}\sum_{m=1}^{n} e_{n,m}\,g_{st}^{2n}\,(-\ln\Delta)^m\right], \tag{11.8.27}$$

that is, the theory behaves badly, and is not Borel summable. In the strong
coupling limit (expanding in $\beta\mu$), we find perfectly acceptable results for
the energy


F — 
^gs — 


4 


>+££ 


n= 1 m=n+\ 


4" ln " 


(11.8.28) 


This seems to confirm our earlier conjecture about the non-Borel summability 
of the perturbation theory. 




References 


1. V. Kazakov, Phys. Lett. 60B, 2105 (1988). 

2. V. Kazakov, I. Kostov, and A. Migdal, Phys. Lett. 157B, 295 (1985). 

3. F. David, Nucl. Phys. B257 [ FS14], 45, 543 (1985). 

4. J. Ambjørn, B. Durhuus, and J. Fröhlich, Nucl. Phys. B257 [FS14], 433 (1985).

5. J. Ambjørn, B. Durhuus, J. Fröhlich, and P. Orland, Nucl. Phys. B270
[FS16], 457 (1986).

6. J. Jurkiewicz, A. Krzywicki, and B. Petersson, Phys. Lett. 168B, 273 (1986).

7. A. Billoire and F. David, Phys. Lett. 186B, 279 (1986). 

8. D. V. Boulatov, V. A. Kazakov, I. K. Kostov, and A. A. Migdal, Nucl. Phys. B275, 
641 (1986). 

9. I. Kostov and M. Mehta, Phys. Lett. 189B, 247 (1987).

10. V. Kazakov and A. Migdal, Nucl. Phys. B311, 171 (1989). 

11. B. Sakita and M.A. Virasoro, Phys. Rev. Lett. 24, 1146 (1970); H.B. Nielsen and 
P. Olesen, Phys. Lett. 32B, 203 (1970). 

12. G. 't Hooft, Nucl. Phys. B72, 461 (1974).

13. A. M. Polyakov, Mod. Phys. Lett. A2, 899 (1987). 

14. V. G. Knizhnik, A. M. Polyakov, and A. A. Zamolodchikov, Mod. Phys. Lett. 
A3, 819 (1988). 

15. F. David, Mod. Phys. Lett. A3, 207 (1988). 

16. J. Distler and H. Kawai, Nucl. Phys. B321, 509 (1989); see also: J. L. Gervais
and A. Neveu, Nucl. Phys. B238, 125 (1984). 

17. E. Brezin, C. Itzykson, G. Parisi, and J.-B. Zuber, Comm. Math. Phys. 59, 35 
(1978). 

18. D. Bessis, C. Itzykson, and J.-B. Zuber, Adv. Appl. Math. 1, 109 (1980). 

19. D. Bessis, Comm. Math. Phys. 69, 147 (1979). 

20. C. Itzykson and J. B. Zuber, J. Math. Phys. 21,411 (1980). 

21. E. Brezin and V. A. Kazakov, Phys. Lett. 236B, 144 (1989). 

22. D. Gross and A. Migdal, Phys. Rev. Lett. 64, 127 (1990); Princeton preprint
(1989). 

23. M. Douglas and S. H. Shenker, Nucl. Phys. B335, 635 (1990). 

24. T. Banks, M. R. Douglas, N. Seiberg, and S. H. Shenker, Phys. Lett. 238B, 279 
(1989). 

25. I. Gelfand and L. Dikii, Uspekhi Mat. Nauk 30, 5 (1975).

26. M. R. Douglas, Phys. Lett. 238B, 176 (1989). 

27. P. Ginsparg, M. Goulian, M. R. Plesser, and J. Zinn-Justin, HUTP-90/AO15 
(1990). 

28. M. Kreuzer and R. Schimmrigk, Santa Barbara preprint NSF-ITP-90-30 (1990); 
H. Kunitomo and S. Odake, University of Tokyo preprint UT-558 (1990). 

29. D. Gross and N. Miljkovic, Princeton preprint PUPT-1160 (1990). 

30. P. Ginsparg and J. Zinn-Justin, Harvard preprint PUPT-1160 (1990).

31. G. Parisi, Rome preprint ROM2F-90/2 (1990). 

32. E. Brezin, V. Kazakov, and Al. Zamolodchikov, Ecole Normale preprint LPS- 
ENS-89-182 (1989). 

33. S. R. Das and A. Jevicki, Brown-Het-750 (1990). 

34. J. Polchinski, UTTG-15-90 (1990). 

35. D. J. Gross and I. R. Klebanov, PUPT-1198 (1990). 



CHAPTER 12 


Topological Field Theory 


12.1 Unbroken Phase of String Theory 

The fundamental problem facing string theory at present is our inability to 
select its true vacuum nonperturbatively. Until the true string vacuum can be 
discovered, it is impossible to determine whether the theory predicts nonsense, 
and must be discarded as yet another failed attempt at a unified field theory, or 
gives a valid description of our universe and a unification of all known quan¬ 
tum forces. The frustration is that string theory has been evolving backward, 
ever since its accidental discovery in 1968 by Veneziano and Suzuki, so its 
underlying geometry is totally unknown. 

By contrast, the “natural home” for Yang-Mills theory and the general theory 
of relativity are well known. Their “natural home” lies in the realm of unbroken 
local SU(N) symmetry or general covariance. Even if we study these theories 
in a domain where all their symmetries have been severely broken, we know that 
the near-miraculous properties that persist in the broken theory arise because 
of its underlying geometry. Likewise, perhaps the key to understanding the 
underlying geometry of string theory is to understand its natural home. 

There is a useful analogy that illustrates this problem. Hypothetically, one
can ask what might have happened to the evolution of physics if
Einstein had not discovered the work of Riemann and written the general theory of
relativity in 1915. The most pessimistic scenario would be that relativity would
not have been discovered until decades later, in the 1950s, as field theorists
began a systematic search for higher spin field equations. The successes of
spin-0 meson theories, spin-$\tfrac12$ Dirac theories, and spin-1 Maxwell theories
might have led to the study of purely hypothetical spin-2 systems in flat space.

As Feynman independently discovered, gauge invariance is necessary to
kill the ghosts of a spin-2 particle and maintain unitarity, but this necessarily
complicates the search for the action. A simple cubic action for gravitons can
be used to construct four-point scattering amplitudes, but these fail to maintain
gauge invariance. A fundamental four-point contact term is necessary. But
then, the five-point scattering amplitude fails to be gauge invariant, requiring
the addition of a fundamental five-point interaction, and so on. The final result
is a nontrivial nonpolynomial action.

However, it might be discovered that this ugly and contrived nonpolynomial 
action possessed mysterious, near-miraculous properties. After a long and dif¬ 
ficult calculation, it might be recognized that the theory was independent of 
the classical background metric. Thus, a hunt might begin to find the natu¬ 
ral home for the nonpolynomial theory. However, even though the action was 
completely known as a power expansion, it might be a leap of logic to postulate 
that general covariance was the actual origin of these miracles. 

Likewise, we are still searching for the natural home for string theory. How¬ 
ever, there are strong indications that the natural home for string theory does 
not lie in the low-energy realm of perturbation theory around conformal field 
theories. 

There are several indications that surprises await us at high energies and
high temperatures. First, the high-energy behavior of the multiloop amplitudes
shows that the perturbation theory is not Borel summable, with the genus $g$
amplitudes growing as $g!$. Also, there is a strong indication that a new
“symmetry” of some type is appearing at high energies beyond the Planck length.
Second, the high-temperature behavior of multiloop string amplitudes shows
that there may be a first-order phase transition occurring near the Hagedorn
temperature.

These are indirect indications that, in analogy with ordinary gauge theories, 
an “unbroken phase” of string theory may be opening up at high energies and 
high temperatures. 

In some sense, the natural home of string theory is not the perturbation 
theory based on Riemann surfaces at all, but an entirely new domain. This is 
crucially important for the ultimate isolation of the true vacuum of the theory. 

At first, this may sound confusing, because for the past 75 years physicists
have studied Einstein's general theory of relativity, where we make the
important approximation

$$g_{\mu\nu} = g^{(0)}_{\mu\nu} + \kappa h_{\mu\nu}, \tag{12.1.1}$$

where $g^{(0)}_{\mu\nu}$ is a solution to the classical equations of motion, usually taken to
be the Minkowski metric of flat space. However, this approximation breaks
local general covariance explicitly, leaving only global Poincaré covariance in
the action. Thus, most quantum mechanical approaches to general relativity
inherently break general covariance.

What would a theory look like in which general covariance was not broken,
where we did not power expand around some background metric? Such a
theory would look strange indeed. In a scheme where $g_{\mu\nu}$ is power expanded
around zero (without ever referring to the Minkowski metric $\delta_{\mu\nu}$), there is no
light cone, no propagation of waves, no meter sticks, and no motion. In other
words, physics as we know it apparently ceases to exist in a quantum theory
where local general covariance is preserved exactly, in this “unbroken phase”
of general relativity!

An important step in probing the “unbroken phase” of string theory was the 
development of topological field theory. Witten originally created topological 
field theory [1] as an attempt to use the sigma model as a tool to construct 
topological invariants for manifolds, using the input of physics to solve prob¬ 
lems in pure topology. Specifically, quantum field theory was used to correct 
certain weaknesses in Morse theory. 

Given the success of the sigma model as a new mathematical tool, Floer
[2-3] was then able to generalize Witten's formulation to include the
Chern-Simons Yang-Mills theory in three dimensions. This, in turn, gave rise to a
powerful formalism by which to construct new topological invariants in three
dimensions.

Independently, working in four dimensions, Donaldson [4] startled the world
of mathematics by creating new topological invariants in four dimensions using
the input of physics: exploiting the instanton solutions of four-dimensional
Yang-Mills theory. It had been known for decades that manifolds in $D = 3, 4$
behaved in qualitatively different ways than in higher dimensions $D \geq 5$.
For example, in Smale's celebrated proof, the Poincaré conjecture could be
demonstrated for $D \geq 5$, but attempts to understand the nature of the Poincaré
conjecture for lower dimensions met with frustration. Thus, Donaldson's use
of Yang-Mills theory to settle a long-standing problem in mathematics created
quite a sensation in the world of mathematics. His instanton formulation not
only showed that the Poincaré conjecture failed in four dimensions and that
“exotic four-spheres” existed, but his new polynomial invariants also allowed one to
distinguish between new classes of four-manifolds.

At about the same time, Jones [5] was able to write new polynomial invari¬ 
ants for knots, making the first significant advance in knot theory in decades. 
Topology in lower dimensions was thus experiencing a renaissance, but these 
advances were occurring in a variety of scattered, unrelated directions, without 
any unifying theme or picture. 

Given this renewed interest in topological invariants, Atiyah [6] then asked 
several questions: Could quantum field theory be used to give a unifying 
approach to all these disparate results? In particular, could a quantum field 
theory be found in four dimensions to explain the new topological invariants 
of Donaldson that generalizes the three-dimensional theory of Floer? Also, 
could a quantum field theory be found in three dimensions that generates the 
polynomial invariants of Jones? 

If so, then quantum field theory would also solve a nagging defect in these 
purely topological formulations, that is, the inability to solve for explicit, 
analytic forms for the various topological invariants. Quantum field theory, 
however, might give a specific algorithm by which to calculate the numerical 
value of these invariants and perhaps generate new classes of them. 





Witten showed that the answer to all these questions was “yes” [7, 8]. In 
fact, the theories of Jones, Floer, and Donaldson gave rise to two classes of 
topological field theories: 

(1) metric-free topological models, where the theory is manifestly free of any 
metric dependence; and 

(2) cohomological topological field theories, where a background field may 
be present but the energy-momentum tensor is BRST trivial. 

We now turn to a discussion of these two approaches. 


12.2 Topology and Morse Theory 

Over the decades, it has become common knowledge that generally covariant 
theories can be created by introducing a metric tensor and then functionally 
integrating over all possible metric tensors in the functional measure. This is 
the way general covariance is implemented in relativity. However, this neglects 
important classes of theories in which general covariance is implemented in 
an entirely different fashion, such as theories that lack a metric tensor entirely 
or theories with a metric in which the Green’s functions are independent of the 
metric. 

These theories are called topological field theories, and have novel properties 
that separate them from all other quantum field theories. Because the Green’s 
functions are independent of the choice of metric, it means that they must be 
purely topological, generating numerical values of topological invariants for 
certain manifolds. But, this also means that they might possess a finite number 
of degrees of freedom, unlike the infinite number of degrees of freedom found 
even for the simplest point-particle quantum field theory. 

The first class of topological field theories, which are manifestly metric free,
includes the three-dimensional Chern-Simons theory found in Chapter 8 in our
discussion of knot theory, where the Lagrangian is given by

$$L = \frac{k}{8\pi}\int\epsilon^{ijk}\,\mathrm{Tr}\left[A_i(\partial_jA_k - \partial_kA_j) + \tfrac23 A_i[A_j, A_k]\right]. \tag{12.2.1}$$

The action is manifestly locally generally covariant (because $\epsilon^{ijk}$ transforms
as a density under coordinate transformations), yet it contains no metric
whatsoever to tell us how to define the light cone of Minkowski space. Therefore,
as we have seen in Chapter 8, the correlation functions are given by Wilson
loops. By numerically evaluating these correlation functions, we generate new
classes of knot polynomials, generalizing the Jones polynomials.

Another example of a metric-free topological model is 2 + 1 gravity [9],
whose action is given by

$$L = \epsilon_{abc}\,\epsilon^{ijk}\,e^a_i R^{bc}_{jk}(\omega), \qquad R^{bc}_{jk} = \partial_j\omega^{bc}_k + \omega^{be}_j\,\omega_{ke}{}^{c} - (j \leftrightarrow k). \tag{12.2.2}$$





This model is usually thought to be both trivial and nonrenormalizable. How¬ 
ever, upon closer inspection, we see that these two attributes are actually 
mutually exclusive. 

In fact, upon closer examination, one can also show that (2+1)-dimensional
gravity, reinterpreted in this fashion, is equivalent to Chern-Simons 2+1
Yang-Mills theory [9]. The theory is hence exactly solvable. The correlation functions
of the theory, not surprisingly, are knot invariants, which are topological and
do not depend upon the metric of space-time.

The second type of topological field theory, in which a metric explicitly 
appears but where the Green’s functions are independent of the metric, is much 
more complicated but also much richer in mathematical content. To understand 
cohomological topological field theories, it is first important to understand how 
they evolved out of an attempt to use quantum field theory to solve problems 
in pure topology and Morse theory. 

To understand the significance of applying quantum field theory to Morse
theory, let us quickly (and not very rigorously) review some of the highlights of
de Rham cohomology, which is how topological invariants can be constructed
for real manifolds. Usually, when analyzing the topological invariants of a real
manifold $M$, we traditionally begin with de Rham cohomology, rather than
Morse theory. On the manifold, we define $p$ forms as follows:

$$\omega = \omega_{\mu_1\mu_2\cdots\mu_p}\,dx^{\mu_1}\wedge dx^{\mu_2}\wedge\cdots\wedge dx^{\mu_p}. \tag{12.2.3}$$

The operator $d$ acting on this form is defined as follows:

$$d\omega = \frac{\partial\omega_{\mu_1\mu_2\cdots\mu_p}}{\partial x^j}\,dx^j\wedge dx^{\mu_1}\wedge dx^{\mu_2}\wedge\cdots\wedge dx^{\mu_p}, \tag{12.2.4}$$

where $d$ is nilpotent, that is, $d^2 = 0$.

We then define topological invariants on the manifold. The space of
independent forms that are annihilated by $d$ is denoted by $\ker d$. However, we wish
to subtract those forms that are themselves expressed as $d\tilde\omega$ for some $\tilde\omega$, that
is, we wish to remove those states that are the image of $d$. Thus, we define the
$n$th cohomology as

$$H^n = \ker d/\mathrm{im}\,d \tag{12.2.5}$$

for $n$ forms.

The dimension of the de Rham cohomology is given by the Betti numbers

$$b_n = \dim H^n. \tag{12.2.6}$$

Notice that the de Rham cohomology depends on the local, differential
properties of the manifold. On this smooth manifold, we can also define a homology,
which depends on the global properties of the manifold. Let us define $\partial$ to be
the boundary operator, that is, it maps a manifold $M$ into its boundary
manifold $\partial M$. Then, we can also show that the boundary operator is also nilpotent:
$\partial^2 = 0$.



390 12. Topological Field Theory 


We can similarly define a homology on the manifold as follows: 

H n = ker d/im d. (12.2.7) 

The correspondence between cohomology and homology is made by showing
that the boundary operator $\partial$ is dual to the operator $d$. This is done via
Stokes' theorem

$$\int_M d\omega = \int_{\partial M} \omega. \tag{12.2.8}$$


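To make this correspondence transparent, let us define the “scalar product”
between a manifold $M$ and a form $\omega$:

$$\langle M | \omega \rangle = \int_M \omega. \tag{12.2.9}$$

Then, Stokes' theorem can be rewritten as

$$\langle M | d\omega \rangle = \langle \partial M | \omega \rangle, \tag{12.2.10}$$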

that is, by moving the $d$ operator from the form to the manifold slot of
the scalar product, it has become $\partial$. Thus, under very general conditions, these
two operators can be shown to be dual to each other. We call $d$ the coboundary
operator. This, in turn, means that the two spaces, homology and cohomology,
are dual to each other and have the same dimension.

Thus, the Betti number can be written as 

$$b_n = \dim H^n = \dim H_n. \tag{12.2.11}$$

From this, we can write the Euler characteristic, a topological invariant, 

$$\chi(M) = \sum_{q=0}^{\dim M} (-1)^q\, b_q(M). \tag{12.2.12}$$
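The homology side of this construction can be made concrete on a triangulated surface. The following is a minimal sketch, assuming a simplicial model (the hollow tetrahedron, a triangulation of the two-sphere) and real coefficients; the helper names `boundary_matrix` and `betti_numbers` are illustrative and do not come from the text. It builds the boundary operator as a matrix, uses $b_k = \dim\ker\partial_k - \operatorname{rank}\partial_{k+1}$, and then forms the alternating sum of Eq. (12.2.12).

```python
from itertools import combinations

import numpy as np


def boundary_matrix(k_simplices, km1_simplices):
    """Matrix of the boundary operator from k-simplices to (k-1)-simplices."""
    index = {s: i for i, s in enumerate(km1_simplices)}
    mat = np.zeros((len(km1_simplices), len(k_simplices)))
    for col, simplex in enumerate(k_simplices):
        for drop in range(len(simplex)):
            face = simplex[:drop] + simplex[drop + 1:]
            mat[index[face], col] = (-1) ** drop  # alternating sign of each face
    return mat


def betti_numbers(simplices_by_dim):
    """b_k = (number of k-simplices) - rank(boundary_k) - rank(boundary_{k+1})."""
    top = len(simplices_by_dim) - 1
    ranks = [0] * (top + 2)  # boundary_0 and boundary_{top+1} are zero maps
    for k in range(1, top + 1):
        d_k = boundary_matrix(simplices_by_dim[k], simplices_by_dim[k - 1])
        ranks[k] = int(np.linalg.matrix_rank(d_k))
    return [len(simplices_by_dim[k]) - ranks[k] - ranks[k + 1]
            for k in range(top + 1)]


# Hollow tetrahedron: 4 vertices, 6 edges, 4 triangular faces (assumed example).
vertices = [(v,) for v in range(4)]
edges = list(combinations(range(4), 2))
faces = list(combinations(range(4), 3))

betti = betti_numbers([vertices, edges, faces])
euler = sum((-1) ** q * b for q, b in enumerate(betti))
print(betti, euler)
```

For the sphere this gives one connected component, no independent one-cycles, and one two-cycle, so the alternating sum is 2.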

Last, one can also define the Laplacian $dd^* + d^*d$. A form $\omega$ is called
harmonic if it satisfies

$$(dd^* + d^*d)\,\omega = 0. \tag{12.2.13}$$

Then, it can be shown that the Betti number $b_q$ is also equal to the number of
independent harmonic $q$ forms that one can write on the manifold. In summary,
the de Rham theory allows us to write topological invariants of manifolds based
upon examining either the local or global properties of the manifold via the
kernel of nilpotent operators $d$ or $\partial$.

However, there is also another, less powerful, method by which to analyze
the topological invariants of a manifold, and this is Morse theory, which
differs markedly from de Rham theory. Morse theory is not based on nilpotent
operators, but on analyzing the critical points of a certain function defined on
a manifold.

To be a little more precise, let $h$ be a function defined on the manifold, that
is, a mapping of $M$ into the real numbers. Let $P_i$ be the critical points of this
function, that is,


$$\partial h(P_i)/\partial x^k = 0 \tag{12.2.14}$$

for $k = 1, 2, \ldots, \dim M$. Let us now define the Hessian as the matrix

$$\partial^2 h(x)/\partial x^i\, \partial x^j. \tag{12.2.15}$$

We define the Morse index $\mu(P_i)$ at the critical point $P_i$ as the number of
negative eigenvalues of the Hessian of $h$ at that point. We then define $M_p$ as
the number of critical points with the Morse index $p$. Then, one of the results
of Morse theory is the inequality relating $M_p$ of Morse theory and $b_p$ of
de Rham theory

$$M_p \geq b_p. \tag{12.2.16}$$

In terms of Morse theory, the Euler characteristic can be shown to equal 

$$\chi(M) = \sum_{q=0}^{\dim M} (-1)^q\, M_q. \tag{12.2.17}$$
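The definitions above can be checked numerically on a simple case. The following is a minimal sketch, assuming the function $h(x, y) = \cos x + \cos y$ on a flat torus with periodic coordinates $x, y \in [0, 2\pi)$; this choice of manifold and function is an illustration, not an example from the text. The critical points are where $\sin x = \sin y = 0$, and the Morse index is read off from the signs of the Hessian eigenvalues.

```python
import numpy as np


def hessian(x, y):
    # Hessian of h(x, y) = cos x + cos y (an assumed Morse function on the torus)
    return np.array([[-np.cos(x), 0.0],
                     [0.0, -np.cos(y)]])


def morse_index(point):
    # Number of negative eigenvalues of the Hessian at the critical point
    eigenvalues = np.linalg.eigvalsh(hessian(*point))
    return int(np.sum(eigenvalues < 0))


# Critical points of h: sin x = 0 and sin y = 0
critical_points = [(0.0, 0.0), (0.0, np.pi), (np.pi, 0.0), (np.pi, np.pi)]

morse_counts = [0, 0, 0]  # morse_counts[p] = M_p
for p in critical_points:
    morse_counts[morse_index(p)] += 1

euler = sum((-1) ** q * m for q, m in enumerate(morse_counts))
print(morse_counts, euler)
```

Here the counts $M_q$ come out as one minimum, two saddles, and one maximum, which coincide with the Betti numbers of the torus, and the alternating sum of Eq. (12.2.17) vanishes.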

We see, therefore, that Morse theory differs qualitatively from de Rham theory.
Morse theory depends on the properties of a manifold at its critical points, rather
than the cohomology or homology of nilpotent boundary operators.

Unfortunately, Morse theory is not powerful enough to calculate the Betti
numbers in terms of operators defined on Morse theory. Morse theory is weaker
than de Rham theory, yielding mainly inequalities. Let us illustrate this new
approach with an example [10].

Example: Gravitational Potential 

The simplest example of Morse theory is a two-manifold M placed in the earth’s 
gravitational field. We then choose h(x) to be the height of the point x (i.e., 
the gravitational potential). At every point $x$ on the surface of the manifold,
we can assign a real number $h(x)$, its height off the ground.

For example, imagine a torus with two handles standing upright, so that it
looks like a figure eight. This surface has six critical points, at which the
derivative of the height function is zero. Each loop has two critical points, and
there is a critical point at the very top of the figure eight and one at the very
bottom. By taking the second derivative of the height function at these critical
points, we obtain the Hessian and can then calculate the number of negative
eigenvalues at each critical point. Physically, this corresponds to finding the
points on the surface where a marble could be placed in stable or unstable
equilibrium.

The lowest critical point at the bottom is stable and has Morse index 0. The
number of critical points with index 0 is 1, so $M_0 = 1$. The highest point on
the surface is unstable and has Morse index 2. The number of critical points
with index 2 is also 1, so $M_2 = 1$. The other four critical points, located in the
holes, are saddle points, with one stable and one unstable direction, so they
have Morse index 1. Since there are four such saddle points, $M_1 = 4$. Putting
these all together, we then find that the Euler characteristic is

$$\chi = M_0 - M_1 + M_2 = 1 - 4 + 1 = -2, \tag{12.2.18}$$

which is indeed the Euler characteristic for a surface of genus $g = 2$ found
from de Rham theory.

This can also be easily generalized to two-dimensional surfaces of arbitrary 
genus g. We know that the Betti numbers for this surface are given by 

$$b_0 = b_2 = 1, \qquad b_1 = 2g, \tag{12.2.19}$$

so that the Euler characteristic is given by

$$\chi(M) = 2(1 - g). \tag{12.2.20}$$


If we now compare this to the negative eigenvalues of the height function, we
find an exact correspondence. We find that the Betti numbers $b_q$ equal the $M_q$
for this surface.

For more complicated manifolds, however, this exact correspondence
between Betti numbers and $M_q$ breaks down and only the weak Morse inequalities
apply. In particular, we are unable to calculate exact expressions for the Betti
numbers via Morse theory.
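The bookkeeping in this example is simple enough to tabulate. The sketch below assumes that the upright genus-$g$ surface has one minimum, one maximum, and two saddles per hole, as in the figure-eight case above. It also includes a sphere deformed to have two peaks (one minimum, one saddle, two maxima) as a hypothetical case, not taken from the text, in which the counts exceed the Betti numbers and only the weak inequality (12.2.16) holds. All function names are illustrative.

```python
def euler_characteristic(counts):
    # Alternating sum, valid for either Betti numbers or Morse counts
    return sum((-1) ** q * c for q, c in enumerate(counts))


def upright_surface_morse_counts(genus):
    # Height function on an upright genus-g surface: min, 2g saddles, max
    return [1, 2 * genus, 1]


def surface_betti_numbers(genus):
    # Eq. (12.2.19)
    return [1, 2 * genus, 1]


def weak_morse_inequalities_hold(morse, betti):
    # Eq. (12.2.16), checked degree by degree
    return all(m >= b for m, b in zip(morse, betti))


for g in range(4):
    morse = upright_surface_morse_counts(g)
    betti = surface_betti_numbers(g)
    assert morse == betti                              # exact correspondence
    assert euler_characteristic(morse) == 2 * (1 - g)  # Eq. (12.2.20)

# Hypothetical height function on a sphere with two peaks.
two_peak_sphere_morse = [1, 1, 2]
sphere_betti = surface_betti_numbers(0)
print(weak_morse_inequalities_hold(two_peak_sphere_morse, sphere_betti),
      euler_characteristic(two_peak_sphere_morse))
```

In the two-peak case the individual counts overshoot the Betti numbers, while the alternating sum still reproduces the Euler characteristic of the sphere.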

However, we will now greatly expand the power of Morse theory by using 
techniques from an unexpected source: supersymmetric quantum field theory. 
Using the input of physics, we will now show how to improve upon the old 
Morse theory by calculating the exact expression for the Betti numbers. 

We begin by defining a modified set of coboundary operators as a function
of some fictitious “time” parameter $t$:

$$d_t = e^{-th}\, d\, e^{th}, \qquad d_t^* = e^{th}\, d^*\, e^{-th}, \tag{12.2.21}$$


for some Morse function $h(x)$. We then define $b_q(t)$ to be a $t$-dependent Betti
number

$$b_q(t) = \dim \ker\,(d_t d_t^* + d_t^* d_t), \tag{12.2.22}$$

where the kernel is taken on $q$ forms. Although $b_q(t)$ is defined for each $t$,
it is integer valued, and $d_t$ differs from $d$ only by conjugation with an
invertible exponential of $h$; it is therefore independent of $t$. Thus, we also
have $b_q(t) = b_q(0)$.

Last, we also define the $t$-dependent Hamiltonian as half of the $t$-dependent
Laplacian

$$H_t = \tfrac{1}{2}\,(d_t d_t^* + d_t^* d_t). \tag{12.2.23}$$

Notice that the number of zero energy states of this Hamiltonian at $t = 0$ is
also the number of harmonic forms and hence equals the Betti number. Thus,
the Betti number counts the number of zero energy states of the Hamiltonian
at $t = 0$.
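The statement that the number of zero modes does not depend on the deformation parameter can be illustrated on a lattice. The following is a rough sketch under several assumptions that are not in the text: the manifold is a circle discretized into `N` sites, $d$ on 0-forms is replaced by a periodic forward difference, the adjoint is replaced by the matrix transpose, and the Morse function is taken to be $\cos x$. Conjugating the difference operator by exponentials of $h$ changes the matrix but not its rank, so the zero-mode counts stay at the Betti numbers of the circle.

```python
import numpy as np

N = 64                                  # number of lattice sites (arbitrary choice)
x_node = 2 * np.pi * np.arange(N) / N   # sites carrying discrete 0-forms
x_edge = x_node + np.pi / N             # edge midpoints carrying discrete 1-forms


def h(x):
    # Morse function on the circle (an illustrative choice)
    return np.cos(x)


# Periodic forward difference, (D f)_i = f_{i+1} - f_i: a stand-in for d on 0-forms.
D = -np.eye(N) + np.roll(np.eye(N), 1, axis=1)


def betti_at(t):
    # d_t = exp(-t h) d exp(t h); its transpose plays the role of d_t^*.
    d_t = np.diag(np.exp(-t * h(x_edge))) @ D @ np.diag(np.exp(t * h(x_node)))
    rank = int(np.linalg.matrix_rank(d_t))
    b0 = N - rank   # dim ker(d_t^T d_t) = dim ker d_t, acting on 0-forms
    b1 = N - rank   # dim ker(d_t d_t^T) = dim ker d_t^T, acting on 1-forms
    return b0, b1


betti_by_t = {t: betti_at(t) for t in (0.0, 1.0, 2.0)}
print(betti_by_t)
```

The rank test is used instead of counting small eigenvalues because the deformed Laplacian becomes badly scaled as the parameter grows; moderate values of `t` keep the numerical rank reliable.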



To see how the number of zero energy states changes for finite t , let us power 
expand the operator d t as a function of t : 


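$$d_t = d + t\, a^{*i}\, (\partial h/\partial x^i) + \cdots, \qquad d_t^* = d^* + t\, a^i\, (\partial h/\partial x^i) + \cdots, \tag{12.2.24}$$

where we have introduced $a^i$ via

$$d\omega = a^{*i}\, \frac{\partial \omega}{\partial x^i}, \qquad d^*\omega = a^i\, \frac{\partial \omega}{\partial x^i}, \tag{12.2.25}$$

and

$$\{a^i, a^{*j}\} = g^{ij}. \tag{12.2.26}$$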


Putting this back into the Hamiltonian acting on a form $\omega$, we find the
expansion in $t$:

$$2H_t\,\omega = (dd^* + d^*d)\,\omega + t^2 g^{ij}\, \frac{\partial h}{\partial x^i} \frac{\partial h}{\partial x^j}\, \omega + t\, [a^{*i}, a^j]\, D_i D_j h\, \omega, \tag{12.2.27}$$

where $D_i$ is the covariant derivative with respect to $g^{ij}$.

To leading order in $t$ (that is, for large $t$), we have shown that the
$t$-dependent Hamiltonian contains a term proportional to the square of the
gradient of $h$. The minima of the Hamiltonian therefore correspond to the
critical points of the Morse function. If we power expand around one of these
critical points, we find

$$h(x) = h(0) + \sum_i \lambda_i x_i^2/2 + O(x^3), \tag{12.2.28}$$

where the number of negative $\lambda_i$ is the Morse index at the critical point.

The first two terms define the usual harmonic oscillator theory. The third
term is also easily analyzed by noticing that (with no sum over $i$)

$$[a^{*i}, a^i]\, dx^{\mu_1} \wedge \cdots \wedge dx^{\mu_p} = \pm\, dx^{\mu_1} \wedge \cdots \wedge dx^{\mu_p}. \tag{12.2.29}$$

(The eigenvalue is $+1$ if $i$ is one of the indices $\mu_j$ appearing in the volume
element and $-1$ if it is not.)

The energy is therefore given by the energy of a series of uncoupled harmonic
oscillators plus a correction factor

$$E_t = \tfrac{1}{2}\, t \sum_i \left[\, |\lambda_i| (1 + 2N_i) + \lambda_i n_i \right] + O(t^0), \tag{12.2.30}$$

where $n_i = \pm 1$.



We are interested in the number of zero-energy solutions. The energy is zero
if $N_i = 0$ and $n_i = -\operatorname{sign} \lambda_i$. However, we recall that the
number of negative eigenvalues $\lambda_i$ is the Morse index $p$, so this state is
a $p$ form. Each critical point of the Hamiltonian thus defines a wave function
whose energy is zero to order $t$, and there are $M_p$ such wave functions.
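The counting argument can be checked by brute force for a single critical point. The sketch below evaluates the leading term of Eq. (12.2.30) for every choice of the signs $n_i$ with all oscillators in their ground state; the Hessian eigenvalues used are hypothetical numbers chosen only for illustration, and the function names are not from the text.

```python
from itertools import product


def leading_energy(lambdas, occupations, signs, t=1.0):
    # Eq. (12.2.30) without the O(t^0) remainder
    return 0.5 * t * sum(
        abs(lam) * (1 + 2 * N) + lam * n
        for lam, N, n in zip(lambdas, occupations, signs)
    )


def zero_energy_states(lambdas):
    ground = (0,) * len(lambdas)   # all oscillators unexcited, N_i = 0
    return [
        signs
        for signs in product((+1, -1), repeat=len(lambdas))
        if abs(leading_energy(lambdas, ground, signs)) < 1e-12
    ]


# Hypothetical critical point with two negative Hessian eigenvalues (Morse index 2).
hessian_eigenvalues = (1.0, -2.0, -0.5)
states = zero_energy_states(hessian_eigenvalues)

# n_i = +1 marks an index present in the form, so the number of +1 entries
# is the degree of the form.
form_degree = sum(1 for n in states[0] if n == +1)
print(states, form_degree)
```

Exactly one sign assignment survives, and its form degree equals the number of negative eigenvalues, which is why each critical point of index $p$ contributes one approximate zero mode among the $p$ forms.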

However, these states have zero energy only to leading order in $t$. Beyond
this order, some of them receive positive energy contributions, and hence the
number of exact zero-energy states can only decrease. But the exact number of
zero-energy states is independent of $t$ and equals $b_q$, as we saw earlier. Thus,
the number of exact zero-energy states (the Betti number) is at most the number
of approximate zero-energy states found at large $t$ ($M_q$):

$$b_q \leq M_q, \tag{12.2.31}$$

which is the Morse inequality derived from quantum mechanics.

In summary, this simple quantum mechanical model reproduces a proof of
the known Morse inequality conditions. However, using the power of
supersymmetry, we can do even better than this. We can go beyond the standard
inequalities of Morse theory and generate new information, that is, we can
calculate the Betti numbers in terms of Morse theory.


12.3 Sigma Models and Floer Theory 

To be specific, we will start with a supersymmetric nonlinear sigma model, with
a metric $g_{ij}(\phi)$, which is a function of a scalar field $\phi^i$. We also
introduce the Morse function $h(\phi)$. In supersymmetric language, we introduce
the superfield $\Phi^i$:

$$\Phi^i = \phi^i + \bar{\theta} \psi^i + \bar{\theta}\theta F^i/2. \tag{12.3.1}$$


The action is then [1]: 

S = j d 2 xd 2 e [g, 7 (<I>)D<f>' D<Z> j + h(<D)] 

= \ f d 2 x^g ij (4>)d^<p i d^cp J + igijtyWy* D nf } 

A7 At “ 

+ ^RaiM)if l ir l ir k f } - g ,J (4>)-rn - D i°j , 

0<p l 0(p J 

D»r = d^r + r;.*(tf>)a^V- 

(12.3.2) 

We can make contact with Morse theory by making the following
identification between cohomology operators $d$ acting on $p$ forms and the
supersymmetric operator $Q$ of the sigma model acting on states with fermion
number $F$:

$$d \leftrightarrow Q, \qquad d^* \leftrightarrow Q^*, \qquad dd^* + d^*d \leftrightarrow 2H = \{Q, Q^*\}. \tag{12.3.3}$$


The supersymmetric sigma model thus gives us a specific realization of Morse 
theory. With this identification of Q as a cohomology operator d , we can use 
the previous discussion to prove the Morse inequalities. 

Our discussion so far has been perturbative. Now, however, we will take the 
formalism one step further, by analyzing tunneling between different critical 
points. Supersymmetry will guarantee that the energy of the Hamiltonian will 
vanish to all orders in perturbation theory; however, tunneling will in general 
lift some of the degeneracies among the zero-energy states. Tunneling via 
instantons can remove some of the critical points. The key point is that if we 
can calculate the number of critical points removed by tunneling, then we can 
calculate corrections to Eq. (12.2.31) and hence b p itself. 

Let $|P_i\rangle$ represent the perturbative vacuum defined at one of the critical
points $P_i$. We wish to calculate the matrix element between different critical
points $\langle P_i | d_t | P_j \rangle$. Normally, this matrix element is zero
(because of the presence of fermionic zero modes). However, nonperturbative
instanton effects can render the matrix element nonzero if the instanton effect
produces a fermion zero mode, which is absorbed by $d_t$.

At this point, we use standard instanton arguments to calculate this
amplitude. We wish to find solutions $x(\tau)$ that take us from one critical point
to another, parametrized by some $\tau$. Thus, we wish to find the equation for
$x(\tau)$ connecting two critical points, that is, $x(-\infty) = P_i$ and
$x(\infty) = P_j$.

The equation for the instanton is

$$\frac{dx^i(\tau)}{d\tau} = g^{ij}[x(\tau)]\, \frac{\partial h[x(\tau)]}{\partial x^j}. \tag{12.3.4}$$


Given the instanton solution connecting two critical points, we can use
tunneling arguments to show that the matrix element connecting the two critical
points is given by

$$\langle P_i | d_t | P_j \rangle = n(P_i, P_j)\, \exp\{-t\,[h(P_i) - h(P_j)]\}, \tag{12.3.5}$$

where $n(P_i, P_j)$ is an integer that is computable once we are given the Morse
function $h$.
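A tunneling path of this kind is a gradient-flow line of the Morse function, and in one dimension it can be traced numerically. The following is a minimal sketch, assuming a circle with flat metric, the illustrative function $h(x) = -\cos x$ (minimum at $x = 0$, maximum at $x = \pi$), and the ascending sign convention for the flow; none of these choices is fixed by the text, and a classical Runge-Kutta step stands in for an exact solution.

```python
import math


def h(x):
    # Illustrative Morse function on the circle: minimum at 0, maximum at pi
    return -math.cos(x)


def grad_h(x):
    # Derivative of h(x) = -cos x
    return math.sin(x)


def flow(x0, tau_max=30.0, dt=0.01):
    """Integrate dx/dtau = dh/dx with fourth-order Runge-Kutta steps."""
    x = x0
    for _ in range(int(tau_max / dt)):
        k1 = grad_h(x)
        k2 = grad_h(x + 0.5 * dt * k1)
        k3 = grad_h(x + 0.5 * dt * k2)
        k4 = grad_h(x + dt * k3)
        x += dt * (k1 + 2 * k2 + 2 * k3 + k4) / 6
    return x


x_start = 1e-3            # just off the minimum, standing in for tau -> -infinity
x_end = flow(x_start)     # approaches the maximum as tau grows
height_gain = h(x_end) - h(x_start)
print(x_end, height_gain)
```

The path leaves the neighborhood of one critical point and settles at the other, and the height gained along it is the difference of the two critical values, the same combination that appears in the exponent of the tunneling amplitude.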

So far, our discussion has been rather general. Now, we come to the key
point of our discussion: we will define a new cohomology operator $\delta$ that will
establish the link between the Betti numbers and Morse theory.

Let us first define $W_p$ to be the set of eigenstates $|P\rangle$ such that
$\mu(P) = p$ for some integer $p$. Let us define a new cohomology operator $\delta$,
defined in terms of the integer $n(P_i, P_j)$, which is given by

$$\delta |Q\rangle = \sum_{P \in W_{p+1}} n(Q, P)\, |P\rangle \tag{12.3.6}$$

for $Q \in W_p$. Notice that the operator $\delta$ takes us from $W_p$ to $W_{p+1}$. We can
also show that $\delta^2 = 0$ and that it satisfies all the properties of a standard
cohomology.

Given this new cohomology operator $\delta$, we can then define the Betti number
as

$$b_p = \dim\left( [\ker \delta / \operatorname{im} \delta] \cap W_p \right). \tag{12.3.7}$$

We have now succeeded in our goal [1] of defining the Betti number of the
manifold totally in terms of Morse theory. Using the tool of the supersymmetric
sigma model, we have converted the old Morse inequalities into precise
identities.
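A small worked case shows how the instanton counts turn the inequalities into identities. The sketch below takes a sphere whose height function has one minimum, one saddle, and two maxima, and assumes values for the integers $n(Q, P)$: the two flow lines from the minimum to the saddle are taken to cancel, and the saddle is taken to connect to the two maxima with opposite signs. These counts are hypothetical inputs chosen for illustration, not values computed in the text.

```python
import numpy as np


def betti_from_morse_complex(dims, deltas):
    """dims[p] = M_p; deltas[p] is the matrix of delta from W_p to W_{p+1}."""
    ranks = [int(np.linalg.matrix_rank(d)) for d in deltas]
    betti = []
    for p, m in enumerate(dims):
        rank_out = ranks[p] if p < len(ranks) else 0   # rank of delta on W_p
        rank_in = ranks[p - 1] if p > 0 else 0         # rank of delta into W_p
        betti.append(m - rank_out - rank_in)           # dim ker - dim im
    return betti


dims = [1, 1, 2]                       # M_0, M_1, M_2 for the two-peak sphere
delta_0 = np.array([[0]])              # minimum -> saddle: contributions cancel (assumed)
delta_1 = np.array([[1], [-1]])        # saddle -> the two maxima (assumed signs)

assert not np.any(delta_1 @ delta_0)   # nilpotency, delta^2 = 0

betti = betti_from_morse_complex(dims, [delta_0, delta_1])
print(betti)
```

The saddle and one combination of the maxima pair up and drop out of the cohomology, leaving the Betti numbers of the sphere even though the raw counts $M_p$ are larger.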

Let us now generalize our discussion to the case of more complicated
manifolds in three and four dimensions. By now, we see a strategy emerging for
using quantum field theory to calculate the Morse invariants $W_q$:

(1) First, start with a supersymmetric quantum field theory where we can
define a Morse function $h(\Phi)$ and identify the supersymmetric operator $Q$
with the nilpotent coboundary operator $d$.

(2) Construct the $t$-dependent Hamiltonian as a function of the Laplacian. At
$t = 0$, the number of zero-energy states equals the number of harmonic
forms, or the Betti number.

(3) Calculate the Morse number $M_p$ as the number of zero-energy states and
compare it to the zero-energy states of the $t = 0$ Hamiltonian, which are
the Betti numbers.

(4) Using instanton methods, calculate the transition matrix element between
different critical points, and from this, define a new nilpotent $\delta$ whose
cohomology gives us the Betti numbers directly in terms of Morse theory.


Let us now follow these simple steps and sketch how this approach can be 
applied to the case of the Yang-Mills theory. This will, in turn, give us entirely 
new topological invariants defined in three and four dimensions. 

We begin by taking the $h$ function to be the Chern-Simons action in three
dimensions [2]:


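$$h(A) = -\frac{1}{2} \int d^3x\, \epsilon^{ijk}\, \mathrm{Tr}\left( A_i \partial_j A_k + \tfrac{2}{3} A_i A_j A_k \right). \tag{12.3.8}$$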


The critical points of h are found by taking functional derivatives of the action 

$$\delta h(A)/\delta A_i^a(x) = -\epsilon^{ijk} F_{jk}^a(x)/2. \tag{12.3.9}$$

Because the derivatives are proportional to the curvature tensor, the critical
points correspond to the space of connections where the curvature vanishes,
that is, the space of flat connections. We will call the space of flat connections,
modulo gauge transformations, Floer complexes.

Repeating the same steps as before, we find that the cohomology operators 
are given by 


$$d = \int d^3x\, \psi_i^a(x)\, \frac{\delta}{\delta A_i^a(x)}, \qquad d^* = \int d^3x\, \bar{\psi}_i^a(x)\, \frac{\delta}{\delta A_i^a(x)}. \tag{12.3.10}$$


Generalizing Eq. (12.2.21), we now define $e$-dependent cohomology operators
as

$$d_e = e^{-h/e^2}\, d\, e^{h/e^2}, \qquad d_e^* = e^{h/e^2}\, d^*\, e^{-h/e^2}. \tag{12.3.11}$$

We introduce the Hamiltonian via

$$2 e^{-2} H_e = d_e d_e^* + d_e^* d_e. \tag{12.3.12}$$

Written out explicitly, we have 


$$H = \int d^3x\, \mathrm{Tr}\left( e^2\, (\Pi_i^a)^2 + e^{-2} (B_i^a)^2 + \epsilon^{ijk}\, \bar{\psi}_i D_j \psi_k \right) \tag{12.3.13}$$

and 


■W—w 

(12.3.14) 


As before, we can construct $W_q$ as the number of zero-energy eigenfunctions of
the Hamiltonian for finite $e$ and $b_q$ as the number of zero-energy eigenfunctions
at $e = \infty$.

We must now analyze whether nonperturbative effects play an important
role. Let us analyze tunneling effects that connect different critical points. As
before, the tunneling effects are determined by a differential equation in a
fictitious parameter $\tau$ connecting the different critical points. The analog of
Eq. (12.3.4) is

$$\partial A_i^a(x, \tau)/\partial \tau = B_i^a(x, \tau). \tag{12.3.15}$$

(This equation has a simple meaning. If we identify $\tau$ as a time coordinate, then
it is easy to see that the previous equation sets the electric field proportional
to the magnetic field, that is, $F = -\tilde{F}$ and the curvature is self-dual. Thus,
the tunneling effects are described by the standard instanton effects found in
ordinary Yang-Mills theory.)

We conclude this discussion by noting that Floer [2] was able to use the
Yang-Mills theory to construct a new boundary operator $\partial$ that was nilpotent
and to define a new homology group, called the Floer group. From these, he
was able to construct new topological invariants for three-manifolds, using
physics as the crucial input in a purely topological formulation.



Floer’s discussion, however, was incomplete because it left open the
possibility of a generalization to a fully four-dimensional-type formulation. Atiyah
then conjectured [6], and Witten later proved [7], that a four-dimensional
generalization of Floer theory should give the topological polynomial invariants of
Donaldson. We now turn to a discussion of how to generalize this formulation
to four dimensions and to a wide variety of other theories.


12.4 Cohomological Topological Field Theories 

Other topological models, in which general covariance is exact, are the
cohomological models [7], where a background metric may be present, but where
the Lagrangian is given by

$$L = 0 \tag{12.4.1}$$

or a topological invariant, such as

$$L \sim F \wedge F. \tag{12.4.2}$$

This may appear strange, because then the system appears to be empty.
Although the action is zero, the “physics” of the theory is to be found entirely in
the field content and its gauge variation. We will find that, after gauge fixing the
gauge fields, a nilpotent BRST operator $Q$ arises, and the gauge fixed action,
with its Faddeev-Popov term, is given by

$$L_{\mathrm{gf+fp}} = \{Q, V\}, \tag{12.4.3}$$

where $V$ is some field composed out of the original fields and their ghosts.

Most important, we will then take the variation of the Lagrangian with
respect to the background metric to derive the energy-momentum tensor. Because
the gauge fixed action is itself BRST invariant, we find that the variation of
the action with respect to the background metric yields the energy-momentum
tensor $T_{\alpha\beta}$ via

$$\delta L = \frac{1}{2} \int \sqrt{g}\, \delta g^{\alpha\beta}\, T_{\alpha\beta} \tag{12.4.4}$$

and

$$T_{\alpha\beta} = \{Q, V_{\alpha\beta}\} \tag{12.4.5}$$

for some field $V_{\alpha\beta}$.

This last statement, that the energy-momentum tensor is BRST trivial, is one
of the most important features of the cohomological topological field theories.
Since BRST trivial operators vanish when inserted into a correlation function,
it means that we are free to vary the background metric $g_{\mu\nu}$ without changing
the theory, that is, the theory is locally generally covariant.



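For example, let us insert an operator $O$ into a path integral

$$Z_O = \int D\phi\; e^{i \int L\, d^4x}\, O. \tag{12.4.6}$$

Let us now make a small BRST variation of the path integral labeled by $\epsilon$,
where the action and the measure are both BRST invariant. The path integral
becomes $Z_\epsilon$, which equals $Z_O$. Then, we find

$$\begin{aligned} 0 = Z_\epsilon - Z_O &= \int D\phi\; e^{\epsilon Q}\, e^{i \int L\, d^4x}\, O - Z_O \\ &= \int D\phi\; e^{i \int L\, d^4x}\, (O + \epsilon \{Q, O\}) - Z_O \\ &= \int D\phi\; e^{i \int L\, d^4x}\, \epsilon \{Q, O\}. \end{aligned} \tag{12.4.7}$$

Thus,

$$\langle \{Q, O\} \rangle = 0 \tag{12.4.8}$$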

for any field $O$. (Another, more intuitive, way in which to see that
cohomological topological field theories are independent of the choice of metric is
to notice that the metric tensor enters the theory through BRST gauge fixing.
Since the metric is introduced as a gauge artifact, and since the physical
properties of a field theory are always independent of gauge fixing, the cohomological
topological theories must also be independent of the metric.)

One of the most important examples of such a cohomological theory is a
four-dimensional topological Yang-Mills theory, which resembles a twisted version
of supersymmetric $N = 2$ gauge theory and is the four-dimensional extension
of the three-dimensional Floer theory. There are many ways to approach the
quantization of this topological Yang-Mills theory. We will explore just a few
of them.

We begin with a theory of zero action, but then postulate that this action is
invariant under the following gauge transformation:

$$\delta A_\alpha^a = \psi_\alpha^a. \tag{12.4.9}$$

Upon first glance, this is a highly unusual gauge transformation, much larger
than the usual $SU(N)$ gauge transformation. Because $\psi_\alpha^a$ has the same number
of indices as $A_\alpha^a$, it implies that we can use $\psi_\alpha^a$ to eliminate all the
fields contained within $A_\alpha^a$, that is, the theory is vacuous. However, several
nontrivial features begin to emerge when we gauge fix this seemingly trivial
theory with zero action [11-13].

First, let us choose a gauge so that the $F_{\alpha\beta}^a$ is self-dual

$$\text{gauge choice}: \quad F_{\alpha\beta}^a + \tilde{F}_{\alpha\beta}^a = 0. \tag{12.4.10}$$

Naively, we might believe that this gauge completely determines all fields of
the theory, and hence, the theory is again vacuous. However, there is a subtle
point here that will prove crucial in our later discussion. Although demanding
that the curvature be self-dual fixes the infinite degrees of freedom within our
fields, it is not sufficient to fix all finite degrees of freedom. As is well known
in gauge theory, there are nontrivial solutions to the self-dual equation, given
by instantons. In other words, after gauge fixing there are still finite degrees
of freedom left in the theory given by the space of instantons.

The space of parameters necessary to label one-to-one the space of instantons 
is called the “moduli space” of instantons (which in turn is intimately linked to 
the Donaldson polynomials). In fact, for each cohomological topological field 
theory, we will find that gauge fixing leaves finite degrees of freedom labeled 
by some moduli space. 

There is also a second complication, however. Since our theory is locally 
gauge invariant, we demand that our field transform under local SU(N), so 
that the total variation is 


$$\delta A_\alpha^a = (D_\alpha \phi)^a + \psi_\alpha^a, \tag{12.4.11}$$

where $\phi^a$ is an $SU(N)$ gauge parameter. Notice, however, that it is possible to
absorb the $\phi^a$ term completely into the $\psi_\alpha^a$ term. This means that the gauge
parameter has its own hidden gauge symmetry given by

$$\delta \psi_\alpha^a = (D_\alpha \phi)^a. \tag{12.4.12}$$

Because of the tight relationship between the two gauge parameters $\phi^a$ and
$\psi_\alpha^a$, the Faddeev-Popov ghosts arising from gauge fixing will themselves have
ghosts. In general, for complicated gauge choices, we will have to use the BV 
“ghosts-for-ghosts” quantization method. However, we will choose a simple 
enough gauge so that only second generation ghosts are required, so that the 
Faddeev-Popov prescription is adequate. 

With these preliminaries, let us begin the quantization of the topological 
theory with zero action. The usual prescription gives us the gauge fixing term 
and the Faddeev-Popov ghost term 

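$$L_{\mathrm{gf+fp}} = (i/8)\,\alpha_0 B^{\alpha\beta} B_{\alpha\beta} + (i/4)\, B^{\alpha\beta}(F_{\alpha\beta} + \tilde{F}_{\alpha\beta}) - i \chi^{\alpha\beta} D_\alpha \psi_\beta, \tag{12.4.13}$$

where $\alpha_0$ is a constant, $\chi$ and $\psi$ are the standard Faddeev-Popov ghosts, such
that $\chi$ is self-dual, and $B_{\alpha\beta}$ is a self-dual auxiliary field.

Notice that we can, in turn, write this action as the off-shell, nilpotent BRST
variation of the following term:

$$L_{\mathrm{gf+fp}} = (i/4)\, \delta_1 \left[ \chi^{\alpha\beta} \left( F_{\alpha\beta} + \tilde{F}_{\alpha\beta} + \tfrac{1}{2} \alpha_0 B_{\alpha\beta} \right) \right], \tag{12.4.14}$$

where

$$\delta_1 A_\alpha^a = \psi_\alpha^a, \qquad \delta_1 \psi_\alpha^a = 0, \qquad \delta_1 \chi_{\alpha\beta}^a = B_{\alpha\beta}^a, \qquad \delta_1 B_{\alpha\beta}^a = 0. \tag{12.4.15}$$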



(At this point, we have the option of solving for the $B_{\alpha\beta}$ field via its
equations of motion, so the action reduces to the square of the self-dual condition
on the Yang-Mills field. However, the BRST invariance then only holds on-shell,
and we must use the BV quantization procedure.)

As we noted earlier, there is still a hidden symmetry in the theory because
the ghost field $\psi_\alpha^a$ has its own remaining ghost symmetry, parametrized by the
ghost field $\phi^a$. The action possesses a hidden symmetry


SgK = *(A,0) fl , 

s g b afia = -/* o [ 0 , 


(12.4.16) 


The field $\psi$ has four degrees of freedom, but $\chi$ has only three. We find that
the “ghosts” themselves require more Faddeev-Popov fixing.

To fix the remaining symmetry within the anticommuting Faddeev-Popov
fields, we must introduce a set of commuting Faddeev-Popov fields, $\phi$ and $\lambda$,
and an anticommuting field $\eta$. Let us therefore introduce a new gauge fixing
action with more Faddeev-Popov terms

$$L'_{\mathrm{gf+fp}} = \delta_{\mathrm{BRST}} \left[ i c_0 \lambda (D_\alpha \psi^\alpha + s b) + c_1 \chi^{\alpha\beta} B_{\alpha\beta} \right], \tag{12.4.17}$$

where $\delta_{\mathrm{BRST}} = \delta_1 + \delta_G$.

Let us now write the full action 

(L + L') gf+fp = -ix^Dafe - iT]D a x[f a + \XD a D a <j> 

- (i/2)e 0 X[r, f a ] ~ (i/%)e 0 <t>[x afi , X«p\ 

+ se 0 [i<l>[r}, rj] + (e 0 /4)[(p, A] 2 ] + \B afi B aP 
+ (i/4)B a P(F a p + F a p) 

= \{F 4- Ff - ix ap D a ir fi - 
+ (k/2)D a D a <t> - (i/2)e 0 k[rjr\ yfr a ] - (i/ 8)^[x“^ X*p] 
+ se 0 [i(l>[ii, t]] + (e 0 /4)[<f>, X] 2 ]. 

(12.4.18) 

In this action, we have replaced the b field in terms of a new field 77 , given by 
b = cq [0, r]]. The gauge symmetry is maintained by having <5 G X = 2rj and 
SqT] = — (i/2)eo[\fr, X]. We have made this replacement in order to preserve 
a symmetry arising from the Floer three-dimensional action, which is called 
U symmetry. With this replacement, the scaling dimensions and U weights 
of the fields (A, 0, X, 0, /) are given as (1, 0, 2, 1, 2) and (0, 2, —2, 1, —1), 
respectively. Also, we have made the choice c 0 = — \ and c\ = (We note 
that the action of Eq. (12.4.18) is not unique. By adding in a BRST variation of 
some arbitrary collection of fields, the theory remains the same. For example, 
the 0 field is inert under the BRST variation.) 

As we mentioned earlier, the hallmark of a cohomological topological theory
is that its energy-momentum tensor is BRST trivial. By direct calculation, we
can show that the energy-momentum tensor is equal to


Tap = {Q, k a p}, 

KfS = jTr (F aa Xp + FfiaXa - \g^FarX aT ) 

+ \ Tr {ir a Dp\ + x/fpD a k - g a p\lr a D a k) 
+ \gctfsTr(ri[<t>, *])• 


By direct BRST gauge fixing, we have therefore converted the original action, 
which was zero, into a BRST variation. There is, however, yet another way in 
which to construct a topological theory, and this is through its relationship to 
N = 2 supersymmetry. 

If we analyze the field content of the previous theory, we see that it is
equivalent to an $N = 2$ supersymmetric Yang-Mills theory, but with an
important difference. An $N = 2$ theory possesses two spinorial supersymmetry
generators, $Q^i_\alpha$, where $\alpha$ is a spinor index. Our goal is to rewrite this theory
such that we extract a single fermionic, nilpotent Lorentz scalar $Q$ out of the
supersymmetric generators.

Normally, this is impossible, because an irreducible Lorentz spinor does not
contain any scalars. But, this can be changed if we add a twist to the theory,
such that the energy-momentum tensor is altered so that one component of
$Q^i_\alpha$ becomes a Lorentz scalar.

This twisting process is most easily represented on a two-dimensional theory. 
We begin with an N = 2 supersymmetry with an R symmetry associated with 
it, whose generator is R AA . Then, the supersymmetric generator is Q a ±, with 
commutation relations 


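$$\{Q_{\alpha+}, Q_{\beta-}\} = \gamma^\mu_{\alpha\beta} P_\mu, \qquad \{Q_{\alpha+}, Q_{\beta+}\} = \{Q_{\alpha-}, Q_{\beta-}\} = 0. \tag{12.4.20}$$

In two dimensions, a spinor has only two components, also labeled $\pm$, so that
we have four nilpotent supercharges $Q_{\pm\pm}$.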

The key step is that we will now modify the theory so that the energy- 
momentum tensor becomes 


$$T'_{\mu\nu} = T_{\mu\nu} + \epsilon_{\mu\sigma} \partial^\sigma R_\nu + \epsilon_{\nu\sigma} \partial^\sigma R_\mu. \tag{12.4.21}$$

Altering the energy-momentum tensor means that we are also altering the
rotation group generator $J$, which only has one component, by $J' = J + R$.

In this new basis, with an altered rotation group generator, we find that $Q_{-+}$
and $Q_{+-}$ now transform as scalars. Since both are nilpotent, we now define
the new BRST generator as

$$Q_{\mathrm{BRST}} = Q_{-+} + Q_{+-}. \tag{12.4.22}$$

In this way, an $N = 2$ theory has now been modified so that a new scalar,
nilpotent $Q_{\mathrm{BRST}}$ operator can be constructed, such that the action becomes
a BRST commutator. It is thus not surprising that topological field theories
have the same field content as $N = 2$ superfield theories, but with a different
realization of supersymmetry and Lorentz invariance.


12.5 Correlation Functions 

Because the underlying action is zero, one is tempted to conclude that the 
correlation functions must also be zero. However, this is not true. Because 
topological field theories are totally independent of the choice of background 
metric by construction, we find that the correlation functions reproduce the 
known topological invariants found by topologists. In fact, this was one of 
the original motivations for studying these topological field theories: to
provide a quantum field theoretical framework in which to generate topological
invariants.

Perhaps the most intriguing of these topological invariants are the Donaldson
polynomials [4]. Surprisingly enough, Donaldson first analyzed instanton
solutions to the Yang-Mills equation and their moduli space in order to construct
his invariants.

Because topological gauge theory is based on self-dual fields, one can show
that the correlation functions of the theory are precisely the Donaldson
polynomials. To see this, we want to construct correlation functions among fields
that are BRST invariant, but not BRST trivial. In other words, we want a field
whose BRST variation is zero, but cannot be written as a BRST commutator
(in which case, as we have seen, its correlation functions are exactly zero).

Examining the list of fields found in the topological gauge theory, we find
that the only invariant field is $\phi$. For a gauge invariant combination, we are led
to choose the following gauge-invariant, BRST-invariant field at point $P$:

$$W_0(P) = \tfrac{1}{2}\, \mathrm{Tr}\, \phi^2(P), \qquad \delta W_0(P) = 0, \qquad W_0 \neq \{Q, V\}, \tag{12.5.1}$$

for the group $SU(2)$. (For higher groups, there are obviously more gauge
invariants one can construct via $\phi^a$, corresponding to the number of Casimir
invariants of the group.) The correlation functions we are interested in are

$$Z(k) = \int D\phi\; e^{-L'} \prod_{i=1}^{k} W_0(P_i) = \langle W_0(P_1) \cdots W_0(P_k) \rangle. \tag{12.5.2}$$

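The next step is to show that this is really a topological invariant, that is, it
is independent of the location of the points $P_k$. This is easily shown. Let us
move the point $P$ a small distance. Then, the variation of the $W_0$ is given by

$$\frac{\partial}{\partial x^\alpha} W_0 = \frac{1}{2} \frac{\partial}{\partial x^\alpha} (\mathrm{Tr}\, \phi^2) = \mathrm{Tr}\, \phi D_\alpha \phi = i\{Q, \mathrm{Tr}\, \phi \psi_\alpha\}, \tag{12.5.3}$$

that is, the variation of $W_0$ is equal to a BRST commutator. Then,

$$W_0(P) - W_0(P') = \int_{P'}^{P} \frac{\partial W_0}{\partial x^\alpha}\, dx^\alpha = i\left\{ Q, \int_{P'}^{P} W_1 \right\}, \tag{12.5.4}$$

where $W_1 = \mathrm{Tr}(\phi \psi_\alpha)\, dx^\alpha$.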

Then, the variation of the correlation function by moving the point $P_1$ is
equal to the matrix element of a BRST commutator, so it vanishes

$$\delta Z(k) = \langle \delta W_0(P_1) \cdots W_0(P_k) \rangle = \left\langle \left\{ Q,\; i \int_{P'}^{P} W_1 \prod_{j=2}^{k} W_0(P_j) \right\} \right\rangle = 0 \tag{12.5.5}$$

as desired.

Now that we have shown that the correlation functions composed of $W_0$ are
topological invariants, let us construct a sequence of these BRST invariant (but
BRST nontrivial) operators. We notice that

$$0 = i\{Q, W_0\}, \qquad dW_0 = i\{Q, W_1\}. \tag{12.5.6}$$

Let us now extract, from $W_1$, a new BRST invariant operator $W_2$, which is
BRST nontrivial and so on [7]:

$$\begin{aligned} dW_1 &= i\{Q, W_2\}, \\ dW_2 &= i\{Q, W_3\}, \\ dW_3 &= i\{Q, W_4\}, \\ dW_4 &= 0, \end{aligned} \tag{12.5.7}$$

where

$$\begin{aligned} W_2 &= \mathrm{Tr}\left( \tfrac{1}{2}\, \psi \wedge \psi + i \phi \wedge F \right), \\ W_3 &= i\, \mathrm{Tr}\, (\psi \wedge F), \\ W_4 &= -\tfrac{1}{2}\, \mathrm{Tr}\, (F \wedge F). \end{aligned} \tag{12.5.8}$$

(Here, $dW_4 = 0$ because it is a five-form, which in four dimensions equals
zero by the antisymmetry of $dx^\alpha$.)

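Let us now generalize these results for an arbitrary topological theory, where
the invariant fields $W_k$ obey $dW_k = i\{Q, W_{k+1}\}$. Then, the integral

$$I(\gamma) = \int_\gamma W_k \tag{12.5.9}$$

around a $k$-dimensional cycle $\gamma$ is BRST invariant, that is,

$$\{Q, I(\gamma)\} = \int_\gamma \{Q, W_k\} = -i\int_\gamma dW_{k-1} = 0. \tag{12.5.10}$$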





This integral is only sensitive to the homology class of $\gamma$, that is, if we add to
$\gamma$ a boundary term $\partial\beta$, then it remains unaltered up to a BRST commutator:

$$I(\gamma + \partial\beta) = \int_{\gamma + \partial\beta} W_k = I(\gamma) + \int_\beta dW_k = I(\gamma) + i\int_\beta \{Q, W_{k+1}\} = I(\gamma). \tag{12.5.11}$$

Now, we construct the following topological invariant

$$Z(\gamma_1, \ldots, \gamma_r) = \Big\langle \prod_{i=1}^{r} \int_{\gamma_i} W_{k_i} \Big\rangle. \tag{12.5.12}$$

The above correlation function is gauge invariant, BRST invariant, and independent
of the location of the points where the $W_{k_i}$ are located. It is only dependent
on the homology cycles $\gamma_i$.

The four-dimensional invariants defined in Eq. (12.5.12) correspond to the
Donaldson polynomials. Hence, quantum field theory yields a straightforward
way in which to generate analytical expressions for these complicated
polynomials.

To motivate this identification, let us note the following. Donaldson originally
found his polynomials by examining the moduli space of instanton solutions
to the Yang-Mills equation. The dimension of the moduli space of instantons
is given by

$$\dim \mathcal{M} = 8\,p_1(M) - \tfrac{3}{2}\,[\chi(M) + \sigma(M)], \tag{12.5.13}$$

where $p_1$ is the first Pontryagin index, $\chi$ is the Euler index, and $\sigma$ is the
signature index of the manifold $M$.
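As a quick numerical illustration of Eq. (12.5.13), the sketch below evaluates the dimension formula for given topological data. The function name `instanton_moduli_dim` and its argument names are illustrative and do not appear in the text; the example assumes $M = S^4$ (so $\chi = 2$, $\sigma = 0$) and $p_1 = 1$.

```python
from fractions import Fraction


def instanton_moduli_dim(p1: int, euler: int, signature: int) -> Fraction:
    """Evaluate dim M = 8*p1 - (3/2)*(chi + sigma), as in Eq. (12.5.13)."""
    return 8 * p1 - Fraction(3, 2) * (euler + signature)


# Assumed example: the four-sphere has chi = 2 and sigma = 0.
print(instanton_moduli_dim(p1=1, euler=2, signature=0))  # prints 5
```

For these inputs the formula gives 5, and in general $8p_1 - 3$ on the four-sphere.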

The moduli space of instantons arises when one looks for the solutions to the
self-dual equation $F = -\tilde{F}$ in Yang-Mills theory. To find the moduli space,
let us make small variations in the field $A$ appearing in the self-dual equation

$$\delta(F + \tilde{F}) = D_\alpha \delta A_\beta - D_\beta \delta A_\alpha + \epsilon_{\alpha\beta\gamma\delta} D^\gamma \delta A^\delta = 0. \tag{12.5.14}$$

Of course, we want a solution to these equations modulo gauge transformations.
Thus, to eliminate this redundancy, we break gauge invariance by imposing

$$D_\alpha \delta A^\alpha = 0. \tag{12.5.15}$$

But, notice that these equations, which define the moduli space of solutions
to the self-dual equation, are equivalent to the equations for $\psi$
arising from our action in Eq. (12.4.18). The $\chi$ equation for $\psi$ is

$$D_\alpha \psi_\beta - D_\beta \psi_\alpha + \epsilon_{\alpha\beta\gamma\delta} D^\gamma \psi^\delta = 0, \tag{12.5.16}$$

while the $\eta$ equation for $\psi$ is given by

$$D_\alpha \psi^\alpha = 0. \tag{12.5.17}$$





These equations are identical to the equations defining the moduli space of
instantons. More precisely, the number of zero modes of $\psi$ minus the number
of zero modes of $\eta$ and $\chi$ equals the dimension of the space spanned by these
solutions, which is identical to the moduli space of instantons. So, the net number of
fermion zero modes must equal $\dim \mathcal{M}$.

However, the $U$ number of $\psi$ is equal to $+1$, while the $U$ number of $\eta$ and $\chi$
equals $-1$. Thus, we can alternatively count the $U$ number of various operators
to calculate the number of zero modes of these fermion fields and hence the
dimension of moduli space.

The $U$ number of $W_{k_i}$ is equal to $4 - k_i$. Thus, the correlation function
appearing in Eq. (12.5.12) must satisfy

$$\sum_{i=1}^{r} (4 - k_i) = \dim \mathcal{M}. \tag{12.5.18}$$

(If this equation is not satisfied, then the number of fermion zero modes in the
integration measure does not match the number of fermion zero modes in the
integrand, so the correlation function vanishes.)
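The selection rule of Eq. (12.5.18) is simple bookkeeping, which the following sketch makes explicit. The helper names `u_number` and `correlator_can_be_nonzero` are hypothetical, and the only input taken from the text is that the $k$-form operator $W_k$ carries $U$ number $4 - k$.

```python
def u_number(k: int) -> int:
    """U number of the k-form operator W_k, taken to be 4 - k as stated in the text."""
    if not 0 <= k <= 4:
        raise ValueError("W_k is only defined for k = 0, ..., 4 in four dimensions")
    return 4 - k


def correlator_can_be_nonzero(form_degrees, dim_moduli: int) -> bool:
    """Check the zero-mode counting sum_i (4 - k_i) == dim M of Eq. (12.5.18)."""
    return sum(u_number(k) for k in form_degrees) == dim_moduli


# Two W_0 insertions carry total U number 8, so they need dim M = 8.
print(correlator_can_be_nonzero([0, 0], 8))  # True
print(correlator_can_be_nonzero([0, 2], 8))  # False
```

A correlator that fails this test vanishes identically; one that passes it may still vanish for other reasons.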

Last, we can functionally integrate out the nonzero modes appearing in
the integration measure, consisting of the integration over the various fields
appearing in the model. Once all the nonzero modes have been integrated out,
we have

$$Z(\gamma_1, \gamma_2, \ldots, \gamma_r) = \int_{\mathcal{M}} \Phi^{(\gamma_1)} \wedge \Phi^{(\gamma_2)} \wedge \cdots \wedge \Phi^{(\gamma_r)}, \tag{12.5.19}$$

where each of the $\Phi^{(\gamma_i)}$, which remain after integrating over nonzero modes,
is a $(4 - k_i)$-form. We have now written the correlation function in terms of
an integral over the moduli space of instantons, which is the desired form for
comparison to the Donaldson polynomials.


12.6 Topological Sigma Models 

One of the objectives of topological field theory is to analyze the possible 
“unbroken phase” of string theory, where general covariance is unbroken. As 
a consequence, we will now apply our knowledge of topological gauge theory 
to write topological sigma models and topological gravity in two dimensions. 
Perhaps these will give us prototypes for the true theory that we are seeking. 

As before, we start with a vanishing action, or an action that is purely
topological, and proceed to quantize it. We start with the topological action
[11, 14] defined for the two-dimensional field $X^\mu$, where $\mu$ is a space-time
index

$$\int_\Sigma d^2z\; \epsilon^{ab} J_{\mu\nu}\, \partial_a X^\mu \partial_b X^\nu = \int_\Sigma J, \tag{12.6.1}$$





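where $\Sigma$ is the two-dimensional world sheet, and $T$ is space-time. We set

$$J = \tfrac{1}{2} J_{\mu\nu}\, dX^\mu \wedge dX^\nu, \tag{12.6.2}$$

where $J_{\mu\nu}$ describes the almost complex structure of space-time, such that $dJ = 0$
and

$$J^\mu{}_\nu J^\nu{}_\lambda = -\delta^\mu{}_\lambda. \tag{12.6.3}$$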

As before, the entire field $X^\mu$ is a gauge field and can be eliminated. We
now fix the gauge for the theory. Let us choose the self-dual condition

$$\partial_a X^\mu + \epsilon_a{}^b J^\mu{}_\nu\, \partial_b X^\nu = 0, \tag{12.6.4}$$

which eliminates the infinite number of degrees of freedom contained within the gauge
fields (leaving only a finite number of degrees of freedom).

We now add the gauge fixing term and the ghost contribution to the action in
the usual way. There are no “ghosts-for-ghosts” to complicate our discussion.
We add to the action

$$\mathcal{L}_{GF+FP} = \delta\big[\rho^a_\mu \big(\partial_a X^\mu + \epsilon_a{}^b J^\mu{}_\nu\, \partial_b X^\nu - \tfrac{1}{4} H_a^\mu\big)\big], \tag{12.6.5}$$

where

$$\rho^a_\mu = \epsilon^a{}_b\, J_\mu{}^\nu \rho^b_\nu \tag{12.6.6}$$

and $H_a^\mu$ is a self-dual field. The transformations of the fields (in flat world sheet
space) are given by


SoX" = ieyf, 

SoX 11 = 0 , 

Soft = - iTlx v p K a ), (12 - 6 ' 7) 

W = ~^X v X X {K,y + - ir^x'H*]. 

Notice that H occurs quadratically in the action, and that it does not propagate. 
If we eliminate H via its equation of motion, we find 


1 = 


J d a X» d a X v + e ab J^ d a X M d b X v ) 

-ip^ a x^-\x K x x PlPa V K v X 


The BRST current is 


J a = gn'idaX* + d b X V )x“- 


( 12 . 6 . 8 ) 


(12.6.9) 


The elimination of the $H$ field has resulted in the standard propagator of the
$X^\mu$ field. The space-time manifold is almost complex, and the theory simplifies
further if it is Kähler; however, the physical interpretation of this is obscure.

Once again, we see the same features emerging for the topological theory, 
that is, the action is zero and the gauge fixed action is a BRST commutator. The 





energy-momentum tensor is also BRST invariant, meaning that the theory is 
topological. In contrast to the previous case, however, there is now a restriction 
on the background metric. 


12.7 Topological 2D Gravity 


Normally, in critical string theory, the two-dimensional gravitational metric 
can be entirely gauged away. In noncritical dimensions, such as those found in 
matrix models, we find that two-dimensional gravity plays an essential role. 
Thus, the next theory we will examine is topological gravity in two dimensions 
[11,16,17]. 

As usual, we begin with the action being zero or a topological term. (In two
dimensions, the Einstein action itself is topological.) However, since there are
many ways in which the theory can be formulated, we will take the one that
most closely resembles the usual conformal field theory found in ordinary
string theory.

We begin by specifying the gauge fields. $\omega^{ab} = \omega^{ab}_\mu\, dx^\mu$ is the gauge field
associated with the Lorentz group. However, in two dimensions, there is only
one component to an antisymmetric second rank tensor, so we will simply
describe this gauge field as $\omega$. Second, there is the gauge field of the translations
for the Poincaré group in two dimensions, the vierbein $e^a = e^a_\mu\, dx^\mu$. The fields
are therefore $\{\omega, e^+, e^-\}$.

Then, we will need superpartners for these fields in order to construct the
BRST operator out of the supersymmetric operator. These superpartners will be
represented by $\{\psi^0, \psi^+, \psi^-\}$.

The gauge choice we make is to set the curvatures to zero

$$S = \int (\pi_0\, d\omega + \pi_+ De^+ + \pi_- De^-) + \int (\chi_0\, d\psi^0 + \chi_+ D\psi^+ + \chi_- D\psi^-), \tag{12.7.1}$$

where $\pi$ and $\chi$ are Lagrange multipliers, which enforce the gauge constraints, and

$$\begin{aligned}
De^+ &= de^+ - \omega \wedge e^+, \\
De^- &= de^- + \omega \wedge e^-, \\
D\psi^+ &= d\psi^+ - \omega \wedge \psi^+ + e^+ \wedge \psi^0, \\
D\psi^- &= d\psi^- + \omega \wedge \psi^- - e^- \wedge \psi^0.
\end{aligned} \tag{12.7.2}$$

The action is still invariant under local Lorentz transformations, parametrized
by $\alpha$, and by diffeomorphisms in two dimensions, parametrized by $\xi$:

$$\delta\omega = d\alpha + \xi \cdot d\omega, \qquad \delta e^\pm = \pm\alpha e^\pm + D(\xi \cdot e^\pm) + \xi \cdot De^\pm. \tag{12.7.3}$$

The supersymmetry interchanges the fields as follows:

$$\delta_s \omega = \psi^0, \qquad \delta_s e^\pm = \psi^\pm. \tag{12.7.4}$$





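If we now make the supersymmetry transformation on the transformation of
$\omega$ and $e^\pm$, we find the transformation of $\psi$ under local Lorentz transformations
and diffeomorphisms

$$\delta\psi^0 = \xi \cdot d\psi^0, \qquad \delta\psi^\pm = \pm\alpha\psi^\pm + D(\xi \cdot \psi^\pm) + \xi \cdot D\psi^\pm. \tag{12.7.5}$$

We can now write the BRST structure of the theory. Let $c_0$ and $c$ represent
the anticommuting ghosts associated with local Lorentz transformations and
diffeomorphisms, and let their supercounterparts be represented by $\gamma_0$ and $\gamma$.

In terms of these ghost parameters, we can now write the BRST
transformations for the fields $\omega$ and $\psi_0$ as

$$\delta\omega = \psi_0 + dc_0, \qquad \delta\psi_0 = d\gamma_0, \qquad \delta c_0 = \gamma_0, \qquad \delta\gamma_0 = 0. \tag{12.7.6}$$

The other variations are given by

$$\begin{aligned}
\delta e^\pm &= \psi^\pm \pm c_0 e^\pm + D(c \cdot e^\pm), \\
\delta\psi^\pm &= \pm c_0 \psi^\pm + D(c \cdot \psi^\pm) \pm \gamma_0 e^\pm + D(\gamma \cdot e^\pm), \\
\delta c &= \gamma + c \cdot \partial c, \\
\delta\gamma &= c \cdot \partial\gamma - \gamma \cdot \partial c.
\end{aligned} \tag{12.7.7}$$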

There are some interesting features of this transformation. First, notice that
correlation functions are made of BRST invariant fields; however, the likely
candidate for a BRST invariant field is $\gamma_0$, which in turn can be expressed as
the BRST transformation of another field. This means that, at first glance, the
theory seems totally empty. The set of BRST invariant operators that are not
in turn BRST variations of another field seems to be the null set. However, this is an
example of what is called “equivariant cohomology”: the BRST operator is nilpotent
only up to a gauge-dependent parameter $c_0$. Thus, as we shall see, the theory is not
totally empty.

Let us now break the symmetries of the theory. We can choose the conformal
gauge, so we can define a complex structure on the theory, so that

$$e^+ = e^{\phi_+}\, dz, \qquad e^- = e^{\phi_-}\, d\bar{z}. \tag{12.7.8}$$

We now break local Lorentz transformations by setting

$$\phi_+ = \phi_-. \tag{12.7.9}$$

Let us make a BRST variation of the previous gauge fixing condition. Then,
we find

$$c_0 = \tfrac{1}{2}\big(\partial c + c\,\partial\phi - \bar\partial \bar{c} - \bar{c}\,\bar\partial\phi\big). \tag{12.7.10}$$

If we let $\phi = \phi_+ + \phi_-$, then we can simplify the action and express it entirely
in terms of $\phi$ and its Lagrange multiplier. Also, we have, as in ordinary string





theory, the gauge fixing of the vierbein, which creates a Faddeev-Popov ghost 
action. Adding both parts, we find that the bosonic part of the action becomes 


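$$S_B = \int \pi\, \partial\bar\partial\phi + \int (b\,\bar\partial c + \bar{b}\,\partial\bar{c}). \tag{12.7.11}$$

We now repeat all these steps for the fermionic part of the action. The
superconformal gauge yields

$$\psi^+ = e^{\phi_+}\psi_+\, dz, \qquad \psi^- = e^{\phi_-}\psi_-\, d\bar{z}, \tag{12.7.12}$$

while breaking the super local Lorentz invariance yields

$$\psi_+ = \psi_-. \tag{12.7.13}$$

Making a supersymmetric variation on $c_0$ in Eq. (12.7.10), we also find an
equation for $\gamma_0$:

$$\gamma_0 = \tfrac{1}{2}\big(\partial\gamma + \gamma\,\partial\phi + c\,\partial\psi - \bar\partial\bar\gamma - \bar\gamma\,\bar\partial\phi - \bar{c}\,\bar\partial\psi\big). \tag{12.7.14}$$

Thus, the fermionic part of the action now becomes

$$S_F = \int \chi\, \partial\bar\partial\psi + \int (\beta\,\bar\partial\gamma + \bar\beta\,\partial\bar\gamma). \tag{12.7.15}$$

The final action is the sum of $S_B$ and $S_F$.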

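We are now in a position to write the complete BRST operator for the theory.
The energy-momentum tensor is equal to the sum of two pieces

$$\begin{aligned}
T_L &= \partial\pi\,\partial\phi + \partial^2\pi + \partial\chi\,\partial\psi, \\
T_{gh} &= c\,\partial b + 2\,\partial c\, b + \gamma\,\partial\beta + 2\,\partial\gamma\,\beta,
\end{aligned} \tag{12.7.16}$$

while the superconformal generator is also given by the sum of two pieces

$$\begin{aligned}
G_L &= \partial\chi\,\partial\phi + \partial^2\chi, \\
G_{gh} &= c\,\partial\beta + 2\,\partial c\,\beta.
\end{aligned} \tag{12.7.17}$$

We also have the supersymmetry charge

$$Q_s = \oint (\partial\pi\,\psi + b\,\gamma). \tag{12.7.18}$$

Putting everything together, the total BRST operator is now given by

$$Q_{BRST} = Q_s + \oint \big[c\,(T_L + T_{\beta\gamma} + \tfrac{1}{2} T_{bc}) + \gamma\, G_L\big]. \tag{12.7.19}$$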


12.8 Correlation Functions for 2D Topological Gravity 

Now that we have an explicit representation of the action and BRST operator in 
terms of familiar conformal fields, we can write correlation functions, as in Eq. 





(12.5.12). As before, the correlation functions must be over BRST invariant 
fields that are not in turn BRST variations of other fields. 

Here, $\gamma_0$ (like $\phi$ appearing in the topological Yang-Mills theory) is a BRST
invariant operator. However, unlike our previous case, it can be written as the
BRST variation of another field, $c_0$. Normally, this means that all correlation
functions made out of $\gamma_0$ vanish, so that the entire formalism collapses
and the theory is empty. However, topological gravity is an example of an
“equivariant cohomology,” that is, the BRST operator is nilpotent, modulo a
gauge-dependent field such as $c_0$. In practice, when constructing correlation
functions, we will find that they do not vanish, because of contact interactions.

As in topological Yang-Mills theory, we have a sequence of operators that
we can construct from $\gamma_0$. Repeating the sequence of manipulations developed
in Eqs. (12.5.6)-(12.5.8), we can define the following sequence of operators:

$$\sigma_n^{(0)} = \gamma_0^{\,n}, \qquad d\sigma_n^{(0)} = \delta\sigma_n^{(1)}, \qquad d\sigma_n^{(1)} = \delta\sigma_n^{(2)}, \tag{12.8.1}$$

where the superscript $(i)$ represents an $i$-form on the Riemann surface. Since
the differential of $\sigma_n^{(i)}$ is the BRST variation of $\sigma_n^{(i+1)}$, correlation functions
composed out of $\sigma_n$ are independent of the points at which they are defined.
Explicitly, we have the following:

$$\begin{aligned}
\sigma_n^{(1)} &= n\,\psi_0\,\gamma_0^{\,n-1}, \\
\sigma_n^{(2)} &= n\,d\omega\,\gamma_0^{\,n-1} + \tfrac{1}{2}\,n(n-1)\,\psi_0 \wedge \psi_0\,\gamma_0^{\,n-2}.
\end{aligned} \tag{12.8.2}$$
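The first step of this descent can be checked directly. The short computation below is a sketch that assumes only the BRST variations $\delta\psi_0 = d\gamma_0$ and $\delta\gamma_0 = 0$ of Eq. (12.7.6), the one-form $\sigma_n^{(1)} = n\,\psi_0\,\gamma_0^{\,n-1}$, and that $\gamma_0$ is a commuting field; the overall sign of the vanishing term is convention dependent and is left as $\pm$.

```latex
\begin{aligned}
\delta\sigma_n^{(1)}
  &= \delta\bigl(n\,\psi_0\,\gamma_0^{\,n-1}\bigr) \\
  &= n\,(\delta\psi_0)\,\gamma_0^{\,n-1}
     \;\pm\; n(n-1)\,\psi_0\,\gamma_0^{\,n-2}\,(\delta\gamma_0) \\
  &= n\,(d\gamma_0)\,\gamma_0^{\,n-1}
     \qquad\text{(since } \delta\gamma_0 = 0\text{)} \\
  &= d\bigl(\gamma_0^{\,n}\bigr) \;=\; d\sigma_n^{(0)} .
\end{aligned}
```

This reproduces the relation $d\sigma_n^{(0)} = \delta\sigma_n^{(1)}$, the analogue of $dW_0 = i\{Q, W_1\}$ in the Yang-Mills case.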


Given the form of the correlation functions, we can write recursion relations
[18-21] between them that will be shown to be precisely the same as those
found in the matrix model.

The goal for the genus 0 recursion relation is to find a relation between correlation
functions involving $\sigma_{d_i}$ and correlation functions where $d_i$ is replaced
by $d_i - 1$. Then, by repeatedly applying this recursion relation, eventually $\sigma_{d_i}$
can be reduced to correlation functions involving only $\sigma_0 = P$, which are
known. Thus, this recursion relation will be able to reduce all possible correlation
functions to the known correlation functions of the puncture operator
$P$.

Let $M_{g,s}$ be the moduli space of a Riemann surface with genus $g$ and $s$
punctures. The dimension of moduli space is $6g - 6 + 2s$. For every $\sigma_{d_i}$, we
can associate a $2d_i$-form defined in the moduli space $M_{g,s}$ as in Eq. (12.8.2).
Then, we have, as in the topological Yang-Mills theory,

$$\langle \sigma_{d_1} \sigma_{d_2} \cdots \sigma_{d_s} \rangle = \int_{M_{g,s}} \lambda^{(1)} \wedge \lambda^{(2)} \wedge \cdots \wedge \lambda^{(s)}, \tag{12.8.3}$$





where twice the sum of the $d_i$ must equal the dimension of moduli space

$$2\sum_i d_i = 6g - 6 + 2s. \tag{12.8.4}$$

Fortunately, because these theories are purely topological and hence independent
of the location of the operators $\sigma_n$, the correlation functions must be
topologically defined. Thus, the evaluation of these correlation functions can
be accomplished by using counting arguments.

Using purely topological arguments, we see that the correlation function is
a function of the integration region of each $\lambda^{(i)}$, which we will call $H_{(i)}$. These
$H_{(i)}$ can be taken to be cycles defined on moduli space. Since the correlation
function is purely topological, we can take these $H_{(i)}$ to be homology cycles.

Fortunately, it is possible to define a simple topological invariant out of
these homology cycles. The intersection of these cycles is a number, which is
topologically defined. Thus, we find that

$$\langle \sigma_{d_1} \sigma_{d_2} \cdots \sigma_{d_n} \rangle = \#\big(H_{(1)} \cap H_{(2)} \cap \cdots \cap H_{(n)}\big) \prod_i d_i!, \tag{12.8.5}$$

where we have normalized each $\lambda^{(i)}$ to be a $2d_i$-form times $d_i!$.
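The counting condition of Eq. (12.8.4) is easy to test for a proposed correlation function. The sketch below uses hypothetical helper names (`moduli_space_dim`, `satisfies_counting`) and takes as its only input the dimension $6g - 6 + 2s$ quoted in the text.

```python
def moduli_space_dim(g: int, s: int) -> int:
    """Real dimension 6g - 6 + 2s of the moduli space M_{g,s}."""
    return 6 * g - 6 + 2 * s


def satisfies_counting(d, g: int) -> bool:
    """Check Eq. (12.8.4): twice the sum of the d_i equals dim M_{g,s}, with s = len(d)."""
    return 2 * sum(d) == moduli_space_dim(g, len(d))


print(satisfies_counting([0, 0, 0], g=0))     # True: three punctures on the sphere
print(satisfies_counting([1], g=1))           # True: one sigma_1 on the torus
print(satisfies_counting([2, 0, 0], g=0))     # False: the degrees do not match
```

Only correlation functions that pass this test can be nonzero, which is why the recursion lowers one $d_i$ at a time while removing two moduli.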

Our goal is to calculate the correlation functions of topological gravity and
compare them to the correlation functions found in matrix models. We must,
therefore, find a way of reducing the $d_i$ appearing in the correlation functions.
The key to constructing recursion relations for matrix models is to find the
topologically defined operator for $\lambda^{(i)}$ appearing in the correlation function. It
can be shown that the correct expression for this is given by

$$\lambda^{(i)} = d_i!\; c_1(L_{(i)})^{d_i}, \tag{12.8.6}$$

where $c_1$ is the first Chern class defined for the line bundle $L_{(i)}$ defined on
the moduli space $M_{g,s}$ (or actually the moduli space of stable curves, which is
obtained by compactifying $M_{g,s}$ by adjoining curves with double points).

Our strategy is to reduce all the $d_i$ appearing in the correlation function until
we are left with the correlation function of products of $\sigma_0$, which we identify
as the puncture operator. The way to do this is to split off one of the $c_1$:

$$c_1(L_{(i)})^{d_i} = c_1(L_{(i)}) \wedge c_1(L_{(i)})^{d_i - 1}, \tag{12.8.7}$$

and then perform the complex integration over the two variables appearing in
$L_{(i)}$. In this way, we eliminate two moduli and reduce $d_i$ by one.

Let us consider the case of a genus 0 surface. Then, the integral over the
two moduli appearing in $c_1$ is defined with certain poles and zeros. We must
be careful to examine the case when the points $z_i$ approach each other.

Let us pick three points on the Riemann surface $\Sigma$, labeled $z_1$, $z_{s-1}$, and
$z_s$, and perform the integral over the moduli associated with $L_{(1)}$. Let $S$ be the
set of all other points. In general, we find that nodes appear in the integrand,
resulting in the Riemann surface $\Sigma$ splitting into two disjoint pieces, $\Sigma_1$ and
$\Sigma_2$.





The only case of interest is when $z_1$ appears on $\Sigma_1$ and $z_{s-1}$ and $z_s$ appear
on the other (the other possibilities do not contribute to the integral). In this
case, the points appearing in the set $S$ can be distributed over $\Sigma_1$ or $\Sigma_2$. In fact,
we have to sum over all possible distributions of the set of points $S$ into the set
$X$ on $\Sigma_1$ and the set $Y$ on $\Sigma_2$. By performing the integration over two moduli,
we find the recursion relation (for genus 0 only)

$$\langle \sigma_{d_1} \sigma_{d_2} \cdots \sigma_{d_n} \rangle = d_1 \sum_{S = X \cup Y} \Big\langle \sigma_{d_1 - 1} \prod_{j \in X} \sigma_{d_j}\, P \Big\rangle \Big\langle P \prod_{k \in Y} \sigma_{d_k}\, \sigma_{d_{s-1}} \sigma_{d_s} \Big\rangle, \tag{12.8.8}$$

where we have summed over all possible ways in which the points in $S$ can be
distributed over $\Sigma_1$ and $\Sigma_2$, and where we have inserted the puncture operator
$P$ at the node separating the Riemann surfaces $\Sigma_1$ and $\Sigma_2$. (The puncture
operator does nothing but give us a marked point on the Riemann surface.)

Notice that the number of moduli matches. The correlation function on the
left-hand side satisfies

$$2\sum_{i=1}^{n} d_i = 2n - 6, \tag{12.8.9}$$

since the genus equals zero. On the right-hand side, we have integrated over
two moduli (which produced the node that split the Riemann surface $\Sigma$ into
two pieces). These two correlation functions satisfy

$$\begin{aligned}
2(d_1 - 1) + 2\sum_{i \in X} d_i &= 2n_1 - 6 + 2, \\
2d_{s-1} + 2d_s + 2\sum_{i \in Y} d_i &= 2n_2 - 6 + 2,
\end{aligned} \tag{12.8.10}$$

where $n_1$ and $n_2$ are the number of $z_i$ on each Riemann surface. By adding
these two equations, we arrive at Eq. (12.8.9), as expected. The presence of
the puncture operator $P$, which does nothing but give us a marked point at the
node separating the two Riemann surfaces, was essential to get the counting
of moduli correct.
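The bookkeeping behind this statement can be spelled out in a few lines. The sketch below is illustrative only: the function names are hypothetical, $n_1$ counts $z_1$ plus the points of $X$, $n_2$ counts $z_{s-1}$, $z_s$ plus the points of $Y$, and the extra $+2$ on each right-hand side accounts for the puncture $P$ at the node.

```python
from itertools import product


def moduli_counts(d1, d_X, d_Y, d_sm1, d_s):
    """Return (lhs, rhs) pairs for the two factors of Eq. (12.8.10) and for Eq. (12.8.9)."""
    n1 = 1 + len(d_X)  # z_1 and the points of X sit on the first sphere
    n2 = 2 + len(d_Y)  # z_{s-1}, z_s and the points of Y sit on the second sphere
    left = (2 * (d1 - 1) + 2 * sum(d_X), 2 * n1 - 6 + 2)
    right = (2 * d_sm1 + 2 * d_s + 2 * sum(d_Y), 2 * n2 - 6 + 2)
    total = (2 * (d1 + sum(d_X) + sum(d_Y) + d_sm1 + d_s), 2 * (n1 + n2) - 6)
    return left, right, total


def check_small_cases(max_d=2, max_extra=2):
    """Whenever both factors balance, the full correlator must balance too."""
    for nx, ny in product(range(max_extra + 1), repeat=2):
        for ds in product(range(max_d + 1), repeat=3 + nx + ny):
            d1, d_sm1, d_s = ds[:3]
            d_X, d_Y = list(ds[3:3 + nx]), list(ds[3 + nx:])
            left, right, total = moduli_counts(d1, d_X, d_Y, d_sm1, d_s)
            if left[0] == left[1] and right[0] == right[1] and total[0] != total[1]:
                return False
    return True


print(moduli_counts(1, [0], [], 0, 0))  # ((0, 0), (0, 0), (2, 2))
print(check_small_cases())              # True
```

The example splits $\langle \sigma_1 P P P \rangle$ into $\langle P P P \rangle \langle P P P \rangle$; without the $+2$ contributed by the node, the two sides would not balance.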

Let us use this recursion relation in order to calculate the matrix elements
for the various products of operators. We begin by constructing the generating
function for correlation functions. We assume that the action, which is zero,
is supplemented by $\epsilon P$, where $P$ is the puncture operator. (An analogous
assumption was made for the matrix model case.) Let us make scaling arguments
to assume that the matrix elements are powers of $\epsilon$. Then

$$\langle \sigma_n \rangle = a_n\, \epsilon^{b_n} \tag{12.8.11}$$

for some $a_n$ and $b_n$, which are as yet undetermined.

We know that taking derivatives of this expression with respect to $\epsilon$ pulls down a $P$
operator, that is,

$$\begin{aligned}
\langle \sigma_n P \rangle &= \frac{\partial}{\partial\epsilon} \langle \sigma_n \rangle = a_n b_n\, \epsilon^{b_n - 1}, \\
\langle \sigma_n P P \rangle &= \frac{\partial^2}{\partial\epsilon^2} \langle \sigma_n \rangle = a_n b_n (b_n - 1)\, \epsilon^{b_n - 2}.
\end{aligned} \tag{12.8.12}$$





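Now, let us insert these values into the recursion relations, which, for this
simple case, read

$$\begin{aligned}
\langle \sigma_n P P \rangle &= n\, \langle \sigma_{n-1} P \rangle \langle P P P \rangle, \\
\langle \sigma_n \sigma_m P \rangle &= n\, \langle \sigma_{n-1} P \rangle \langle P \sigma_m P \rangle, \\
\langle \sigma_n \sigma_m \sigma_p \rangle &= n\, \langle \sigma_{n-1} P \rangle \langle P \sigma_m \sigma_p \rangle,
\end{aligned} \tag{12.8.13}$$

where we have normalized $\langle P P P \rangle = (1/k)\, \epsilon^{(1/k) - 1}$.

Plugging in the values for the various correlation functions, we find a simple
recursion relation in the $a_n$ and $b_n$, which allows us to calculate all of them.
We find

$$a_n = \frac{1}{(n+1)\,[1 + (n+1)/k]}, \qquad b_n = 1 + \frac{n+1}{k}. \tag{12.8.14}$$

This then allows us to calculate the following expectation values

$$\begin{aligned}
\langle \sigma_n \rangle &= \frac{\epsilon^{1 + (n+1)/k}}{(n+1)\,[1 + (n+1)/k]}, \\
\langle \sigma_n \sigma_m \rangle &= \frac{\epsilon^{(n+m+1)/k}}{n + m + 1}, \\
\langle \sigma_n \sigma_m \sigma_p \rangle &= \frac{1}{k}\, \epsilon^{(n+m+p+1)/k - 1}.
\end{aligned} \tag{12.8.15}$$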

These values (for the sphere) are precisely what we found for the matrix model 
in Eq. (11.5.27), showing that topological gravity, in some sense, is identical to 
ordinary two-dimensional gravity, at least for low genus. Repeating the same 
arguments for the arbitrary case, we find 

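$$\langle \sigma_{d_1} \cdots \sigma_{d_s} \rangle = \frac{1}{k} \Big(\frac{\partial}{\partial\epsilon}\Big)^{s-3} \epsilon^{\left(\sum_i d_i + 1\right)/k - 1}, \tag{12.8.16}$$

which reproduces the result of the matrix models.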
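The sphere results can be checked against the recursion relations of Eq. (12.8.13) with exact rational arithmetic. The sketch below is an illustration, not the author's calculation: each correlator is stored as a pair (coefficient, exponent) standing for $a\,\epsilon^{b}$, the helper names are hypothetical, and the closed forms assumed are $\langle\sigma_n\rangle = \epsilon^{1+(n+1)/k}/\{(n+1)[1+(n+1)/k]\}$, $\langle\sigma_n\sigma_m\rangle = \epsilon^{(n+m+1)/k}/(n+m+1)$, and $\langle\sigma_n\sigma_m\sigma_p\rangle = (1/k)\,\epsilon^{(n+m+p+1)/k-1}$, with $P = \sigma_0$.

```python
from fractions import Fraction


def one_point(n, k):
    """<sigma_n> as (a_n, b_n), using the assumed closed form."""
    b = 1 + Fraction(n + 1, k)
    return (1 / ((n + 1) * b), b)


def two_point(n, m, k):
    return (Fraction(1, n + m + 1), Fraction(n + m + 1, k))


def three_point(n, m, p, k):
    return (Fraction(1, k), Fraction(n + m + p + 1, k) - 1)


def d_eps(term):
    """d/d(epsilon) of a * epsilon**b, i.e. insertion of one puncture operator P."""
    a, b = term
    return (a * b, b - 1)


def times(t1, t2):
    return (t1[0] * t2[0], t1[1] + t2[1])


def scale(c, term):
    return (c * term[0], term[1])


def check_recursions(n, m, p, k):
    """Verify the three relations of Eq. (12.8.13) for n >= 1."""
    s_nm1_P = d_eps(one_point(n - 1, k))  # <sigma_{n-1} P>
    ok1 = d_eps(d_eps(one_point(n, k))) == scale(n, times(s_nm1_P, three_point(0, 0, 0, k)))
    ok2 = three_point(n, m, 0, k) == scale(n, times(s_nm1_P, three_point(0, m, 0, k)))
    ok3 = three_point(n, m, p, k) == scale(n, times(s_nm1_P, three_point(0, m, p, k)))
    # Inserting P by differentiation must agree with setting an index to zero.
    consistent = (d_eps(one_point(n, k)) == two_point(n, 0, k)
                  and d_eps(two_point(n, m, k)) == three_point(n, m, 0, k))
    return ok1 and ok2 and ok3 and consistent


print(all(check_recursions(n, m, p, k)
          for n in range(1, 6) for m in range(4) for p in range(4) for k in range(1, 5)))
```

Storing only the coefficient and the exponent keeps the check exact, so no symbolic algebra package is needed.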


12.9 Virasoro Constraint, W-Algebras, and 
KP Hierarchies 

We still, however, have not used the full power of this formalism, which is 
capable of deriving the complete set of constraints satisfied by the generating 
functional to all orders in perturbation theory. These constraints, in turn, are 
equivalent to the string equation and the recursion relations found in the KdV 
formalism. We thus obtain a way to conveniently reformulate all the constraints 
found in matrix models in a more familiar language. 





At first, this may seem strange because the Green’s functions of topological
gravity appear to be zero. This is because the physical operators $\sigma_n$ are all BRST
trivial, and hence the Green’s functions should all be zero, and the S matrix
vanishes. However, a careful analysis of the Green’s functions shows that there
is indeed a source of nontrivial contributions, and this comes from whenever $\sigma_n$
approaches $\sigma_m$. In other words, the entire contribution to the Green’s functions
comes from contact terms. This vastly simplifies the calculation of $N$-point
functions, since all we have to calculate are the contributions of contact terms,
which can be isolated using operator identities.

For example, consider what happens when $\sigma_n$ approaches $\sigma_m$. By explicit
calculation, using the conformal field theory identities we have written down,
we can show the following identity which isolates the contact terms


$$\int_{D_\epsilon} \sigma_n\, |\sigma_m\rangle = \tfrac{1}{3}(2m + 1)\, |\sigma_{n+m-1}\rangle, \tag{12.9.1}$$

where $D_\epsilon$ is a propagator representing an infinitesimal neighborhood separating
the two operators. (This identity can be proven by inserting the propagator
between these two operators and then moving it to the right, where it annihilates
on the vacuum.)

In the same way, by carefully isolating contact terms, we can construct the
entire set of identities satisfied by $N$-point Green’s functions. The calculation
is rather intricate, so we will just present the final result of this calculation. Let
$S$ be a collection of fields $\sigma_m$. This set can be broken up into two smaller sets,
$X$ and $Y$. Then the general recursion relation on genus $g$ Green’s functions is
given by [21]:


meS 


(a n+l J”[ ojj = x{o„ J~[ ojj 

meS ^ 

+ y '(2j + l)l<rj+n H Cf m \ + {( <7 ;-l Cr n-J n ° m ) 

jeS m^j s j —1 meS s 

+ '2 e < i29 - 2 > 

^ <r=Yuy S 1 m <zY s 2 


s=xuy m( zx 

g=8l+82 


where $x$ is the cosmological constant, and we have divided up the Riemann
surface of genus $g$ into two smaller Riemann surfaces of genus $g_1$ and $g_2$.

This equation, although formidable looking, can be broken down into simpler
components. The essence of this identity is that the major contribution to
the Green’s function comes when $\sigma_{n+1}$ approaches the other $\sigma_j$. We saw earlier
in Eq. (12.9.1) that the effect of this contact term is to generate $(2j + 1)\sigma_{j+n}$.
That is the contribution found in the first term on the second line of the equation.
The last two terms in the equation must be added because the $N$-point
function may develop nodes (e.g., the Riemann surface may fission into two
pieces with genus $g_1$ and $g_2$) and we must insert operators at the nodes.





This identity, although it summarizes all the information contained within
the Green’s functions, is still unwieldy. It simplifies enormously, however,
if we reexpress this in terms of operators. Let the generating functional be
represented as $Z(t_0, t_1, \ldots)$, where the $t_i$ are the sources for the $\sigma_i$. Taking a
derivative with respect to $t_i$ simply pulls down the operator $\sigma_i$ into the functional
integration. Let us now rewrite Eqs. (12.9.1) and (12.9.2) in terms of operators
acting on $Z(t_0, t_1, \ldots)$. Let us define $L_n$ as the operator which pulls down $\sigma_{n+1}$.

Now apply this operator $L_n$ on the generating function twice. Then the
essence of Eq. (12.9.1) is that the commutator of two $L$’s acting on the generating
functional yields the algebra $[L_n, L_m] = (n - m)L_{n+m}$, which
resembles the Virasoro algebra! Encouraged by this result, we suspect that
the complete algebra contained within Eq. (12.9.2) is precisely the Virasoro
algebra. It is gratifying to note that we can now summarize the entire content
of Eq. (12.9.2), which in turn contains all the constraints of the matrix models,
in one equation


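$$L_n \tau = 0, \qquad n \ge -1, \tag{12.9.3}$$

where $\tau$ is the square root of the generating function

$$Z(t_0, t_1, \ldots) = \tau^2(t_0, t_1, \ldots) \tag{12.9.4}$$

and the explicit expression for the $L_n$’s is given by

$$\begin{aligned}
L_{-1} &= \sum_{m=1}^{\infty} \big(m + \tfrac{1}{2}\big)\, t_m \frac{\partial}{\partial t_{m-1}} + \frac{1}{8\lambda^2}\, t_0^2, \\
L_0 &= \sum_{m=0}^{\infty} \big(m + \tfrac{1}{2}\big)\, t_m \frac{\partial}{\partial t_m} + \frac{1}{16}, \\
L_n &= \sum_{m=0}^{\infty} \big(m + \tfrac{1}{2}\big)\, t_m \frac{\partial}{\partial t_{m+n}} + \frac{1}{2}\lambda^2 \sum_{m=1}^{n} \frac{\partial^2}{\partial t_{m-1}\, \partial t_{n-m}},
\end{aligned} \tag{12.9.5}$$

where $\lambda$ is the string coupling constant.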

If we let the operator expression for $L_n$ operate on $\tau$, we obtain the constraints
(12.9.2). Remarkably, the entire content of the recursion relations can now be
expressed compactly in one equation!

Furthermore, we see that the $L_n$ operators, which pull down $\sigma_{n+1}$ into the
generating function, satisfy precisely the algebra of the usual Virasoro operators.
However, these are not the ordinary Virasoro operators; instead of acting
on the space of parametrizations of the world sheet of the string, these operators
act on the space of physical operators appearing within the generating
functional. Although the algebra is the same, the physical content appears to
be entirely different.
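One way to see that the constant terms in these operators are fixed by the algebra is to compute $[L_1, L_{-1}]$, which must equal $2L_0$. The worked check below is a sketch that assumes the operator forms written in its first three lines (with the shorthand $\partial_m \equiv \partial/\partial t_m$, which is not notation from the text); it is an illustration rather than a derivation of the constraints themselves.

```latex
\begin{aligned}
L_{1}  &= \sum_{m\ge 0}\bigl(m+\tfrac12\bigr)\,t_m\,\partial_{m+1} + \tfrac12\lambda^2\,\partial_0^{\,2}, \\
L_{-1} &= \sum_{m\ge 1}\bigl(m+\tfrac12\bigr)\,t_m\,\partial_{m-1} + \frac{1}{8\lambda^2}\,t_0^{\,2}, \\
L_{0}  &= \sum_{m\ge 0}\bigl(m+\tfrac12\bigr)\,t_m\,\partial_{m} + \tfrac{1}{16}, \\[6pt]
\Bigl[\sum_{a\ge 0}\bigl(a+\tfrac12\bigr)t_a\partial_{a+1},\ \sum_{b\ge 1}\bigl(b+\tfrac12\bigr)t_b\partial_{b-1}\Bigr]
  &= \sum_{m\ge 0}\bigl(m+\tfrac12\bigr)\bigl(m+\tfrac32\bigr)t_m\partial_m
   - \sum_{m\ge 1}\bigl(m+\tfrac12\bigr)\bigl(m-\tfrac12\bigr)t_m\partial_m \\
  &= 2\sum_{m\ge 0}\bigl(m+\tfrac12\bigr)t_m\partial_m - \tfrac14\,t_0\partial_0, \\[6pt]
\Bigl[\tfrac12\lambda^2\partial_0^{\,2},\ \frac{1}{8\lambda^2}t_0^{\,2}\Bigr]
  &= \tfrac{1}{16}\bigl(4\,t_0\partial_0 + 2\bigr) = \tfrac14\,t_0\partial_0 + \tfrac18, \\[6pt]
[L_1, L_{-1}]
  &= 2\sum_{m\ge 0}\bigl(m+\tfrac12\bigr)t_m\partial_m + \tfrac18 \;=\; 2L_0 .
\end{aligned}
```

The two mixed commutators vanish because $L_1$ contains no $\partial_0$ in its first-order part and $L_{-1}$ contains no $t_0$ in its first-order part. The $t_0\partial_0$ terms cancel, and the leftover constant $\tfrac18$ is exactly twice the $\tfrac{1}{16}$ in $L_0$.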

We stress that these recursion relations for the topological field theory 
are identical to the recursion relations found in matrix models using the 
Schwinger-Dyson equations [21-22]. Thus, the equivalence of the matrix 
model to the topological field theory approach is established to all orders in 





perturbation theory. Both approaches (the contact algebra of topological field 
theory and the Schwinger-Dyson equations of matrix models) yield the same 
Virasoro conditions. 

So far, we have only treated pure topological gravity, which has only one 
primary field, given by the puncture operator P. Encouraged by our surprising 
success in reformulating the recursion relations of matrix models in terms of a 
Virasoro constraint, we are led to examine whether the recursion relations for 
the higher matrix models can also be reproduced in this way. 

We can generalize the Virasoro constraint by coupling topological gravity 
to topological minimal models [23]. The advantage is that we can expand the 
number of primary fields and hence derive a larger set of identities satisfied by 
the generating function. 

Let us call the primary fields of the topological minimal models $V_k$; the
expanded set of physical operators is then given by the product of $V_k$ with the old
physical operators $\sigma_n$:

$$\sigma_{n,k} = V_k\, \gamma_0^{\,n}\, P, \tag{12.9.6}$$

where $P$ is the puncture operator.

As in the purely gravitational case, we find that the Green’s functions are
all zero, because the physical operators are all BRST trivial, except for the
possibility of contact terms. By explicit calculation, we find that the contact
algebra is given by

$$\int_{D_\epsilon} \sigma_{m,k}\, |\sigma_n\rangle - \int_{D_\epsilon} \sigma_n\, |\sigma_{m,k}\rangle = \frac{h(kn - m)}{h + 1}\, |\sigma_{m+n-1,k}\rangle. \tag{12.9.7}$$

The important point is to notice that the right-hand side contains the factor
$(kn - m)$. This means that the algebra generated by this extended theory is
not the usual Virasoro algebra. This extended algebra must therefore be an
extension of the Virasoro algebra. If the operator $L_n$ pulls down the
term $\sigma_{n+1}$ and a new operator $W_m^{(k+1)}$ pulls down the term $\sigma_{m+1,k}$, then the
commutator between these two operators must include

$$[L_n, W_m^{(k+1)}] = (kn - m)\, W_{n+m}^{(k+1)} + \cdots. \tag{12.9.8}$$

The only algebra of this type which includes the Virasoro algebra as a subset
is the W-algebra of Zamolodchikov [24].
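The structure constant $(kn - m)$ is what one expects for the modes of a field of conformal spin $k + 1$, and it is compatible with the Virasoro commutator $[L_a, L_b] = (a - b)L_{a+b}$. The sketch below checks the Jacobi identity for two $L$'s and one $W$ numerically; the function names are hypothetical, and the central term of the Virasoro algebra is omitted because a c-number commutes with $W_m$.

```python
def lw_coeff(k, n, m):
    """Coefficient in [L_n, W_m] = (k*n - m) W_{n+m}, as in Eq. (12.9.8)."""
    return k * n - m


def jacobi_defect(k, a, b, m):
    """[[L_a, L_b], W_m] - ([L_a, [L_b, W_m]] - [L_b, [L_a, W_m]]) as a coefficient of W_{a+b+m}."""
    lhs = (a - b) * lw_coeff(k, a + b, m)
    rhs = (lw_coeff(k, b, m) * lw_coeff(k, a, b + m)
           - lw_coeff(k, a, m) * lw_coeff(k, b, a + m))
    return lhs - rhs


print(all(jacobi_defect(k, a, b, m) == 0
          for k in range(1, 5)
          for a in range(-3, 4)
          for b in range(-3, 4)
          for m in range(-3, 4)))  # True
```

Setting $k = 1$ recovers the Virasoro structure constants $(n - m)$, and $k = 2$ gives the spin-3 case $(2n - m)$.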

Arguing from purely theoretical grounds, Zamolodchikov investigated generalizations
of the Virasoro algebra with operators of various conformal spins.
He found that the addition of a conformal spin-3 operator generated a larger algebra
than the usual Virasoro algebra. If we define $W_n^{(2)} = L_n$ and $W_n^{(3)} = W_n$,
then the algebra becomes

$$\begin{aligned}
[L_m, L_n] &= (m - n)\, L_{m+n} + \frac{c}{12}\, m(m^2 - 1)\, \delta_{m+n,0}, \\
[L_m, W_n] &= (2m - n)\, W_{m+n},
\end{aligned}$$

together with the commutator $[W_m, W_n]$, which contains terms that are bilinear in
the Virasoro generators. This means that the algebra is not a standard Lie algebra.
However, one can show that the Jacobi identities are still satisfied.

Although the full theory has yet to be worked out in detail,
we conjecture that the full set of constraints for the $(p - 1)$-matrix model is
given by:

$$W_n^{(k)}\, \tau = 0, \qquad 2 \le k \le p, \quad n \ge -k + 1. \tag{12.9.11}$$

For the case $p = 2$, the one-matrix model, this reduces to the ordinary
Virasoro condition found earlier. However, for higher matrix models, we find
a series of nontrivial constraints. Since the Jacobi identities are satisfied by the
$W^{(k)}$, we are guaranteed to have a set of self-consistent equations. Similarly, for
the $(p, q)$-matrix model, it is also possible to use the generalized W-algebras,
which can reproduce the constraints for these models as well.

We can also take this formalism one step further. In the same way that
we found the KdV hierarchy emerging from the one-matrix model, these
higher identities should emerge as a generalization of the KdV hierarchy. This
generalization is called the Kadomtsev-Petviashvili (KP) hierarchy.

To see how the higher W-algebra constraints emerge from the KP hierarchy,
let us first define the pseudodifferential operator

$$L = \partial + u_2(t_i)\,\partial^{-1} + u_3(t_i)\,\partial^{-2} + \cdots, \tag{12.9.12}$$

where the $u$’s are functions of the KP times $t_i$ (which will be linearly related to the
$t_i$ discussed earlier), $\partial$ is $\partial/\partial t_0$, and $(L^n)_+$ represents taking the positive
differential part of $L^n$.

To constrain the $u$ functions, we impose

$$\frac{\partial}{\partial t_n} L = [(L^n)_+, L], \qquad n = 1, 2, 3, \ldots. \tag{12.9.13}$$

Once this constraint is imposed, the remarkable feature of this formalism is
that the entire system can be rewritten in terms of a new function, called the
Hirota $\tau$ function, which satisfies


d 2 

dt 0 3t n 


ln r = (L” +1 


)-i. 


d 2 

dti dt n 


lnz= 2(L n+1 



(12.9.14) 





and so on. 

Lastly, to reduce this KP hierarchy to simpler hierarchies, we will impose
one additional constraint. The $p$-reduction of the KP hierarchy is defined
by stating that the $u$’s have no dependence on $t_{p-1}, t_{2p-1}, t_{3p-1}, \ldots$, and

$$(L^p)_- = 0, \tag{12.9.15}$$

where $(L^p)_-$ stands for taking the part negative in $\partial$. The 2-reduction of the KP
hierarchy is called the KdV hierarchy, encountered earlier for the one-matrix
model, and the 3-reduction is called the Boussinesq hierarchy, which describes
the two-matrix model. In this way, we can incorporate the $p$-matrix model as
part of a KP hierarchy.

The last step is to compare the constraint $W_n^{(k)} \tau = 0$ with the constraints
coming from the KP hierarchy. By carefully expanding both constraints in
terms of $t_i$, we find that the $\tau$ function appearing in the W-algebra constraint
is precisely the Hirota $\tau$ function, and that the two sets of $t_i$ are linearly related
to each other.

Example: One-Matrix Model 

To see how this happens, it is useful to take a specific example, the one-matrix
model. There is a conjecture that the Virasoro constraint $L_n \tau = 0$ for $n \geq -1$
is equivalent to the constraint $L_{-1} \tau = 0$ with the added condition that $\tau$ satisfy
the KP hierarchy.

Assuming that this is true, let us take the constraint $L_{-1} \tau = 0$ and apply $\partial$
to it. Then this constraint reduces to the following:

( oo a 2 \ 

+ I> + \ytmT-r— ) r = 0. (12.9.16) 

m=\ 9r 0 dt m~l } 

Now assume that $\tau$ satisfies the 2-reduced KP hierarchy. This means that
$(L^2)_- = 0$, which in turn can be shown to lead to

$$(L^{2k-1})_{-1} = 2R_k(-2u_2), \qquad k \geq 1, \tag{12.9.17}$$

where $R_k$ are the familiar Gelfand-Dikii polynomials.

Because $\tau$ satisfies the KP hierarchy, it also satisfies $\partial^2/\partial t_0 \partial t_m \ln\tau =
(L^{m+1})_{-1}$. Now substitute this expression into (12.9.16), and we are finally
left with

\*r\ + E“=i( 2m + !)*»*» - 0 (12.9.18) 

which is the string equation found earlier using matrix models. 

In summary, it is quite remarkable that so much nontrivial, nonperturbative 
information can be so compactly represented in terms of a W -algebra constraint 
on the generating functional. A vast amount of information, summarizing 
the complex interactions over Riemann surfaces of all genus, is succinctly 
compressed into these $W$-algebra constraints.

Lastly, we remark that these constraints, in turn, look suspiciously like Ward 
identities on a string field theory. In fact, it is possible to reexpress topological
field theory in second quantized language and use the gauge invariance of the
theory to write the Ward identities for the theory. When this is done, we find 
that we reproduce precisely the W-algebra constraints [25]. In other words, the 
fact that the W -algebra constraints are all self-consistent is due to the fact that 
the Ward identities are manifestations of the gauge invariance of the theory. 
Since the theory is invariant under multiple gauge transformations, we find 
that the Ward identities must also be self-consistent among each other, which 
is guaranteed by the Jacobi identities satisfied by the W algebra. 

Thus, the remarkable self-consistency of the tower of W -algebra constraints 
can now be seen to be the self-consistency of the gauge transformations of string 
field theory. 


12.10 Summary 

High-energy and high-temperature behavior of string amplitudes demonstrates 
the possibility of a phase transition and the restoration of vast symmetries. 
These new symmetries may indicate the reemergence of general covariance 
as an exact symmetry of the system. Usually, we break general covariance in 
the quantization scheme by power expanding the metric around some classical 
solution. However, if we were to quantize the theory without making such an 
unnatural split, then we might see the “topological phase” of the underlying 
theory. Thus, Witten’s topological field theory is an attempt to construct the 
“unbroken phase” of string theory, where general covariance is unbroken. 

Two types of topological field theories exist. The first type involves covariant 
theories without a metric tensor, such as the Chern-Simons gauge theory or
the 2 + 1 gravity theory, which can be written as

R% = dja£ + <#c$-U*»k). 

The second type of topological field theory involves the cohomological 
theories. In general, they have an explicit dependence on the metric tensor, 
but the final correlation functions are independent of the geometry of space- 
time. Thus, their correlation functions must be topological invariants. This 
means that we can use cohomological topological field theories to generate 
analytic expressions for the various topological invariants that have been recently
written in three and four dimensions. Quantum field theory is then being
used to answer difficult questions in pure mathematics, such as Morse theory, 
Floer complexes, and Donaldson polynomials. 

In the cohomological theories, the action is zero, 

$$L = 0, \tag{12.10.2}$$

or a topological invariant, such as

$$L \sim F \wedge F. \tag{12.10.3}$$





In the cohomological theory, the content of the theory lies in the field variations, 
which give us a Faddeev-Popov ghost contribution and gauge fixing part. The 
important fact is that the total gauge fixed action is a BRST commutator 

$$L_{\rm GF+FP} = \{Q, V\}. \tag{12.10.4}$$

More important, the energy-momentum tensor is BRST invariant 

$$T_{\alpha\beta} = \{Q, V_{\alpha\beta}\}, \tag{12.10.5}$$

which means that, even if the background metric occurs after gauge fixing, 
the correlation functions are independent of the choice of metric. Since the 
background metric arises as a by-product of BRST quantization and since the 
physics should not be altered by the details of BRST quantization, the final 
theory should be background independent. 

For example, in the four-dimensional Yang-Mills case, the symmetry of the 
action is large enough to completely eliminate the connection field. We will 
fix the gauge by demanding that the curvature be self-dual 

$$\text{gauge choice:} \quad F^a_{\mu\nu} - \tfrac{1}{2}\epsilon_{\mu\nu\alpha\beta} F^{\alpha\beta a} = 0. \tag{12.10.6}$$

Normally, this is enough to make the theory empty, since the solutions of this 
gauge condition include zero. However, although this gauge is strong enough to 
eliminate the infinite degrees of freedom of the connection field, it is not strong 
enough to eliminate the finite degrees of freedom. As is well known, instantons 
are finite-dimensional solutions of this constraint. Thus, it is not surprising that 
the moduli space of instantons should play a key role in topological Yang- 
Mills theory. This is indeed the case, because the correlation functions will 
be topological invariants defined on four-dimensional manifolds; for example, 
we will find an analytic derivation of the celebrated Donaldson polynomials. 

The gauge fixed action, in the BRST formalism, becomes a BRST 
commutator 

^GF+FP = &l{x a ^[FaP + F a p — B a p]}, 

where 

ms = 

Ma = 0, 

= B aPa , 

= 0 . 

The quantization is not yet over, because the theory possesses a hidden 
symmetry 


(12.10.7) 


( 12 . 10 . 8 ) 


SoK = i{D a <f>Y, 
SaB afia = X a0 T- 


(12.10.9) 


Since the ghosts themselves have a gauge degree of freedom, we must introduce
“ghosts-for-ghosts” to completely eliminate all local invariances. Once
this remaining symmetry is gauge fixed, we can obtain the final action for
topological Yang-Mills theory.

Alternatively, we could have used the quantization method of Batalin-Vilkovisky,
or we could have observed that the $N = 2$ supersymmetric
Yang-Mills theory has a field content almost identical to the topological version.
By “twisting” the $N = 2$ supersymmetric Yang-Mills theory, we can
convert one of the supersymmetry generators $Q^i_\alpha$ into a genuine nilpotent
Lorentz scalar, which we can then define to be $Q_{\rm BRST}$. In this way, we arrive
at the identical action.

To see this, observe that in N = 2 superconformal theories we have the 
supersymmetry generators: 


$$\{Q_{\alpha+}, Q_{\beta-}\} = \gamma^\mu_{\alpha\beta} P_\mu, \qquad \{Q_{\alpha+}, Q_{\beta+}\} = \{Q_{\alpha-}, Q_{\beta-}\} = 0. \tag{12.10.10}$$


In two dimensions, a spinor has only two components, also labeled ±, so that 
we have four nilpotent supercharges Q±±. The key step is that we will now 
modify the theory so that the energy-momentum tensor becomes 


$$T'_{\mu\nu} = T_{\mu\nu} + \epsilon_{\mu\alpha}\partial^\alpha R_\nu + \epsilon_{\nu\alpha}\partial^\alpha R_\mu, \tag{12.10.11}$$

where $R_\mu$ is the current associated with $R$ symmetry. Then, because the structure
of the Lorentz group has been altered, we can extract a genuine Lorentz
scalar out of the supersymmetry charges and call it the BRST charge

$$Q_{\rm BRST} = Q_{-+} + Q_{+-}. \tag{12.10.12}$$


To find the correlation functions of the theory (which will generate the
Donaldson polynomials), we observe that there is a BRST invariant scalar $\phi$
in the Yang-Mills theory. Defining $W_0 = \frac{1}{2} {\rm Tr}\, \phi^2$, we can develop a chain of
BRST invariant operators

$$0 = i\{Q, W_0\}, \qquad dW_0 = i\{Q, W_1\}. \tag{12.10.13}$$

Let us now extract from $W_1$ a new BRST invariant operator $W_2$, which is BRST
nontrivial, and so

$$dW_1 = i\{Q, W_2\}, \quad dW_2 = i\{Q, W_3\}, \quad dW_3 = i\{Q, W_4\}, \quad dW_4 = 0, \tag{12.10.14}$$

where

$$W_2 = {\rm Tr}\left(\tfrac{1}{2}\psi \wedge \psi + i\phi \wedge F\right), \quad W_3 = i\,{\rm Tr}(\psi \wedge F), \quad W_4 = {\rm Tr}(F \wedge F). \tag{12.10.15}$$





In this way, we can construct operators W n , which can be inserted into the 
correlation function to obtain topological invariants. 

A wide array of cohomological topological field theories exist, depending on 
whether topological invariants can be constructed for them. For the $\sigma$ model,
for example, the action can be taken to be 

$$\int_\Sigma d^2z\, \epsilon^{ab} J_{\mu\nu}\, \partial_a X^\mu\, \partial_b X^\nu = \int_\Sigma J, \tag{12.10.16}$$

where $\Sigma$ is the two-dimensional world sheet, and $T$ is space-time. We set

$$J = \tfrac{1}{2} J_{\mu\nu}\, dX^\mu \wedge dX^\nu. \tag{12.10.17}$$

The gauge fixing condition is that the derivative of X is self-dual 

$$\partial_a X^\mu + \epsilon_a{}^b J^\mu{}_\nu\, \partial_b X^\nu = 0. \tag{12.10.18}$$

The action is then obtained by straightforward gauge fixing 

Egf + fp - ^[p;(daX» + € b a j; d b X v - \HZ)\ (12.10.19) 

where 

Pl = 4Kp" ( 12 . 10 . 20 ) 

and is a self-dual, commuting ghost. The transformations of the fields (in 
flat world sheet space) are given by 


8 0 X» = i€ X \ 

SoX* = 0, 

SoPa = - ira K X v P K a), (12.10.21) 

W = -t[\x v x\K U : + RwJ^Jy)Pa - i^ K X v tia\ 

Here, H occurs quadratically in the action and it does not propagate. If we 
eliminate H via its equation of motion, we find 


/ = J d 2 z[\(g^d a X» d a X v + e^J^daX* d b X v ) 

- ip“ d a x» - \x K X x PlPavKl\ (12.10.22) 

Similarly, two-dimensional gravity can be made into a topological theory.
The gauge choice we will choose is to set the curvatures to zero:

$$S = \int (\pi_0\, d\omega + \pi_+ De^+ + \pi_- De^-) + \int (\chi_0\, d\psi^0 + \chi_+ D\psi^+ + \chi_- D\psi^-), \tag{12.10.23}$$

where $\pi$ and $\chi$ are Lagrange multipliers, which enforce the gauge constraints:

$$\begin{aligned} De^+ &= de^+ - \omega \wedge e^+, \\ De^- &= de^- + \omega \wedge e^-, \\ D\psi^+ &= d\psi^+ - \omega \wedge \psi^+ + e^+ \wedge \psi^0, \\ D\psi^- &= d\psi^- + \omega \wedge \psi^- - e^- \wedge \psi^0. \end{aligned} \tag{12.10.24}$$

To fix local Lorentz invariance, we decompose the zweibein 


$$e^+ = e^{\phi_+} dz, \qquad e^- = e^{\phi_-} d\bar z, \tag{12.10.25}$$

and set the gauge

$$\phi_+ = \phi_-. \tag{12.10.26}$$


The final action, for both the bosonic and fermion parts, reduces to 


S B = j nd d<p + j (bdc + b dc ) 

J x$W + fiPdy + Pdy)- 


(12.10.27) 


(12.10.28) 


The final action is the sum of S B and S F . 

We are now in a position to write the complete BRST operator for the theory. 
The energy-momentum tensor is equal to the sum of two pieces 


T L = dnd<p + dx + d X df, (12.10.29) 

T g h = c db -j- 2 dc b + y d/3 -T 2 dy f5, 

while the superconformal generator is also given by the sum of two pieces 

g l = dxd<f> + d 2 x, 3 

G gh = cdfi + 2dc f}. 

We also have the supersymmetry charge 


(12.10.30) 


Qs = (b(d7T f + by). 


(12.10.31) 


Putting everything together, the total BRST operator is now given by 

Sbrst = Qs + £ [ c (^l + Tp y + \Tbc) 4- yGx]. (12.10.32) 

We can form BRST invariant objects $\sigma_n$ by iterating the field $\gamma_0$, the ghost
associated with super-Lorentz transformations. The matrix elements of these
operators are

[a dl o d2 ■ ■ ■ <7 dri ) = #(//(!) (T H( 2 ) n • • • n H in) ) ]""[d,!, 


(12.10.33) 





where we have normalized each A(,) to be a 2d, form times d t ! and where //, 
are homology cycles on the moduli space of flat connections. 

Because the correlation function is defined topologically, we can, by reduc¬ 
ing the index n of the BRST invariant field, develop recursion relations for the 
matrix elements. The recursion relation is (for genus 0 only): 

(o dl a d2 ■ ■•o dn ) = d x U d ^iY[o dj P\{pY\o dk o ds ^a d \. (12.10.34) 

S=XUY ' jeX k€Y 


Last, using these recursion relations, we can completely calculate the matrix 
elements for the BRST operators: 


{*«) = 
( Gn^m ) — 
(cr n o r m o r p) = 


6 l+(n+l)/* 

(/I + l)[l+(/I+l)/*]’ 

g(n+m+l)/ k 

n + m + 1 ’ 

}_^(n+m+p+l)/k-\ 

k 


(12.10.35) 


Comparing these with the matrix elements found in the matrix models approach,
we find they are the same. Thus, it can be shown, by calculating
the recursion relation for correlation functions, that matrix models for two-dimensional
gravity and topological gravity are the same. In retrospect, this
may not be too surprising because both theories have finite degrees of freedom
and both theories are topological.

The equivalence between matrix models and topological field theories can
also be generalized to include all orders in perturbation theory. It is possible
to write down generalized recursion relations in topological field theory for
the generating functional $Z(t_0, t_1, \ldots)$. When rewritten in terms of $Z$, these
recursion relations for two-dimensional gravity assume the remarkable form

$$L_n \tau(t_0, t_1, \ldots) = 0, \qquad n \geq -1, \tag{12.10.36}$$

where $\tau = \sqrt{Z}$ and where the $L_n$, although they obey the same relations as the
usual Virasoro algebra, are defined in the source space $\{t_i\}$ rather than on the
world sheet. Thus, these new Virasoro operators have an entirely different
meaning than the usual ones.

When written for two-dimensional gravity coupled to topological conformal
matter, we find the constraints given by an extension of the Virasoro algebra,
which is the $W$-algebra

$$W_n^{(k)} \tau = 0, \qquad 2 \leq k \leq p, \quad n \geq -k + 1. \tag{12.10.37}$$

We stress that these two remarkable equations were derived independently 
using matrix model techniques [22]. 

We find that we can summarize a vast amount of nonperturbative information 
very succinctly in these constraint equations using the W algebra. 





Lastly, we point out that it is possible, in turn, to reexpress these W -algebra 
constraints as Ward identities on a second quantized topological string field 
theory. Thus, the W-algebra constraints are nothing but an expression of the 
gauge invariance inherent within string field theory [25]. 


References 


1. E. Witten, Nucl. Phys. B202, 253 (1982); J. Differential Geom. 17, 661 (1982). 

2. A. Floer, Bull. Amer. Math. Soc. 126, 335 (1987); Comm. Math. Phys. 118, 215 
(1988). 

3. M. Gromov, Invent. Math. 82, 307 (1985). 

4. S. Donaldson, J. Differential Geom. 18, 269 (1983); 26, 397 (1987). 

5. V. F. R. Jones, Bull. Amer. Math. Soc. 12, 103 (1986); Ann. of Math. 126, 335 
(1987). 

6. M. F. Atiyah, in Symposium on the Mathematical Heritage of Hermann Weyl, 
University of North Carolina Press, Chapel Hill (1987). 

7. E. Witten, Comm. Math. Phys. 117, 353 (1988). 

8. E. Witten, Comm. Math. Phys. 121, 351 (1989). 

9. E. Witten, Nucl. Phys. B311, 46 (1988/1989). 

10. P. Van Baal, CERN-TH.5453/89 (1989). 

11. D. Montano and J. Sonnenschein, Nucl. Phys. B313, 258 (1989); Nucl. Phys.
B324, 348 (1989).

12. S. Ouvry, R. Stora, and P. Van Baal, Phys. Lett. 220B, 159 (1988).

13. J. M. F. Labastida and M. Pernici, Phys. Lett. B212, 56 (1988).

14. L. Baulieu and I. M. Singer, Nucl. Phys. Suppl. 5B, 12 (1988); see also L. Baulieu
and B. Grossman, Phys. Lett. 212B, 319 (1988).

15. E. Witten, Comm. Math. Phys. 118, 411 (1988).

16. J. Labastida, M. Pernici, and E. Witten, Nucl. Phys. B310, 611 (1988).

17. J. Distler, PUPT-1161 (1989). 

18. E. Witten, IASSNS-HEP 89/66 (1989). 

19. R. Dijkgraaf and E. Witten, IASSNS-HEP 90/18 (1990). 

20. E. Verlinde and H. Verlinde, IASSNS-HEP 90/40 (1990).

21. R. Dijkgraaf, H. Verlinde, and E. Verlinde, Princeton preprint PUPT-1194 (1990). 

22. M. Fukuma, H. Kawai, and R. Nakayama, Tokyo preprint UT-562 (1990). 

23. K. Li, CALT-68-1662. 

24. A. B. Zamolodchikov, Theoret. Math. Phys. 65, 1205 (1986); see also A. Bilal 
and J. L. Gervais, Phys. Lett. 206B, 412 (1988). 

25. M. Kaku, CCNY preprint (1991). 



CHAPTER 13 


Seiberg-Witten Theory 


13.1 Introduction 

The theme of this book is to find the true vacuum of string theory. So far, we have 
developed several approaches to solving string theory, including conformal 
field theory, topological methods, two-dimensional matrix models, and string 
field theory. Although each of them gives insight into the perturbative nature of 
string theory, none of them has succeeded in penetrating into its nonperturbative 
region in D > 4. 

For example, conformal field theory is a powerful method to categorize 
the various vacua of string theory. However, millions of such conformal field 
theories have been discovered, none of which quite matches the observed 
spectra of particles. With a few surprisingly simple assumptions, we can derive 
many of the features found in subatomic physics, but we have not yet found 
the Standard Model among the conformal field theory vacua. 

Similarly, string field theory has not yet lived up to its promise. Although 
string field theory is defined independent of perturbation theory, no one has 
been able to solve for the nonperturbative region of string field theory. Nonperturbative
phenomena are notoriously difficult to solve in four-dimensional
point-particle field theory, let alone string field theory in 10 dimensions.

Unlike conformal field theory or string field theory, two-dimensional matrix 
models have given us insight into the nonperturbative region of string theory. 
However, many of the techniques of two-dimensional physics do not necessarily
carry over to four dimensions, let alone 10, and hence two-dimensional
matrix models have only given us helpful hints rather than solid results about 
string theory’s nonperturbative behavior. 

At present, the only systematic way to explore the nonperturbative region of
string theory is through duality. Not only has duality given us an unprecedented
look into the strong coupling region, it has revealed the existence of an
11-dimensional M-theory, which allows us to unify all five known superstring
theories into a single theory. From duality, we can see that the spectrum of string
theory includes not only strings, but other exotic objects, such as p-branes,
D-branes, black holes, etc. These soliton-like objects reveal the unexpected
richness of string theory.

For example, when applied to supersymmetric theories in four dimensions,
supersymmetry and duality are often strong enough to probe the nonperturbative
region and even give us exact results. By assuming that the prepotentials
and superpotentials of supersymmetric gauge theories are holomorphic, we
can, in fact, often determine the entire function by isolating its singularities and
asymptotic behavior [1-4]. This, in fact, led Seiberg and Witten to completely
solve $N = 2$ supersymmetric $SU(2)$ gauge theory [5, 6]. Supersymmetry, it
turns out, is such a powerful constraint on a gauge theory that one can solve
some of them completely.

We will see that many of the qualitative features conjectured by Mandelstam,
’t Hooft, Polyakov, and others for gauge theories can be concretely
realized in supersymmetric gauge theories. In particular, dual relations have
been established linking

$$\begin{aligned} \text{weak coupling} &\leftrightarrow \text{strong coupling}, \\ \text{electric phase} &\leftrightarrow \text{magnetic phase}, \\ \text{Higgs phase} &\leftrightarrow \text{confinement}. \end{aligned} \tag{13.1.1}$$

The astonishing array of dualities that can be written forces
us to question what is truly “fundamental” about a theory. Since dualities can
be made between weakly coupled electrically charged particles and strongly
coupled magnetic monopoles (which can be viewed as composites of the electrons),
both the particle and its bound states are treated equally. In fact,
it is impossible to make a fundamental distinction between the particle nature 
of the theory and its composites. Hence, the notion of a “particle” is no longer 
really fundamental, at least from a dual point of view. Likewise, since the dual 
theory may have an entirely different gauge group, we see that local gauge 
symmetry is also probably not fundamental. Because strings can be dual to 
membranes, it also probably means that strings, at least in this viewpoint, are 
not any more fundamental than membranes. In fact, there may be a “p-brane 
democracy” among all the various p-branes. 

In this chapter, we will first explore the implications of duality for four- 
dimensional supersymmetric gauge theories, where we can sometimes solve 
the entire theory by exploiting duality and the holomorphic nature of the prepotential
and superpotential. This will give us deep insight into the mechanisms
which may persist in superstring theory. Then, in later chapters, we will apply 
these methods to string theory, which then allows us to show the existence of 
a still-mysterious 11-dimensional theory called M-theory. 





13.2 Electric-Magnetic Duality 

Duality was first observed in Maxwell’s theory of electricity and magnetism. 
Dirac showed that if we add magnetic monopoles to the standard Maxwell’s 
equations, then the electric and magnetic charges are quantized under 

$$eg = 2\pi n, \tag{13.2.1}$$

where $n$ is an integer. Then the theory is invariant under the interchange

$$E \to B, \qquad B \to -E. \tag{13.2.2}$$

Notice that we can transform a theory of weakly coupled electrons into 
a theory of strongly coupled magnetic monopoles, and vice versa. This is a 
nontrivial symmetry, because it shows that we may penetrate into the strong 
coupling region of a theory by exploiting the perturbation region of its dual. 
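
The two statements above can be checked numerically. The following is a minimal sketch, with hypothetical helper names and arbitrary sample field values (none of them from the text): it applies the interchange (13.2.2) to a pair of field vectors, confirms that the energy density and the Poynting vector are unchanged, and uses the Dirac condition (13.2.1) to show that a weak electric charge implies a strong magnetic charge.

```python
import math

def dot(u, v):
    """Dot product of two 3-vectors given as tuples."""
    return sum(x * y for x, y in zip(u, v))

def cross(u, v):
    """Cross product of two 3-vectors given as tuples."""
    return (u[1] * v[2] - u[2] * v[1],
            u[2] * v[0] - u[0] * v[2],
            u[0] * v[1] - u[1] * v[0])

def dualize(E, B):
    """Duality map of (13.2.2): E -> B, B -> -E."""
    return B, tuple(-x for x in E)

def energy_density(E, B):
    """(E^2 + B^2) / 2 in units where all constants are 1."""
    return 0.5 * (dot(E, E) + dot(B, B))

def dual_charge(e, n=1):
    """Magnetic charge g fixed by the Dirac condition e * g = 2 * pi * n."""
    return 2.0 * math.pi * n / e

# Arbitrary sample fields (illustrative values only).
E = (1.0, -2.0, 0.5)
B = (0.3, 0.7, -1.1)
E_dual, B_dual = dualize(E, B)
```

With a weak electric coupling such as `e = 0.1`, `dual_charge(0.1)` is of order 60, which is the sense in which the dual description is strongly coupled.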

When ’t Hooft-Polyakov monopoles were discovered in gauge theory, Montonen
and Olive [7] tried to generalize this dual symmetry to non-Abelian
theories. In particular, it was discovered that electrons, monopoles, and dyons
(which have both electric and magnetic charges) obeyed the following mass
formula:

$$M^2 \geq c^2 (q_e^2 + q_m^2), \tag{13.2.3}$$

where $q_e$ and $q_m$ represent the electric and magnetic charges, respectively,
and $c$ is a constant. Notice that the mass of a particle is related to its
charge. As in the Abelian case, we see that this formula is invariant under an 
interchange of electric and magnetic charges, so one may in principle probe 
the nonperturbative region of the theory. 

When this relationship becomes an equality, we have what is called a BPS 
state [8]. (These BPS states will figure very prominently in our discussion, 
since it is believed that nonrenormalization theorems protect these BPS states 
from being renormalized. Hence, they should appear even in a nonperturbative 
treatment of the theory.) 

We will find it convenient to rewrite this important formula. Let $n_m$ and $n_e$
be integers representing the magnetic and electric charges of the particle in
question. Then, exploiting Dirac’s relationship, we can show that a BPS state
satisfies

$$M = |ce(n_e + \tau n_m)|, \tag{13.2.4}$$

where $q_e = e n_e$ and $q_m = e\tau n_m$, with $n_e$ and $n_m$ being integers, and

$$\tau = i\frac{4\pi}{e^2} + \frac{\theta}{2\pi}. \tag{13.2.5}$$

(Notice that we have added the $\theta$ angle, which is generated by instantons and
appears in the action multiplied by $F_{\mu\nu}\tilde F^{\mu\nu}$.)
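
As a consistency check between the two forms of the mass formula, the sketch below evaluates (13.2.4) at $\theta = 0$ and compares it with the saturated bound (13.2.3). It assumes the magnitude of the magnetic charge is $|q_m| = e|\tau| n_m = 4\pi n_m / e$, as follows from the definitions above; the function names and sample numbers are illustrative only.

```python
import math

def tau(e, theta=0.0):
    """Complex coupling of (13.2.5): tau = theta / (2 pi) + 4 pi i / e^2."""
    return complex(theta / (2.0 * math.pi), 4.0 * math.pi / e**2)

def bps_mass(c, e, n_e, n_m, theta=0.0):
    """BPS mass of (13.2.4): M = |c e (n_e + tau n_m)|."""
    return abs(c * e * (n_e + tau(e, theta) * n_m))

def saturated_bound(c, e, n_e, n_m):
    """Right-hand side of (13.2.3) as an equality, at theta = 0.

    Assumes q_e = e n_e and |q_m| = 4 pi n_m / e.
    """
    q_e = e * n_e
    q_m = 4.0 * math.pi * n_m / e
    return c * math.sqrt(q_e**2 + q_m**2)

# Illustrative values: a dyon with n_e = 2, n_m = 3.
example = (bps_mass(1.5, 0.7, 2, 3), saturated_bound(1.5, 0.7, 2, 3))
```

The same helper also shows that shifting `theta` by $2\pi$ shifts `tau` by exactly 1, which is the first of the $SL(2, Z)$ generators.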





Now let us rewrite this expression via $a = ce$ and $a_D = \tau a$. Then we can
write the BPS relation as

$$M = |a\, n_e + a_D\, n_m|. \tag{13.2.6}$$

Now it is obvious that this mass formula is invariant under a set of transformations
generated by matrices of unit determinant. Since the charges $n_m$ and $n_e$
are integers, this means that $M$ is invariant under transformations of $SL(2, Z)$.

For example, shifting $\theta$ by $2\pi$ does not change the physics. This, in turn,
corresponds to the transformation

$$\tau \to \tau + 1. \tag{13.2.7}$$

Similarly, the transformation $a \to a_D$ and $a_D \to -a$ corresponds to

$$\tau \to -\frac{1}{\tau}. \tag{13.2.8}$$

This last identity is quite crucial, since it links the perturbative with the nonperturbative
regions of the theory, if we interchange magnetic and electric
charges $n_e \to n_m$ and $n_m \to -n_e$.

These two transformations generate the group $SL(2, Z)$:

$$\tau \to \frac{a\tau + b}{c\tau + d}, \tag{13.2.9}$$

where $a$, $b$, $c$, $d$ are integers satisfying $ad - bc = 1$.
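
The invariance claimed above can be verified directly. The sketch below is an illustration under stated assumptions, not the book's notation: it writes the BPS mass as $|n_m a_D + n_e a|$, lets a unit-determinant integer matrix $g$ act on the column $(a_D, a)$, and lets the charge row $(n_m, n_e)$ transform with $g^{-1}$. All helper names and sample values are hypothetical.

```python
# Generators of SL(2, Z), acting on the column (a_D, a).
T = ((1, 1), (0, 1))    # tau -> tau + 1
S = ((0, -1), (1, 0))   # tau -> -1/tau

def matmul(g, h):
    """Product of two 2x2 matrices stored as nested tuples."""
    return ((g[0][0] * h[0][0] + g[0][1] * h[1][0], g[0][0] * h[0][1] + g[0][1] * h[1][1]),
            (g[1][0] * h[0][0] + g[1][1] * h[1][0], g[1][0] * h[0][1] + g[1][1] * h[1][1]))

def mobius(g, tau):
    """Fractional linear action of (13.2.9) on tau."""
    (p, q), (r, s) = g
    return (p * tau + q) / (r * tau + s)

def act_on_periods(g, a_D, a):
    """(a_D, a) -> g (a_D, a), treated as a column vector."""
    (p, q), (r, s) = g
    return p * a_D + q * a, r * a_D + s * a

def act_on_charges(g, n_m, n_e):
    """(n_m, n_e) -> (n_m, n_e) g^{-1}; the result stays integer because det g = 1."""
    (p, q), (r, s) = g
    assert p * s - q * r == 1
    return s * n_m - r * n_e, -q * n_m + p * n_e

def bps_mass(a_D, a, n_m, n_e):
    """BPS mass written as |n_m a_D + n_e a|."""
    return abs(n_m * a_D + n_e * a)

def check(g, tau=0.25 + 1.5j, a=0.8 - 0.3j, n_m=2, n_e=-3):
    """True if g preserves the mass and acts on tau = a_D / a as in (13.2.9)."""
    a_D = tau * a
    new_a_D, new_a = act_on_periods(g, a_D, a)
    new_n_m, new_n_e = act_on_charges(g, n_m, n_e)
    same_mass = abs(bps_mass(new_a_D, new_a, new_n_m, new_n_e)
                    - bps_mass(a_D, a, n_m, n_e)) < 1e-12
    same_tau = abs(new_a_D / new_a - mobius(g, tau)) < 1e-12
    return same_mass and same_tau
```

Under `S` the charge map sends $(n_m, n_e) = (0, 1)$ to $(-1, 0)$, so a purely electric state becomes purely magnetic, in line with the interchange $n_e \to n_m$, $n_m \to -n_e$.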

It was tantalizing for Olive and Montonen to speculate that the weak coupling 
region describing electrons was dual to the strong coupling region describing 
monopoles. Unfortunately, they were unable to push this assumption very far. 

Today, with duality and supersymmetry, one can use holomorphic prepotentials
and superpotentials to completely solve the $N = 2$ supersymmetric
gauge theory and realize some of these conjectures.


13.3 Holomorphic Potentials 

Let us first analyze these techniques in the context of the simplest possible 
supersymmetric field theory, the Wess-Zumino model. We will use the holomorphic
property of this theory to rederive the nonrenormalization theorem of
supersymmetric theories. These powerful theorems use the superfield formalism
to show that supersymmetry forbids certain interactions to any order in
perturbation theory. Hence, certain coupling constants are not renormalized to 
any order. 

In the Wess-Zumino model, the interaction is given by 

$$W = m\phi^2 + \lambda\phi^3. \tag{13.3.1}$$

Now let us turn on quantum effects. Since the theory is invariant under
a $U(1) \times U(1)_R$ symmetry, we will assign quantum numbers
to both the fields and the coupling constants. (As in string theory, where the
coupling constant emerges as the vacuum expectation value of one of the
fields, we will assume that the coupling constants have quantum numbers.)
Let us assign the following quantum numbers:

               U(1)    U(1)_R
  $\phi$         1        1
  $m$           -2        0
  $\lambda$     -3       -1

Now let us assume that the exact potential, after quantum effects have been 
added, is holomorphic, i.e., it is only a function of the fields and their coupling 
constants (but not their complex conjugates). By simply counting quantum 
numbers, we see the most general and unique holomorphic potential is given 
by 

$$W_{\rm eff} = m\phi^2 f\!\left(\frac{\lambda\phi}{m}\right), \tag{13.3.2}$$

where $f$ is some arbitrary function. Now power expand this function

$$W_{\rm eff} = \sum_{i=0}^{\infty} c_i\, m^{1-i} \lambda^i \phi^{i+2}. \tag{13.3.3}$$

If we examine this sum carefully, we find that each term corresponds to 
corrections due to higher-order tree diagrams. But notice that loop corrections
contribute terms which are not of this form. Since $f$ was the most general holomorphic
function compatible with our quantum numbers, we see that these loop
contributions are therefore strictly forbidden. This means that these quantum
corrections cancel, which is precisely the statement of the nonrenormalization
theorem applied to the Wess-Zumino model.

We see, therefore, that the simple assumption of holomorphic potentials 
forbids the appearance of loop corrections, which allows us to rederive the 
nonrenormalization theorem for the Wess-Zumino model. 
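
The charge counting behind this argument is simple enough to automate. The sketch below uses the quantum numbers from the table above and assumes, as is standard, that an allowed superpotential term must be neutral under $U(1)$ and carry $R$-charge 2 (the charges of the tree-level terms $m\phi^2$ and $\lambda\phi^3$). The dictionary keys and function names are hypothetical.

```python
# (U(1), U(1)_R) charges of the field and the couplings, from the table in the text.
CHARGES = {"phi": (1, 1), "m": (-2, 0), "lam": (-3, -1)}

def charges_of(powers):
    """Total (U(1), U(1)_R) charge of a monomial given as {name: exponent}."""
    q = sum(CHARGES[name][0] * n for name, n in powers.items())
    r = sum(CHARGES[name][1] * n for name, n in powers.items())
    return q, r

def is_allowed(powers):
    """Assumed selection rule: U(1) charge 0 and R-charge 2."""
    return charges_of(powers) == (0, 2)

def series_term(i):
    """Exponents of the i-th term m^(1-i) lam^i phi^(i+2) in the power expansion."""
    return {"m": 1 - i, "lam": i, "phi": i + 2}

# Every term of the expansion passes; other monomials do not.
all_pass = all(is_allowed(series_term(i)) for i in range(10))
```

For instance, `is_allowed({"lam": 2, "phi": 4})` and `is_allowed({"m": 2, "phi": 4})` are both false, so such monomials cannot appear in the holomorphic effective potential.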

When loop corrections are added, we expect that the prepotential will be 
modified in nontrivial ways. Therefore, it helps to summarize the kinds of 
phases that one might expect to find in this interacting gauge theory. 

In this regard, it is helpful to introduce the Wilson loop parameter 

$$W_w = {\rm Tr}\, P \exp\left(i \oint A_\mu\, dx^\mu\right), \tag{13.3.4}$$

where we take the path-ordered integral over a rectangle, of length T and width 
r. Physically, this characterizes two electrically charged sources (e.g., quark- 
antiquark pair) which are created a distance r apart, which then propagate for 
a time T before they annihilate. 

Then we can extract the potential from the expectation value 

$$\lim_{T \to \infty} \langle W_w \rangle = e^{-TV(r)}. \tag{13.3.5}$$





Let V(r) represent the potential function between quarks. Then, depending 
on the value of V(r), the theory can be in one of several phases. 

In the Coulomb non-Abelian phase , we find the familiar spectrum of free 
photons and electrons. The potential between particles goes as 

$$V(r) \sim \frac{e^2}{r}, \tag{13.3.6}$$

where the charge e is a constant. (For the other cases, we will see that the 
electric charge can be renormalized to include logarithms, so that the constant 
e becomes e(r).) 

In the free electric phase, we find massless electrons and photons, but the
electric charge is renormalized, i.e., $e^2(r) \sim 1/\log(r\Lambda)$. The interactions are
weaker than the standard Coulomb phase due to the logarithm. The interaction
goes to zero at large distances. The theory is not asymptotically free. The
potential goes as

$$V(r) \sim \frac{1}{r \log(r\Lambda)}. \tag{13.3.7}$$

In the free magnetic phase, which is the dual of the free electric phase, there
are massless magnetic monopoles which renormalize the electric coupling
constant to infinity at large distances. The charge goes as $e^2(r) \sim \log(r\Lambda)$. The potential goes
as

$$V(r) \sim \frac{\log(r\Lambda)}{r}. \tag{13.3.8}$$


In the Higgs phase , we find that the potential behaves like 


$$V(r) \sim {\rm const}. \tag{13.3.9}$$

In this case, we have an exponential fall off in the perimeter of the loop, and 
the gluon fields do not confine the quarks. 

In the confining phase , we find that the potential goes as 

$$V(r) \sim r. \tag{13.3.10}$$

This means that the flux lines have condensed into a narrow tube, which confines
the charges. Then the Wilson loop damps as the exponential of the area of
the loop. This, in fact, can be taken as a criterion for confinement for electric 
charges. This phase is the dual of the Higgs phase. 
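
The link between the form of $V(r)$ and the area or perimeter behavior of the loop follows directly from $\langle W_w \rangle = e^{-TV(r)}$. The following is a minimal sketch with illustrative potentials (unit coefficients and function names are assumptions): it computes $-\ln \langle W_w \rangle = T\,V(r)$ for a large $r \times T$ rectangle and compares how the exponent responds when $r$ is doubled at fixed $T$.

```python
def coulomb(r, e2=1.0):
    """Coulomb-like potential V ~ e^2 / r."""
    return e2 / r

def higgs(r, v0=1.0):
    """Higgs-phase potential V ~ const."""
    return v0

def confining(r, sigma=1.0):
    """Confining potential V ~ sigma * r, with sigma playing the role of a string tension."""
    return sigma * r

def minus_log_wilson(V, r, T):
    """-ln <W_w> = T * V(r) for a large r x T rectangular loop."""
    return T * V(r)

T_long = 50.0
# Ratio of exponents when the width r is doubled at fixed T.
ratios = {
    "confining": minus_log_wilson(confining, 2.0, T_long) / minus_log_wilson(confining, 1.0, T_long),
    "higgs": minus_log_wilson(higgs, 2.0, T_long) / minus_log_wilson(higgs, 1.0, T_long),
    "coulomb": minus_log_wilson(coulomb, 2.0, T_long) / minus_log_wilson(coulomb, 1.0, T_long),
}
```

In the confining case the exponent equals $\sigma r T$, the area of the loop, so it doubles with $r$. In the Higgs case it depends only on $T$, which for $T \gg r$ tracks the perimeter.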

A problem arises, however, when we want a criterion for discussing the 
confinement of monopoles. For this, we introduce another order parameter, 
the ’t Hooft parameter $W_t$, which is made by considering twisted boundary
conditions around the Wilson loop. In this sense, we are replacing quark-antiquark
pairs with monopole-antimonopole pairs. We can set up the usual
path-ordered integral and take the limit

$$\lim_{T \to \infty} \langle W_t \rangle = e^{-TV(r)}. \tag{13.3.11}$$





In contrast to the Wilson loop, the ’t Hooft loop for the free electric and
free magnetic phases has potentials which are reversed from the potentials
found for the Wilson loop. The potentials for the confinement and the Higgs
phase are also reversed. The potential $V(r)$, however, remains the same for the
Coulomb phase.

Lastly, we need an order parameter to discuss the confinement of dyonic 
charges. We introduce a dyonic order parameter $W_d = W_w W_t$. This order
parameter is convenient when discussing oblique confinement, when referring 
to dyonic charges. 

In summary, we have the following area/perimeter laws: 


  Phase          $W_w$        $W_t$        $W_d$
  Higgs          perimeter    area         area
  Confinement    area         perimeter    area
  Oblique        area         area         perimeter


Let us also make one closing remark. Often, to simplify the problem, we
will rewrite the theory in terms of an effective action. In general, there are two
ways in which to obtain the effective action. First, we can sum over one-particle
irreducible diagrams. The standard generating functional $\Gamma(\mu, \phi)$ is
obtained from the generating functional of connected diagrams by a Legendre
transformation; $\mu$ is the scale used to define renormalized vertex functions.
Second, we can take the Wilsonian approach, i.e., integrating over all loops
above a certain energy $\mu$, giving us an effective field theory action $S(\mu, \phi)$ with
fields defined below this energy. For a massive field theory, we find that there is
no difference between these two approaches, i.e., we find $\Gamma(\mu, \phi) = S(\mu, \phi)$.
However, for theories with massless particles, we find that the effective potential
given by summing over one-particle irreducible diagrams differs from
the one obtained using the Wilsonian approach. In particular, the one-particle
irreducible diagrams may suffer from holomorphic anomalies in the infrared region,
which would spoil our analysis. For this reason, we will use the Wilsonian
approach in this chapter.


13.4 N = 1 SUSY QCD

Now let us examine a more complicated model, N = l supersymmetric QCD 
with Nf flavors and N c colors [1^4]. We will be able to explore the nonpertur- 
bative behavior of supersymmetric QCD for various values of Nf. In addition to 
using holomorphic superpotentials, we will be able to determine the nonpertur- 
bative nature of this theory by cobbling together various techniques, including: 



(a) asymptotic freedom; (b) ’t Hooft anomaly constraints; (c) superconformal 
algebras, etc. 

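We begin with the superfield formalism, which contains fields that depend
on both the space-time variables $x^\mu$ and the Grassmann variables $\theta_\alpha$ and
$\bar\theta_{\dot\alpha} = (\theta_\alpha)^*$, which are complex two-component spinors. (Together, they form a
Dirac spinor, with four complex components.) We use the conventions: $\theta\psi = \theta^\alpha \psi_\alpha$,
$\theta^2 = \theta^\alpha \theta_\alpha = -2\theta^1\theta^2$, $\theta\sigma^\mu\bar\theta = \theta^\alpha \sigma^\mu_{\alpha\dot\alpha} \bar\theta^{\dot\alpha}$, $\sigma^\mu = (1, \sigma^i)$,
$\psi\chi = \psi^\alpha \chi_\alpha$, $\bar\psi\bar\chi = \bar\psi_{\dot\alpha} \bar\chi^{\dot\alpha}$, where $\sigma^i$ are the standard Pauli matrices.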

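Let us introduce the covariant derivatives

$$
\begin{aligned}
D_\alpha &= \partial/\partial\theta^\alpha + i\sigma^\mu_{\alpha\dot\alpha}\bar\theta^{\dot\alpha}\partial_\mu, \\
\bar D_{\dot\alpha} &= -\partial/\partial\bar\theta^{\dot\alpha} - i\theta^\alpha\sigma^\mu_{\alpha\dot\alpha}\partial_\mu.
\end{aligned}
\tag{13.4.1}
$$

In general, an arbitrary superfield is reducible under supersymmetry. To
find an irreducible representation, we observe that these covariant derivatives
anticommute with the generators of supersymmetry, so we are free to impose
the constraint $\bar D_{\dot\alpha}\Phi = 0$ on the chiral superfield $\Phi$, giving us an irreducible
superfield.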

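Then we can power expand the chiral superfield $\Phi$:

$$
\begin{aligned}
\Phi &= \phi(y) + \sqrt{2}\,\theta\psi(y) + \theta^2 F(y) \\
&= \phi(x) + i\theta\sigma^\mu\bar\theta\,\partial_\mu\phi(x) - \tfrac{1}{4}\theta^2\bar\theta^2\,\partial^2\phi(x) + \sqrt{2}\,\theta\psi(x) \\
&\quad - \frac{i}{\sqrt{2}}\,\theta^2\,(\partial_\mu\psi(x)\sigma^\mu\bar\theta) + \theta^2 F(x),
\end{aligned}
\tag{13.4.2}
$$

where we use the convention $y^\mu = x^\mu + i\theta\sigma^\mu\bar\theta$.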

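We can also introduce the vector superfield $V$, which transforms in the adjoint
representation of SU(N):

$$
V(x, \theta, \bar\theta) = -\theta\sigma^\mu\bar\theta A_\mu + i\theta^2(\bar\theta\bar\lambda) - i\bar\theta^2(\theta\lambda) + \tfrac{1}{2}\theta^2\bar\theta^2 D.
\tag{13.4.3}
$$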

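From $V$, we can construct the chiral spinor superfield $W_\alpha$, which contains the
Yang-Mills field tensor $F_{\mu\nu}$:

$$
W_\alpha = \frac{1}{8g}\,\bar D^2\left(e^{2gV} D_\alpha e^{-2gV}\right).
\tag{13.4.4}
$$

In components, this can be written as the expansion

$$
W = \left(-i\lambda + \theta D - i\sigma^{\mu\nu}\theta F_{\mu\nu} + \theta^2\sigma^\mu\nabla_\mu\bar\lambda\right)(y),
\tag{13.4.5}
$$

where $\nabla_\mu\lambda = \partial_\mu\lambda - ig[A_\mu, \lambda]$.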

We can now write the supersymmetric N = 1 Yang-Mills action (using the
convention $\int d^2\theta\, \theta^2 = -2$ and $\int d^2\theta\, d^2\bar\theta\, \theta^2\bar\theta^2 = 4$).



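We introduce the complex variable $\tau$ as before:

$$
\begin{aligned}
\frac{1}{16\pi}\,\mathrm{Im}\left[\tau \int d^4x\, d^2\theta\; \mathrm{Tr}\, W^\alpha W_\alpha\right]
&= \frac{1}{g^2}\int d^4x\; \mathrm{Tr}\left[-\tfrac{1}{4}F_{\mu\nu}F^{\mu\nu} - i\lambda\sigma^\mu\nabla_\mu\bar\lambda + \tfrac{1}{2}D^2\right] \\
&\quad + \frac{\theta}{32\pi^2}\int d^4x\; \mathrm{Tr}\, F_{\mu\nu}\tilde F^{\mu\nu}.
\end{aligned}
\tag{13.4.6}
$$

Notice that $\tau$, because it is a complex number, contributes to both the $F^2$
term and the instanton term $F\tilde F$.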

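In addition, the coupling to the chiral superfield is given by

$$
\begin{aligned}
\frac{1}{4}\int d^4x\, d^2\theta\, d^2\bar\theta\; \mathrm{Tr}\,\Phi^\dagger e^{-2gV}\Phi
&= \int d^4x\; \mathrm{Tr}\Big(|\nabla_\mu\phi|^2 - i\bar\psi\bar\sigma^\mu\nabla_\mu\psi + F^\dagger F \\
&\quad - g\phi^\dagger[D, \phi] - \sqrt{2}\,ig\,\phi^\dagger\{\lambda, \psi\} + \sqrt{2}\,ig\,\bar\psi[\bar\lambda, \phi]\Big).
\end{aligned}
\tag{13.4.7}
$$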

Let us now analyze a supersymmetric form of QCD, where we introduce
the squark superfield $Q$ which contains the quark fermions in the fundamental
representation of the gauge group. We can introduce a potential term into the
action. For the group U(1), for example, a simple interaction term involving
$Q$ and $\tilde Q$ would be

$$
V = (Q^\dagger Q - \tilde Q^\dagger \tilde Q)^2.
\tag{13.4.8}
$$

This means that there is a continuum of degenerate vacua labeled by $\langle Q\rangle =
\langle\tilde Q\rangle = a$, for some value of the complex constant $a$. For $a \neq 0$, the gauge
group is broken down by the super-Higgs mechanism. The gauge superfield
gets a mass $|a|$ by “eating” one superfield degree of freedom, as in the Higgs
mechanism. So one superfield degree of freedom remains massless. We can
choose this massless superfield to be $X = Q\tilde Q$. We have $\langle Q\tilde Q\rangle = a^2$. We
say that $X$ is a modulus whose expectation value labels the moduli space of
degenerate vacua.

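Now let us generalize this simple example for the case of $SU(N_c)$ QCD with
$N_f$ flavors. We see that the potential can vanish not just for a few discrete values
of the fields, but for a continuum of values. We will represent the expectation
values of the fields $Q^i$ and $\tilde Q_i$ by

$$
q = \tilde q =
\begin{pmatrix}
a_1 &     &        &         & 0 & \cdots & 0 \\
    & a_2 &        &         & \vdots & & \vdots \\
    &     & \ddots &         & \vdots & & \vdots \\
    &     &        & a_{N_f} & 0 & \cdots & 0
\end{pmatrix}
\tag{13.4.9}
$$

for $N_f < N_c$ (where the columns represent the color indices, and the rows the
flavor indices), and

$$
q =
\begin{pmatrix}
a_1 &        &         \\
    & \ddots &         \\
    &        & a_{N_c} \\
0   & \cdots & 0       \\
\vdots &     & \vdots
\end{pmatrix},
\qquad
\tilde q =
\begin{pmatrix}
\tilde a_1 &        &                \\
           & \ddots &                \\
           &        & \tilde a_{N_c} \\
0          & \cdots & 0              \\
\vdots     &        & \vdots
\end{pmatrix}
\tag{13.4.10}
$$

for the case $N_f \ge N_c$.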

We find that the potential vanishes for a continuous set of expectation
values $q$. These play the role of Higgs fields, and will break the symmetry group
$SU(N_c)$ down to a subgroup. This also means that some of the gauge bosons
become massive.

This continuum of solutions, each of which represents an inequivalent vac¬ 
uum, is called the classical moduli space of the theory. We also say that the 
potential vanishes along these flat directions corresponding to this moduli 
space. Much of this chapter will be spent trying to deduce the analytic properties 
of this moduli space. 

Furthermore, we see that, for a certain set of values of the expectation values,
there is a singularity in the moduli space. By this, we mean that the symmetry
group abruptly changes at these singular points. For example, if $\langle q\rangle = 0$, then
we see that the moduli space is singular at the origin. In particular, we will
find that the symmetry group $SU(N_c)$ is restored at this singular point. Since
this enhancement of the gauge group means that certain gauge fields must
become massless, we find that singularities in the moduli space correspond
to certain particles (gauge particles, monopoles, etc.) becoming massless.

Let us now examine these features when we turn on the quantum interactions. 
In analyzing the nonperturbative behavior of supersymmetric QCD, we will 
extensively use two key results: the holomorphic form of the superpotential, 
and asymptotic freedom. 

As in the example of the Wess-Zumino model, we will assume that the
superpotential is holomorphic in the fields and the coupling constants. Then,
we find the following form of the nonperturbative superpotential:

$$
W_{\rm eff} = (N_c - N_f)\,\frac{\Lambda^{(3N_c - N_f)/(N_c - N_f)}}{(\det \tilde Q Q)^{1/(N_c - N_f)}}.
\tag{13.4.11}
$$


We can also add supersymmetric mass terms to analyze this theory in the 
presence of small masses. 

Second, in order to deduce the properties of the theory for various values of
$N_f$, it will be useful to calculate the beta function of the theory.

The beta function can be calculated to give

$$
\begin{aligned}
\beta(g) &= -\frac{g^3}{16\pi^2}\,\frac{3N_c - N_f + N_f\,\gamma(g^2)}{1 - N_c\,(g^2/8\pi^2)}, \\
\gamma(g^2) &= -\frac{g^2}{8\pi^2}\,\frac{N_c^2 - 1}{N_c} + O(g^4),
\end{aligned}
\tag{13.4.12}
$$

where $\gamma(g^2)$ is the anomalous dimension of the mass. This, in particular, shows
that a fixed point exists for every $3N_c/2 < N_f < 3N_c$.
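
The fixed-point statement can be made concrete with a short numerical sketch. It only uses the numerator of the beta function quoted above, $3N_c - N_f + N_f\gamma$, and solves for the value of $\gamma$ at which it vanishes. The relation $D = 2 + \gamma$ for the scaling dimension of the meson operator, and the unitarity bound $D \ge 1$ used to interpret the window, are standard results that are assumed here rather than derived in this section; the function names are purely illustrative.

```python
from fractions import Fraction


def one_loop_coefficient(n_c: int, n_f: int) -> int:
    """Lowest-order numerator 3*N_c - N_f (gamma set to zero).

    Positive means asymptotically free with the sign convention assumed above.
    """
    return 3 * n_c - n_f


def gamma_at_fixed_point(n_c: int, n_f: int) -> Fraction:
    """Value of gamma for which 3*N_c - N_f + N_f*gamma vanishes."""
    return Fraction(n_f - 3 * n_c, n_f)  # equals 1 - 3*N_c/N_f


def meson_dimension(n_c: int, n_f: int) -> Fraction:
    """Assumed fixed-point scaling dimension D = 2 + gamma of the meson."""
    return 2 + gamma_at_fixed_point(n_c, n_f)


if __name__ == "__main__":
    n_c = 3
    print("N_f  3N_c-N_f  gamma*  D")
    for n_f in range(4, 11):
        print(n_f, one_loop_coefficient(n_c, n_f),
              gamma_at_fixed_point(n_c, n_f), meson_dimension(n_c, n_f))
```

Under these assumptions $D$ reaches 1 exactly at $N_f = 3N_c/2$ and 2 at $N_f = 3N_c$, which reproduces the two ends of the window quoted in the text.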

We will now examine this theory in various regions. 

13.4.1 $N_f < N_c$

To describe the moduli space, it is convenient to introduce gauge invariant
moduli rather than relying on the individual $Q$. For $N_f < N_c$, we can choose
the meson composite field for the gauge invariant coordinates on moduli space
as $M^i_j = Q^i \tilde Q_j$. There are no baryon composite fields. In general, the vacuum
expectation values can break $SU(N_c)$ gauge symmetry down to $SU(N_c - N_f)$
gauge symmetry.

However, notice that the potential is modified by nonperturbative quantum
corrections. (In fact, for $N_f = N_c - 1$, this quantum superpotential is generated
by instantons, while for $N_f < N_c - 1$ it is generated by gluino condensation.)

Several surprising conclusions can be drawn. First, we see that the nonrenor¬ 
malization theorem is violated by nonperturbative effects. Since the original 
nonrenormalization theorem was derived perturbatively using Feynman’s rules 
with chiral superfields, we see that it is not powerful enough to determine the 
nonperturbative character of a supersymmetric theory. 

Second, the new superpotential slopes off to zero. This means that the theory 
is unstable, i.e., it has no ground state. Therefore the theory does not exist 
quantum mechanically. (Ironically, the classical theory has an infinite set of 
ground states, but the quantum theory has none.) 
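
The runaway behavior can be seen by evaluating the superpotential (13.4.11) along a flat direction. The sketch below treats $\det \tilde Q Q$ as a single positive real number and sets $\Lambda = 1$ by default; both are simplifying assumptions made only for illustration, and the function name `w_eff` is hypothetical.

```python
def w_eff(det_m: float, n_c: int, n_f: int, scale: float = 1.0) -> float:
    """Evaluate (N_c - N_f) * (scale^(3N_c - N_f) / det_m)^(1/(N_c - N_f)).

    Only defined for N_f < N_c and, in this sketch, for det_m > 0.
    """
    if not n_f < n_c:
        raise ValueError("this superpotential only exists for N_f < N_c")
    if det_m <= 0:
        raise ValueError("this sketch assumes a positive real determinant")
    exponent = 1.0 / (n_c - n_f)
    return (n_c - n_f) * (scale ** (3 * n_c - n_f) / det_m) ** exponent


if __name__ == "__main__":
    for det_m in (1.0, 1e2, 1e4, 1e8):
        print(det_m, w_eff(det_m, n_c=3, n_f=2))
```

The value decreases monotonically as the determinant grows and only reaches zero at infinite field values, which is the sense in which the theory has no ground state.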

For $N_f \ge N_c$, however, we see that the superpotential does not exist at all,
i.e., it either diverges, or the determinant vanishes. Although this means that the
vacuum degeneracy is not lifted quantum mechanically, the quantum moduli
space can still differ from the classical one. In this region, we can also break
$SU(N_c)$ completely.

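13.4.2 $N_f = N_c$

For the case $N_f \ge N_c$, we have meson, baryon, and antibaryon composite fields

$$
\begin{aligned}
M^i_j &= Q^i \tilde Q_j, \\
B^{[i_1 \ldots i_{N_c}]} &= Q^{i_1} \cdots Q^{i_{N_c}}, \\
\tilde B_{[i_1 \ldots i_{N_c}]} &= \tilde Q_{i_1} \cdots \tilde Q_{i_{N_c}}.
\end{aligned}
\tag{13.4.13}
$$

Classically, the moduli are constrained by

$$
\det M - B\tilde B = 0.
\tag{13.4.14}
$$

(This follows simply from the Bose statistics of $Q$ and $\tilde Q$.)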
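
For $N_f = N_c$ the classical constraint is just the multiplicative property of determinants, which can be checked directly. In the sketch below the squark expectation values are arbitrary integer matrices chosen for illustration, and the baryons are normalized as $B = \det Q$, $\tilde B = \det \tilde Q$; this normalization is an assumption, since the antisymmetrized product fixes them only up to a constant.

```python
from itertools import permutations


def det(m):
    """Determinant by the Leibniz permutation sum (fine for small matrices)."""
    n = len(m)
    total = 0
    for perm in permutations(range(n)):
        sign = 1
        for i in range(n):
            for j in range(i + 1, n):
                if perm[i] > perm[j]:
                    sign = -sign
        term = 1
        for row, col in enumerate(perm):
            term *= m[row][col]
        total += sign * term
    return total


def matmul(a, b):
    """Product of two square matrices stored as lists of rows."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


# Illustrative values for N_f = N_c = 3.
q = [[1, 2, 0], [0, 1, 3], [2, 0, 1]]        # rows: flavor i, columns: color
q_tilde = [[1, 0, 1], [2, 1, 0], [0, 1, 1]]  # rows: color, columns: flavor j

meson = matmul(q, q_tilde)   # M^i_j = Q^i Qtilde_j, summed over color
baryon = det(q)              # B, up to normalization
antibaryon = det(q_tilde)    # Btilde, up to normalization

if __name__ == "__main__":
    print(det(meson), baryon * antibaryon)
```

Any other choice of square matrices satisfies the same identity, so the constraint holds over the whole classical moduli space.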

For $N_f = N_c$, quantum effects transform the previous constraint equation
into

$$
\det M - B\tilde B = \Lambda^{2N_c},
\tag{13.4.15}
$$



where the scale $\Lambda$ is determined dynamically. The important correction to
the moduli space is the right-hand side of the equation, which smooths out
the singularities of the moduli space. (This term arises due to a one-instanton
effect.) Classically, a singularity of moduli space is given by $Q = \tilde Q =
0$, which corresponds to certain particles becoming massless. However, this
singular point in the classical moduli space does not satisfy the above identity,
so the origin is missing in the quantum moduli space.

Although the moduli space has no singularities, the massless particles are the
moduli themselves, the mesons and baryons. In fact, we have confinement in
this region. The Higgsing of the magnetic variables yields confinement of the
electric variables. Here we have confinement with chiral symmetry breaking.


13.4.3 $N_f = N_c + 1$

Notice that the determinant vanishes for this value. Hence, the classical vacuum 
degeneracy is not removed quantum mechanically. The classical moduli space 
equals the quantum moduli space. But there are singularities in the moduli 
space, which signal the fact that certain particles, the M and B states, are 
becoming massless. 

If we turn on instantons, we can show that the constraints are modified in
the following way:

$$
\begin{aligned}
\det M\,(M^{-1})^j_i - B_i \tilde B^j &= 0, \\
M^i_j B_i = M^i_j \tilde B^j &= 0.
\end{aligned}
\tag{13.4.16}
$$


As before, the massless particles are given by the moduli, the mesons, and 
baryons, which are now elementary fields. The baryons can be viewed as 
magnetic monopoles composed of elementary quarks and gluons. 

In this region, we have confinement with chiral symmetry. 


13.4.4 $N_c + 2 \le N_f \le \frac{3}{2}N_c$

In this region, the electric fields are all strongly coupled. We refer to it as being 
in the free magnetic phase. The massless magnetic states are composites of the 
electrically charged states. 

The interesting observation is that, in this region, a gauge theory with
$SU(N_c)$ gauge symmetry is dual to an equivalent theory with $SU(N_f - N_c)$
gauge symmetry and $N_f$ flavors.

Symbolically, we say that

$$
SU(N_c) \leftrightarrow SU(N_f - N_c).
\tag{13.4.17}
$$

At first, this may seem like a rather strange duality, since duality connects 
theories with different gauge groups. However, it can be shown that these 
two theories represent the same fixed point, i.e., it is impossible to devise an 



experiment which will determine whether the Coulomb force 1/r is being 
mediated by electric or magnetic charges. 

The fact that the two gauge symmetries can be different can be explained by 
the fact that gauge symmetries are not real physical symmetries. They actually 
represent a redundancy of the system. The two theories should, however, have 
the same global symmetries. To match the particle content of the two theories, 
we note that the quarks and gluons in one theory can be reinterpreted as solitons 
(e.g., monopoles) of the elementary fields of the other theory. The two theories 
may look quite different, but they have the same long-distance physics. 

This phenomenon is actually rather common in two-dimensional physics, 
and is called quantum equivalence, but it is rather strange to see it manifested 
in a four-dimensional theory. 

This means that the free magnetic phase exhibited in this region is dual to a
free electric phase in the dual region $N_f > 3N_c$.

13.4.5 $\frac{3}{2}N_c < N_f < 3N_c$

In this region, the theory is asymptotically free. At low energies, the coupling 
constant grows larger. However, it does not grow to infinity, but reaches a finite 
value, a fixed point. 

Since the potential between elementary quarks and gluons goes like 1/r, 
we say that the theory is in the non-Abelian Coulomb phase. 

What is novel about this phase is that it is self-dual, i.e., the region $2N_c <
N_f < 3N_c$, which corresponds to the free electric phase, is dual to the free
magnetic phase region $3N_c/2 < N_f < 2N_c$ because of the duality between
$SU(N_c)$ gauge theory and $SU(N_f - N_c)$ gauge theory. In particular, both dual
regions have the same global group $SU(N_f) \times SU(N_f) \times U(1)_B \times U(1)_R$.

13.4.6 $N_f > 3N_c$

In this region, the theory is not asymptotically free. The electric charges are 
free in the infrared region while the magnetic ones are strongly coupled. The 
spectrum of particles at low energy is given by elementary quarks and gluons. 
In fact, since the potential between electric sources is weaker than in the
Coulomb phase and goes as $1/(r \log r)$, this means that we are in the
free electric phase.

As we pointed out earlier, this free electric phase is the dual to the
free magnetic phase mentioned above for the region $N_c + 2 \le N_f \le \frac{3}{2}N_c$.
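
The case-by-case discussion of Sections 13.4.1 through 13.4.6 can be collected into a small lookup, sketched below. The phase labels are taken from the text; the treatment of the boundary values $N_f = \frac{3}{2}N_c$ (assigned to the free magnetic phase) and $N_f = 3N_c$ (assigned to the free electric phase) is an assumption of this sketch, and the function names are hypothetical.

```python
def susy_qcd_phase(n_c: int, n_f: int) -> str:
    """Return the phase of N = 1 SU(N_c) SUSY QCD with N_f flavors."""
    if n_c < 2 or n_f < 1:
        raise ValueError("expected N_c >= 2 and N_f >= 1")
    if n_f < n_c:
        return "no ground state (runaway superpotential)"
    if n_f == n_c:
        return "confinement with chiral symmetry breaking"
    if n_f == n_c + 1:
        return "confinement with chiral symmetry"
    if 2 * n_f <= 3 * n_c:      # N_c + 2 <= N_f <= 3N_c/2
        return "free magnetic phase"
    if n_f < 3 * n_c:           # 3N_c/2 < N_f < 3N_c
        return "non-Abelian Coulomb phase"
    return "free electric phase"


def dual_colors(n_c: int, n_f: int) -> int:
    """Number of colors N_f - N_c of the dual theory, as in (13.4.17)."""
    if n_f < n_c + 2:
        raise ValueError("the dual gauge group needs N_f >= N_c + 2")
    return n_f - n_c


if __name__ == "__main__":
    n_c = 4
    for n_f in range(1, 14):
        print(n_f, susy_qcd_phase(n_c, n_f))
```

Note that applying `dual_colors` twice returns the original number of colors, as expected for a duality.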

So far we have only analyzed $SU(N_c)$ gauge theory. When we generalize
our results to $SO(N_c)$, we find many interesting new features.

13.4.7 $SO(N_c)$ SUSY Gauge Theory

When we replace SU(N) with SO(N), a number of features change in our
analysis.



As before, we find a duality map between two gauge theories with different 
gauge groups. In this case, we find the duality 

$$
SO(N_c) \leftrightarrow SO(N_f - N_c + 4).
\tag{13.4.18}
$$

But the most dramatic difference found here is oblique confinement, which 
refers to the condensation of dyons with both electric and magnetic charges. 

The phases are given by: 

• For $N_f = N_c - 2$, the theory is in the Coulomb phase.

• For $N_c - 2 < N_f \le \frac{3}{2}(N_c - 2)$, the theory is in the free magnetic phase, with
a composite gauge group $SO(N_f - N_c + 4)$.

• For $\frac{3}{2}(N_c - 2) < N_f < 3(N_c - 2)$, the theory is asymptotically free and
flows to a non-Abelian Coulomb phase.

• For $N_f > 3(N_c - 2)$, the theory is not asymptotically free and is in the free
electric phase. We have massless quarks.

In summary, a combination of duality, holomorphic superpotentials, and 
asymptotic freedom has allowed us to extract a considerable amount of non- 
perturbative information about supersymmetric QCD. We find that the classical 
moduli space of flat directions is sometimes modified by nonperturbative 
corrections, and that singularities of the moduli space correspond to gauge 
enhancement and states becoming massless. 

Now let us turn our attention to an exactly soluble model, supersymmetric 
N = 2 gauge theory. 


13.5 N = 2 SUSY Gauge Theory 

For the N = 2 supersymmetric SU(2) gauge theory, the holomorphic
prepotential allows us to make precise statements about the nature of strong/weak
duality [5, 6].

Originally, Montonen and Olive [7] suspected that this duality may apply
to the N = 4 supersymmetric gauge theory (whose beta function
vanishes to all orders in perturbation theory).

The N = 2 theory was deemed unsuitable for the Olive-Montonen conjecture,
since the electrically charged states are in a multiplet with spin $s \le 1$ but the
monopoles are in a multiplet with $s \le 1/2$. Thus, a symmetry could not be written
between the electric and magnetic sectors of the theory.

By contrast, the N = 4 supersymmetric gauge theory appeared much more
reasonable, since the electric and magnetic states were in the same
representation. (But when these models are actually solved, we find that the
N = 4 model is too constrained and trivial. By contrast, the N = 1 theory is
so complicated it cannot be solved analytically. Ironically, the N = 2 model
lies in the middle and is constrained enough to be exactly solvable.)

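The N = 2 theory is composed of superfields commonly found in the
standard N = 1 theory. We will find it convenient to put the chiral field $\Phi$ and
$W_\alpha$ into a single superfield $\Psi$. To do this, we must introduce a second set of $\theta$
variables, which we denote as $\tilde\theta_\alpha$ and $\bar{\tilde\theta}_{\dot\alpha}$. The $\Psi$ superfield is given by

$$
\Psi = \Phi(\tilde y, \theta) + \sqrt{2}\,\tilde\theta^\alpha W_\alpha(\tilde y, \theta) + \tilde\theta^\alpha\tilde\theta_\alpha\, G(\tilde y, \theta),
\tag{13.5.1}
$$

where

$$
\begin{aligned}
G(\tilde y, \theta) &= -\frac{1}{2}\int d^2\bar\theta\, \left[\Phi(\tilde y - i\theta\sigma\bar\theta, \theta, \bar\theta)\right]^\dagger \exp\left[-2gV(\tilde y - i\theta\sigma\bar\theta, \theta, \bar\theta)\right], \\
\tilde y^\mu &= x^\mu + i\theta\sigma^\mu\bar\theta + i\tilde\theta\sigma^\mu\bar{\tilde\theta}.
\end{aligned}
\tag{13.5.2}
$$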

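The action is then written as

$$
S = \frac{1}{16\pi}\,\mathrm{Im}\int d^4x\, d^2\theta\, d^2\tilde\theta\; \frac{1}{2}\,\tau\,\mathrm{Tr}\,\Psi^2,
\tag{13.5.3}
$$

whose expansion gives

$$
S = \frac{1}{16\pi}\,\mathrm{Im}\,\mathrm{Tr}\left[\tau\left(\int d^4x\, d^2\theta\; W^\alpha W_\alpha + \int d^4x\, d^2\theta\, d^2\bar\theta\; \Phi^\dagger e^{-2gV}\Phi\right)\right].
\tag{13.5.4}
$$

If we expand this action, we can collect all the nonpropagating terms, given
by

$$
\frac{1}{g^2}\int d^4x\; \mathrm{Tr}\left[\tfrac{1}{2}D^2 - g\phi^\dagger[D, \phi] + F^\dagger F\right].
\tag{13.5.5}
$$

If we solve for the auxiliary fields $D$ and $F$, we find

$$
V(\phi) = \tfrac{1}{2}\,\mathrm{Tr}\,[\phi, \phi^\dagger]^2.
\tag{13.5.6}
$$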

Notice that the potential vanishes for the flat directions, which are specified
by nonzero values of $\phi$.

This Higgs potential will vanish for $\langle\phi\rangle = \frac{1}{2}a\sigma_3$, where $\sigma_3$ is the Pauli
matrix and $a$ labels the flat directions of the theory. Nonvanishing values of $a$, in
turn, break the gauge group SU(2) down to U(1), but keep supersymmetry
intact, as desired.

We find it convenient to introduce an invariant parameter to describe the
moduli space, i.e.,

$$
u = \mathrm{Tr}\,\langle\phi^2\rangle.
\tag{13.5.7}
$$

Perturbatively, we have $a = \sqrt{2u}$. (In the nonperturbative region, we will
find that this relationship between $a$ and $u$ is broken.)

In general, SU(2) will break down to U(1) in the presence of nonvanishing
$\langle\phi\rangle$. (For higher $N$, the potential will vanish if $\phi$ is part of the Cartan subalgebra
of the group.) This means that the classical moduli space simply corresponds
to the complex plane, minus the origin, the point at which SU(2) is restored.
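
As a quick check of the classical statements, the sketch below evaluates the potential (13.5.6) and the modulus (13.5.7) for explicit 2 x 2 matrices. It assumes the potential has the form $\frac{1}{2}\mathrm{Tr}[\phi, \phi^\dagger]^2$ with no further overall constant; the helper names are illustrative only.

```python
def matmul(a, b):
    """Product of two 2x2 matrices stored as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]


def dagger(m):
    """Hermitian conjugate of a 2x2 matrix."""
    return [[m[j][i].conjugate() for j in range(2)] for i in range(2)]


def commutator(a, b):
    ab, ba = matmul(a, b), matmul(b, a)
    return [[ab[i][j] - ba[i][j] for j in range(2)] for i in range(2)]


def trace(m):
    return m[0][0] + m[1][1]


def potential(phi):
    """V = (1/2) Tr [phi, phi^dagger]^2, the assumed form of (13.5.6)."""
    c = commutator(phi, dagger(phi))
    return 0.5 * trace(matmul(c, c)).real


def u_modulus(phi):
    """u = Tr phi^2, as in (13.5.7)."""
    return trace(matmul(phi, phi))


a = 1.5 + 0.5j
phi_flat = [[a / 2, 0], [0, -a / 2]]   # <phi> = (a/2) sigma_3
phi_not_flat = [[0, 1], [0, 0]]        # not in the Cartan subalgebra

if __name__ == "__main__":
    print(potential(phi_flat), u_modulus(phi_flat), a * a / 2)
    print(potential(phi_not_flat))
```

The diagonal configuration has vanishing potential and $u = a^2/2$ for any complex $a$, while a configuration outside the Cartan subalgebra costs energy.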

In summary, we find that a nonvanishing $a$ breaks the SU(2) symmetry down
to U(1), but still preserves the supersymmetry. This is the reason why we can
use SUSY to determine the complete potential, since SUSY is unbroken. We
find that the moduli space is the complex plane, but is singular for $u = 0$,
where the original SU(2) symmetry is restored and there are new massless
particles, i.e., the gauge bosons.

Now turn on the quantum interactions. We can show that the most general
effective action is written in terms of an arbitrary, holomorphic function of $\Psi$:

$$
\frac{1}{16\pi}\,\mathrm{Im}\int d^4x\, d^2\theta\, d^2\tilde\theta\; \mathcal{F}(\Psi).
\tag{13.5.8}
$$

Notice that the crucial nonperturbative information is encoded entirely into a
single function $\mathcal{F}$.

Classically, this holomorphic function $\mathcal{F}$ reduces to

$$
\mathcal{F}_0 = \tfrac{1}{2}\,\tau\,\Psi^2.
\tag{13.5.9}
$$

Our task is now to find the quantum corrections to $\mathcal{F}_0$.

If we power expand $\mathcal{F}$, we find a large number of terms transforming under
higher representations of SU(2). In particular, we find terms like $\partial\mathcal{F}/\partial\Phi^a$,
where $a$ is a Lie algebra index. Now, we will adopt the Wilsonian approach
mentioned earlier to find the effective potential. The gauge group SU(2) will
be broken down to U(1) fields, which remain massless. We will integrate over
the $a = 1, 2$ indices, representing the massive fields, leaving only the massless
fields with $a = 3$. Thus, the effective action for the remaining massless U(1)
fields can be represented as follows:


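$$
\frac{1}{16\pi}\,\mathrm{Im}\int d^4x\left[\int d^2\theta\, d^2\bar\theta\; \Phi^\dagger\,\frac{\partial\mathcal{F}(\Phi)}{\partial\Phi} + \int d^2\theta\; \frac{\partial^2\mathcal{F}(\Phi)}{\partial\Phi^2}\, W^\alpha W_\alpha\right].
\tag{13.5.10}
$$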


Our task is now to find the exact form of $\mathcal{F}$, including all nonperturbative
quantum corrections. To obtain the basic structure of $\mathcal{F}$, we can carefully
calculate both perturbative and instanton quantum corrections. By explicit
calculation, the function $\mathcal{F}$ is modified to

$$
\mathcal{F}(\Psi) = \frac{i}{2\pi}\,\Psi^2 \ln\frac{\Psi^2}{\Lambda^2} + \sum_{k=1}^{\infty} \mathcal{F}_k \left(\frac{\Lambda}{\Psi}\right)^{4k} \Psi^2,
\tag{13.5.11}
$$

where the first term is generated by tree and loop diagrams, and the summation
is generated by instanton effects. Our goal in this chapter is to find the exact
expression for $\mathcal{F}$.

At first, it seems hopeless to get a simple, closed form for $\mathcal{F}$. But, in fact,
we will be able to find the exact form for $\mathcal{F}$ using the power of duality.

Let us first express everything in terms of $a$. We will write $\tau(a) = \mathcal{F}''(a)$
and $a_D = d\mathcal{F}(a)/da$. We showed previously that the BPS mass formula is
invariant under SL(2, Z). However, this is not enough. We must now show
that the action itself is also invariant under this transformation.

We begin by introducing an extra vector superfield $V_D$ as a Lagrange
multiplier. This superfield will simply enforce the Bianchi identity
$\mathrm{Im}\,(D_\alpha W^\alpha) = 0$. The advantage of introducing this seemingly extraneous
superfield $V_D$ is that, instead of integrating over the vector superfield $V$, we can
now integrate over the chiral spinor field $W_\alpha$ and $V_D$:


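$$
\begin{aligned}
&\int \mathcal{D}V \exp\left[\frac{i}{16\pi}\,\mathrm{Im}\int d^4x\, d^2\theta\; \mathcal{F}''(\Phi)\, W^\alpha W_\alpha\right] \\
&\quad \simeq \int \mathcal{D}W\, \mathcal{D}V_D \exp\left[\frac{i}{16\pi}\,\mathrm{Im}\int d^4x\left(\int d^2\theta\; \mathcal{F}''(\Phi)\, W^\alpha W_\alpha + \frac{1}{2}\int d^2\theta\, d^2\bar\theta\; V_D D_\alpha W^\alpha\right)\right].
\end{aligned}
\tag{13.5.12}
$$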


So far, we have done nothing. But now notice that we can integrate by parts
and flip $D_\alpha$, which acts on $W^\alpha$, so that it acts on $V_D$ instead. This integration
by parts proceeds as follows:

$$
\begin{aligned}
\int d^2\theta\, d^2\bar\theta\; V_D D_\alpha W^\alpha
&= -\int d^2\theta\, d^2\bar\theta\; D_\alpha V_D\, W^\alpha
= \int d^2\theta\; \bar D^2 \left(D_\alpha V_D\, W^\alpha\right) \\
&= \int d^2\theta\; (\bar D^2 D_\alpha V_D)\, W^\alpha \\
&= -4\int d^2\theta\; (W_D)_\alpha W^\alpha,
\end{aligned}
\tag{13.5.13}
$$

where we have used the fact that $\bar D_{\dot\beta} W^\alpha = 0$ and where we have introduced
the new superfield $W_D$, built from $V_D$ such that $(W_D)_\alpha =
-\frac{1}{4}\bar D^2 D_\alpha V_D$.

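The whole point of rewriting this expression in this way is that now we can
do the functional integral over $W$:

$$
\int \mathcal{D}V_D \exp\left[\frac{i}{16\pi}\,\mathrm{Im}\int d^4x\, d^2\theta \left(-\frac{1}{\mathcal{F}''(\Phi)}\, W_D^\alpha W_{D\alpha}\right)\right].
\tag{13.5.14}
$$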


This is the key result. Notice that we have now replaced the effective coupling
$\tau(a) = \mathcal{F}''(a)$ of the usual N = 1 Yang-Mills action by $-1/\tau(a)$.

We see that $a$ has turned into $a_D$ and $\tau$ has turned into $-1/\tau$, which is the $S$
symmetry. Since the theory was trivially invariant under $\tau \to \tau + 1$, and since these two
transformations generate the entire group, we see that the action is invariant
under the full SL(2, Z).

Now let us begin the process of writing the full quantum theory by analyzing 
the singularities and asymptotic behavior of the holomorphic function. 

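Notice that the theory is asymptotically free. This means that in the large $a$
and large $u$ limit, we should recover the perturbative results, so that $u = \frac{1}{2}a^2$.
To find $a_D$, we notice that the tree and loop contributions to $\mathcal{F}$ can now be
written as

$$
\mathcal{F}(a) \sim \frac{i}{2\pi}\, a^2 \ln\frac{a^2}{\Lambda^2}.
\tag{13.5.15}
$$

Because $a_D = \mathcal{F}'(a)$, we can now write

$$
a_D \sim \frac{i}{\pi}\, a\left(\ln\frac{a^2}{\Lambda^2} + 1\right)
\tag{13.5.16}
$$

in the asymptotic regime.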

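To explore the singularity structure of the theory, let $u$ go around infinity

$$
u \to e^{2\pi i} u.
\tag{13.5.17}
$$

Then (since $u = \frac{1}{2}a^2$ in the large $a$ region), we have $a \to -a$ and we find

$$
a_D \to \frac{i}{\pi}(-a)\left(\ln\frac{e^{2\pi i} a^2}{\Lambda^2} + 1\right) = -a_D + 2a.
\tag{13.5.18}
$$

Because of the presence of cuts, we see that neither $a$ nor $a_D$ is a single-valued
function on the complex $u$ plane. In fact, they transform into each other when $u$
goes around infinity. So

$$
\begin{pmatrix} a_D \\ a \end{pmatrix} \to M_\infty \begin{pmatrix} a_D \\ a \end{pmatrix},
\qquad
M_\infty = \begin{pmatrix} -1 & 2 \\ 0 & -1 \end{pmatrix}.
\tag{13.5.19}
$$

The monodromy matrix simply represents how $a_D$ and $a$ transform into
each other as $u$ goes around infinity.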
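
The monodromy at infinity can be checked numerically by following the logarithm continuously around the circle instead of using a principal-branch `log`. The sketch below assumes the asymptotic form $a_D = \frac{i}{\pi}a\,(\ln(a^2/\Lambda^2) + 1)$, which is the form consistent with (13.5.18), and parametrizes the loop $u \to e^{2\pi i t}u$ by $t \in [0, 1]$; the function name and parameters are illustrative.

```python
import cmath
import math


def a_and_a_dual(a0: float, t: float, scale: float = 1.0):
    """Return (a, a_D) after u has been rotated by the phase exp(2*pi*i*t).

    a0 is the positive real starting value of a; since u = a^2/2, the
    variable a picks up half the phase of u, and ln(a^2) shifts by 2*pi*i*t.
    """
    a = a0 * cmath.exp(1j * math.pi * t)
    log_term = math.log(a0 ** 2 / scale ** 2) + 2j * math.pi * t
    a_dual = (1j / math.pi) * a * (log_term + 1)
    return a, a_dual


if __name__ == "__main__":
    a_start, ad_start = a_and_a_dual(3.0, 0.0)
    a_end, ad_end = a_and_a_dual(3.0, 1.0)
    print(a_end, -a_start)                   # a    -> -a
    print(ad_end, -ad_start + 2 * a_start)   # a_D  -> -a_D + 2a
```

The two printed pairs agree to rounding error, reproducing the action of the matrix with rows $(-1, 2)$ and $(0, -1)$ on the column $(a_D, a)$.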

One crucial difference between the classical and quantum moduli space is
that the point $u = 0$, which was a singularity in the classical theory, is no longer
a singularity in the quantum theory. This may seem strange, but quantum effects
mean that $u$ is no longer approximated by $\frac{1}{2}a^2$ in the strong coupling region.

What is the full analytic structure of the quantum moduli space? We will
find that there are three singularities, but that the point $u = 0$ is not one of
them.

We notice that the original theory has a $U(1)_R$ symmetry which is broken by
quantum corrections down to $Z_8$. This discrete symmetry, in turn, transforms
$u \to -u$. This means that singularities must come in pairs, $u = \pm u_0$.

We are interested in the points in moduli space where the monopole becomes
massless. For a monopole, we have the BPS relation $m^2 = 2|a_D|^2$, which
vanishes when $a_D = 0$. We define the point $u_0$ which satisfies $a_D(u_0) = 0$,
i.e., the point where the monopole becomes massless. For points near $u_0$, we
therefore find that $a_D \sim c_0(u - u_0) + \cdots$.

To find the structure of moduli space around this monopole configuration,
we will interchange the role of $a$ and $a_D$. Before, for the electric charge, we
found that $a_D(u) \sim a(u)\ln a(u) + a(u)$ for $a(u) \to 0$. Now, reversing the role
of $a$ and $a_D$, we find the dual relationship

$$
a(u_0) \sim a_D(u_0)\ln a_D(u_0) + a_D(u_0).
$$

This, in turn, means that

$$
a_D \sim c_0(u - u_0),
\tag{13.5.20}
$$

$$
a \sim a_0 + \frac{i}{\pi}\, c_0 (u - u_0)\ln(u - u_0).
\tag{13.5.21}
$$

Now let $u$ go around the point $u_0$, i.e., $u - u_0 \to e^{2\pi i}(u - u_0)$. Then we find

$$
\begin{pmatrix} a_D \\ a \end{pmatrix} \to \begin{pmatrix} a_D \\ a - 2a_D \end{pmatrix} = M_{u_0}\begin{pmatrix} a_D \\ a \end{pmatrix},
\qquad
M_{u_0} = \begin{pmatrix} 1 & 0 \\ -2 & 1 \end{pmatrix}.
\tag{13.5.22}
$$

Lastly, we can find the monodromy matrix around the point $-u_0$ by
observing that

$$
M_\infty = M_{u_0} M_{-u_0}.
\tag{13.5.23}
$$

This gives us the last monodromy matrix

$$
M_{-u_0} = \begin{pmatrix} -1 & 2 \\ -2 & 3 \end{pmatrix}.
\tag{13.5.24}
$$

These three elements generate a group, called $\Gamma_2$. This is the group of unimodular
matrices congruent to the identity mod 2:

$$
\Gamma_2 = \left\{ \begin{pmatrix} a & b \\ c & d \end{pmatrix} \in SL(2, Z),\; a = d = 1 \bmod 2,\; b = c = 0 \bmod 2 \right\}.
\tag{13.5.25}
$$

We can also find the monodromy matrix which is associated with a specific
electric and magnetic charge. We recall that the BPS charge $Z$ can be written
as

$$
Z = (n_m, n_e)\begin{pmatrix} a_D \\ a \end{pmatrix} = (n_m, n_e)\, M M^{-1} \begin{pmatrix} a_D \\ a \end{pmatrix}.
\tag{13.5.26}
$$

Since a massless state responsible for the singularity should be invariant
under the monodromy matrix, the vector $(n_m, n_e)$ should be a left eigenvector of
$M$ with unit eigenvalue

$$
(n_m, n_e) = (n_m, n_e) M.
\tag{13.5.27}
$$

From this, it is now a simple matter to determine the explicit form of the
monodromy matrix corresponding to a specific electric and magnetic charge:

$$
M(n_m, n_e) = \begin{pmatrix} 1 + 2n_m n_e & 2n_e^2 \\ -2n_m^2 & 1 - 2n_m n_e \end{pmatrix}.
\tag{13.5.28}
$$


This means, for example, that the massless state corresponding to the point
$u = u_0$ is a monopole with charge (1, 0), and the state corresponding to the
point $u = -u_0$ is a dyon with charge (1, -1).

It is now easy to see that, with these three monodromy matrices, we can
generate all possible monodromy matrices corresponding to dyons of arbitrary
charge. For example,

$$
M_\infty^{-k} M_{u_0} M_\infty^{k} = \begin{pmatrix} 1 - 4k & 8k^2 \\ -2 & 1 + 4k \end{pmatrix} = M(1, -2k),
\tag{13.5.29}
$$

where $M(n_m, n_e)$ is the monodromy matrix corresponding to the dyon with
magnetic charge $n_m$ and electric charge $n_e$. Similarly,

$$
M_\infty^{-k} M_{-u_0} M_\infty^{k} = \begin{pmatrix} -1 - 4k & 2 + 8k + 8k^2 \\ -2 & 3 + 4k \end{pmatrix} = M(1, -1 - 2k).
\tag{13.5.30}
$$
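
The matrix identities in this subsection are exact integer statements, so they can be verified mechanically. The sketch below checks the factorization of the monodromy at infinity, the charge formula (13.5.28), the left-eigenvector condition (13.5.27), and the two conjugation formulas for a range of $k$; the matrix entries are the ones quoted in the text, while the helper names are illustrative.

```python
def mul(a, b):
    """Product of two 2x2 integer matrices stored as tuples of rows."""
    return tuple(
        tuple(sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2))
        for i in range(2)
    )


def inverse(m):
    """Inverse of a 2x2 matrix with unit determinant."""
    (a, b), (c, d) = m
    assert a * d - b * c == 1
    return ((d, -b), (-c, a))


def power(m, k):
    """Integer power of m; negative k uses the inverse."""
    base = m if k >= 0 else inverse(m)
    result = ((1, 0), (0, 1))
    for _ in range(abs(k)):
        result = mul(result, base)
    return result


def charge_matrix(n_m, n_e):
    """Monodromy matrix M(n_m, n_e) of (13.5.28)."""
    return ((1 + 2 * n_m * n_e, 2 * n_e ** 2),
            (-2 * n_m ** 2, 1 - 2 * n_m * n_e))


def row_times(v, m):
    """Row vector v acting on m from the left."""
    return (v[0] * m[0][0] + v[1] * m[1][0], v[0] * m[0][1] + v[1] * m[1][1])


def in_gamma_2(m):
    """True if m is unimodular and congruent to the identity mod 2."""
    (a, b), (c, d) = m
    return (a * d - b * c == 1 and a % 2 == 1 and d % 2 == 1
            and b % 2 == 0 and c % 2 == 0)


M_INF = ((-1, 2), (0, -1))
M_U0 = ((1, 0), (-2, 1))
M_MINUS_U0 = ((-1, 2), (-2, 3))

if __name__ == "__main__":
    print(mul(M_U0, M_MINUS_U0) == M_INF)
    for k in range(-2, 3):
        conj = mul(mul(power(M_INF, -k), M_U0), power(M_INF, k))
        print(k, conj, conj == charge_matrix(1, -2 * k))
```

Because every step is integer arithmetic, agreement here is exact rather than approximate.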

So far, we have found three singularities. But how many singularities are
there in total? In general, there are several ways in which one can argue that
there are only three singularities, but none of them is absolutely rigorous.
One convenient way is to use number theory. Let $p$ be the total number of
singularities. Then there must be monodromy matrices corresponding to these
singularities, $M_{u_i} = M(n_m^{(i)}, n_e^{(i)})$, where $i$ labels the various singularities. Then,
as usual, the monodromy matrices must satisfy the following relationship:

$$
M_\infty = M_{u_1} M_{u_2} \cdots M_{u_p},
\tag{13.5.31}
$$

i.e., going around all the singularities is the same as simply going around the
point at infinity. But this relation also means that a series of nontrivial identities
must be satisfied with integer solutions. Using number theory, one can prove
that this relation cannot hold for low values of $p$ greater than 2, but it has
not been proven for the general case. Hence, we strongly believe, but cannot
rigorously prove, that there are only three singularities.

To gain some intuitive insight into this problem, we observe that this problem
closely resembles an elementary problem found in ordinary quantum mechanics
in solid-state physics. Let a solid be represented by a potential $V$ which is
periodic in $x$ and has singular points. Then the Schrödinger equation reads

$$
\left[-\frac{d^2}{dx^2} + V(x)\right]\psi(x) = 0, \qquad V(x + 2\pi) = V(x).
\tag{13.5.32}
$$

This has two solutions, $\psi_1$ and $\psi_2$. Now let $x$ roam around the singularities.
This, in turn, rotates $\psi_1$ into $\psi_2$, creating the monodromy matrix

$$
\begin{pmatrix} \psi_1 \\ \psi_2 \end{pmatrix}(x + 2\pi) = M \begin{pmatrix} \psi_1 \\ \psi_2 \end{pmatrix}(x).
\tag{13.5.33}
$$

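Let us now impose the same monodromy properties on this Schrödinger
potential that we found for the supergauge theory. With a little work, we find

$$
V(z) = -\frac{1}{4}\,\frac{1}{(z + 1)(z - 1)}.
\tag{13.5.34}
$$

We can now solve for $\psi_1$ and $\psi_2$. This, in turn, gives us a
solution for $a$ and $a_D$:

$$
\begin{aligned}
a_D(u) &= i\psi_2(u) = i\,\frac{u - 1}{2}\, F\!\left(\tfrac{1}{2}, \tfrac{1}{2}, 2; \tfrac{1 - u}{2}\right), \\
a(u) &= -2i\psi_1(u) = \sqrt{2}\,(u + 1)^{1/2}\, F\!\left(-\tfrac{1}{2}, \tfrac{1}{2}, 1; \tfrac{2}{u + 1}\right),
\end{aligned}
\tag{13.5.35}
$$

where $F$ is the usual hypergeometric function.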



To gain further intuition into this problem, let us try to interpret SL(2, Z)
mathematically, by describing the complex $u$ plane in terms of the symmetries
of a torus. Specifically, we wish to equate two entirely different objects: the $\tau$
which appears in the action with the period matrix $\tau$ of a particular torus. This
will allow us to derive expressions for $a$ and $a_D$ as integrals over the cycles of
a torus.

If we have a torus, we can slice it in two inequivalent ways, along two
different cycles. If we then unravel the sliced torus, we find a parallelogram,
such that the opposite sides are identified with each other. If we place one corner
of the parallelogram at the origin of the complex plane and normalize one
adjacent side to unit length, then $\tau$ represents the complex coordinate of the
other adjacent corner. It is called the period matrix. Then
SL(2, Z), operating on $\tau$, simply generates topologically equivalent tori. We
would like to reinterpret our exact solution in terms of this torus.

Now let us look at the complex structure of $x$ space and its Riemann cuts.
The analytic structure of $x$ space looks like a double-sheeted plane, such that
the two sheets are connected by cuts. The cuts extend from $-1$ to $+1$, and from
$u$ to infinity.

Now add the point at infinity to both sheets. Then each sheet becomes a
sphere. These two spheres are connected to each other via the cuts, so we have
topologically created a torus.

Notice that there are two cycles that can be drawn on this torus. One can be
drawn around the cut which goes between $-1$ and $+1$, which we call $\gamma_2$. The
other cycle, which we call $\gamma_1$, goes from 1 to $u$ on the first sheet, and returns
from $u$ to 1 on the second sheet.

Now let $u$ move around the points $\pm 1$ and $\infty$. Each time $u$ moves around
one of these singular points, the cycles $\gamma_i$ turn into combinations of each other.
Specifically, we find


$$
\begin{pmatrix} \gamma_1 \\ \gamma_2 \end{pmatrix} \to M \begin{pmatrix} \gamma_1 \\ \gamma_2 \end{pmatrix}, \qquad M \in SL(2, Z).
\tag{13.5.36}
$$

The point of this exercise is to write $a_D$ and $a$ as integrals over one-forms
defined over each of these cycles

$$
a_D = \oint_{\gamma_1} \lambda, \qquad a = \oint_{\gamma_2} \lambda.
\tag{13.5.37}
$$

Since the cycles $\gamma_i$ rotate into each other via the monodromy matrix $M$, then
the integrals over these cycles must also transform in precisely the same way

$$
\begin{pmatrix} a_D \\ a \end{pmatrix} \to M \begin{pmatrix} a_D \\ a \end{pmatrix}.
\tag{13.5.38}
$$

Our next job is to find this one-form $\lambda$. This will then solve the problem
completely.

To solve for $\lambda$, it will be helpful to introduce the formalism of “elliptic
curves.” This will allow us to write $\lambda$ almost immediately.



We introduce an elliptic curve

$$
y^2 = (x - 1)(x + 1)(x - u),
\tag{13.5.39}
$$

where the $x$ plane has the cut structure mentioned before. Then there is a
theorem which states that the $\tau$ of the elliptic curve can be written as

$$
\tau = \frac{\oint_{\gamma_1} \lambda_1}{\oint_{\gamma_2} \lambda_1},
\tag{13.5.40}
$$

where

$$
\lambda_1 = \frac{dx}{y}.
\tag{13.5.41}
$$

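Now we can calculate $\lambda$. By definition, we know that

$$
\tau(u) = \mathcal{F}''(a) = \frac{da_D}{da} = \frac{da_D/du}{da/du}.
\tag{13.5.42}
$$

If we compare the two expressions for $\tau$, we easily find that, up to a constant factor,

$$
\frac{d\lambda}{du} \propto \lambda_1.
\tag{13.5.43}
$$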

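This means that we can write

$$
\lambda = \frac{\sqrt{2}}{2\pi}\,\frac{(x - u)\,dx}{y}.
\tag{13.5.44}
$$

Finally, this allows us to write the answer we desired

$$
\begin{aligned}
a_D(u) &= \frac{\sqrt{2}}{\pi}\int_1^u \frac{dx\,\sqrt{x - u}}{\sqrt{x^2 - 1}}, \\
a(u) &= \frac{\sqrt{2}}{\pi}\int_{-1}^{1} \frac{dx\,\sqrt{x - u}}{\sqrt{x^2 - 1}}.
\end{aligned}
\tag{13.5.45}
$$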


This completes our problem. The $a_D$ and $a$ variables are now given entirely
in terms of integrals over a torus, whose period matrix $\tau$ is precisely the $\tau$
which appears in the action. Then the SL(2, Z) transformations of the action
can be seen as arising from the modular transformations of the torus.
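As a quick numerical sanity check on (13.5.45), the sketch below evaluates the $a(u)$ integral for large real $u$ and compares it with the asymptotic behavior $a \simeq \sqrt{2u}$. It is only an illustration under stated assumptions: the name `a_period`, the substitution $x = \cos t$, the midpoint rule, and the branch choice (for real $u > 1$ the integrand is taken to be the positive real quantity $\sqrt{(u - x)/(1 - x^2)}$) are ours and do not come from the text.

```python
import math


def a_period(u: float, steps: int = 20000) -> float:
    """Midpoint-rule estimate of a(u) = (sqrt(2)/pi) * int_{-1}^{1} dx sqrt(x-u)/sqrt(x^2-1).

    Assumption: u is real with u > 1 and the branch is chosen so that the
    integrand equals sqrt((u - x)/(1 - x^2)). With x = cos(t) this becomes
    (sqrt(2)/pi) * int_0^pi sqrt(u - cos t) dt, which has no endpoint singularity.
    """
    h = math.pi / steps
    total = sum(math.sqrt(u - math.cos((k + 0.5) * h)) for k in range(steps))
    return math.sqrt(2.0) / math.pi * total * h


u = 100.0
# Ratio of the numerical period to the expected large-u behavior sqrt(2u).
ratio = a_period(u) / math.sqrt(2.0 * u)
print(ratio)
```

The ratio approaches 1 as $u$ grows, which is the behavior expected from asymptotic freedom.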


13.6 SU(N) N = 2 SUSY Gauge Theory 

Many of the same methods we used for the SU(2) model can be generalized
to the SU(N) case and other Lie groups [9-11]. The key to generalizing the
previous discussion is to find the appropriate invariant variables, and then guess
their exact form by analyzing the holomorphic structure of the superpotential.

For example, for $A_2 = SU(3)$, we can introduce the variables $\langle\phi\rangle =
\mathrm{diag}(a_1, a_2 - a_1, -a_2)$. Notice that the group SU(3) has two elements in its
Cartan subalgebra which are mutually commuting. These variables $a_1$ and $a_2$,
however, are not invariant under the Weyl group of SU(3), which mixes the
elements of the Cartan subalgebra. For example, the elements of the Weyl group,
acting on $a_1, a_2$, yield the following transformations:

$$r_1: (a_1, a_2) \to (a_2 - a_1, a_2),$$

$$r_2: (a_1, a_2) \to (a_1, a_1 - a_2), \tag{13.6.1}$$

$$r_3: (a_1, a_2) \to (-a_2, -a_1). \tag{13.6.2}$$

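But now introduce the Casimir variables, which are invariant under
the Weyl group

$$u_1 = \tfrac{1}{2}\, \mathrm{Tr}\, \langle\phi^2\rangle, \qquad u_2 = \tfrac{1}{3}\, \mathrm{Tr}\, \langle\phi^3\rangle. \tag{13.6.3}$$

We wish to find the relationship between the $a_i$ and the $u_i$. Let us define the
invariant function

$$W_{A_2} = \det(x\mathbf{1} - \phi) = \prod_{i=1}^{3} (x - e_i) = x^3 - u_1 x - u_2. \tag{13.6.4}$$

The relations between the various variables are given by $u_1 = a_1^2 + a_2^2 - a_1 a_2$
and $u_2 = a_1 a_2 (a_1 - a_2)$, while the eigenvalues $e_i$ are the diagonal entries
$a_1$, $a_2 - a_1$, and $-a_2$ of $\langle\phi\rangle$.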
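The Weyl invariance of the Casimir variables can be checked directly. The following is a minimal sketch in exact integer arithmetic; the helper names (`eigenvalues`, `casimirs`, `WEYL`) and the sample point $(a_1, a_2) = (5, 2)$ are illustrative choices of ours, and the three reflections are taken to act as in (13.6.1) and (13.6.2).

```python
from fractions import Fraction


def eigenvalues(a1, a2):
    """Diagonal entries of <phi> = diag(a1, a2 - a1, -a2)."""
    return (a1, a2 - a1, -a2)


def casimirs(a1, a2):
    """Return (u1, u2) = (Tr phi^2 / 2, Tr phi^3 / 3) as exact fractions."""
    e = eigenvalues(a1, a2)
    u1 = Fraction(sum(x ** 2 for x in e), 2)
    u2 = Fraction(sum(x ** 3 for x in e), 3)
    return u1, u2


# Weyl reflections acting on (a1, a2), as assumed from (13.6.1)-(13.6.2).
WEYL = {
    "r1": lambda a1, a2: (a2 - a1, a2),
    "r2": lambda a1, a2: (a1, a1 - a2),
    "r3": lambda a1, a2: (-a2, -a1),
}

a1, a2 = 5, 2
u1, u2 = casimirs(a1, a2)

# Each reflection only permutes the eigenvalues, so (u1, u2) must not change.
invariant = all(casimirs(*r(a1, a2)) == (u1, u2) for r in WEYL.values())

# Closed forms quoted in the text.
closed_forms = (u1 == a1 ** 2 + a2 ** 2 - a1 * a2) and (u2 == a1 * a2 * (a1 - a2))
print(u1, u2, invariant, closed_forms)
```

Each reflection merely permutes the three eigenvalues, which is why any symmetric function of them, including $u_1$ and $u_2$, is unchanged.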

The point of introducing this invariant function is that it generalizes the
elliptic curve, which is now, in slightly different form, given by

$$y^2 = (W_{A_2})^2 - \Lambda^6. \tag{13.6.5}$$

When $\Lambda = 0$, we recover the classical case. For generic values of the
eigenvalues $e_i$, the gauge group SU(3) is broken down to U(1) × U(1). The
classical moduli space is simply given by the complex $u_1$ and $u_2$ spaces, minus
the singular points. In particular, when any two of the eigenvalues are equal,
then the classical moduli space becomes singular, and the unbroken symmetry
is SU(2) × U(1). So the classical moduli space is just $\{u_1, u_2\}/\Sigma$, where $\Sigma$
represents the singular points where the eigenvalues become equal.

For the quantum case, $\Lambda \neq 0$, and the hyperelliptic curve generates six
eigenvalues

$$y^2 = \prod_{i=1}^{3} \prod_{\pm} (x - e_i^{\pm}). \tag{13.6.6}$$

The quantum moduli space now becomes more complicated. The $x$ plane, as
before, is double-sheeted, so we can form two spheres by adding the points at
infinity. We also have six eigenvalues $e_i^{\pm}$, which form six branch points in $x$
space. We can join these six branch points into three branch cuts. By joining
the two spheres along three branch cuts, we form a torus with two holes, i.e.,
genus $g = 2$.






To show how the variables transform, we introduce the generalization of the
vector $(a_D, a)$, which is now given by $(a_{D,1}, a_{D,2}, a_1, a_2)$, where

$$a_{D,i} = \frac{\partial \mathcal{F}}{\partial a_i}. \tag{13.6.7}$$

Let us perform a Weyl transformation $r_1$. The $u_i$ are invariant under a Weyl
transformation, but the $a_i$ are not. Then one can show that the various $a_i$ and $a_{D,j}$
all rotate among each other, given by

$$\begin{pmatrix} a_{D,1} \\ a_{D,2} \\ a_1 \\ a_2 \end{pmatrix} \to \begin{pmatrix} -1 & 0 & 2 & 1 \\ 1 & 1 & -1 & -1 \\ 0 & 0 & -1 & 1 \\ 0 & 0 & 0 & 1 \end{pmatrix} \begin{pmatrix} a_{D,1} \\ a_{D,2} \\ a_1 \\ a_2 \end{pmatrix}. \tag{13.6.8}$$


Carrying on in this way, we can generalize the monodromy of the torus and
generalize the results of the SU(2) case to SU(3). However, we would like to
present the general case.

If we generalize this discussion to $A_{n-1} = SU(n)$, then we follow the same
steps. Let $H_k$ label the elements of the Cartan subalgebra of $A_{n-1}$. Then

$$\phi = \sum_{k=1}^{n-1} a_k H_k. \tag{13.6.9}$$

As before, we then introduce the invariant function

$$W_{A_{n-1}} = \det[x\mathbf{1} - \phi] = \prod_{i=1}^{n} (x - e_{\lambda_i}), \tag{13.6.10}$$

where $e_{\lambda_i}$ are the eigenvalues, and $\lambda_i$ are the weights of the $n$-dimensional
fundamental representation of the group.

The invariant variables $u_k$ can then be read off from the previous equation

$$W_{A_{n-1}} = x^n - \sum_{l=0}^{n-2} u_{l+2}(a)\, x^{n-2-l}. \tag{13.6.11}$$

Comparing these two expansions for $W_{A_{n-1}}$, we find an expression linking
the noninvariant $e_{\lambda_i}$ with the invariants $u_k$:

$$u_k = (-1)^{k+1} \sum_{j_1 < \cdots < j_k} e_{\lambda_{j_1}} \cdots e_{\lambda_{j_k}}. \tag{13.6.12}$$

We have thus established the relations between $a_i$, $e_{\lambda_j}$, and the invariant
variables $u_k$.
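The link between the eigenvalues and the invariants is just Vieta's formulas for a traceless matrix. The following sketch, with helper names and sample eigenvalues that are purely illustrative, expands $\prod_i (x - e_{\lambda_i})$ and compares it, coefficient by coefficient, with $x^n - \sum_l u_{l+2}\, x^{n-2-l}$, where the $u_k$ are built as signed elementary symmetric polynomials.

```python
from itertools import combinations
from math import prod


def invariants_from_eigenvalues(eigs):
    """u_k = (-1)^(k+1) * (k-th elementary symmetric polynomial), k = 2..n."""
    n = len(eigs)
    return {
        k: (-1) ** (k + 1) * sum(prod(c) for c in combinations(eigs, k))
        for k in range(2, n + 1)
    }


def coefficients_from_roots(eigs):
    """Coefficients of prod_i (x - e_i), highest power of x first."""
    coeffs = [1]
    for e in eigs:
        shifted = coeffs + [0]                  # coeffs * x
        scaled = [0] + [e * c for c in coeffs]  # coeffs * e
        coeffs = [s - t for s, t in zip(shifted, scaled)]
    return coeffs


def coefficients_from_invariants(n, u):
    """Coefficients of x^n - sum_{l=0}^{n-2} u_{l+2} x^(n-2-l), highest first."""
    return [1, 0] + [-u[k] for k in range(2, n + 1)]


# Sample traceless eigenvalues for SU(4); any set summing to zero would do.
eigs = [3, -1, -2, 0]
u = invariants_from_eigenvalues(eigs)
lhs = coefficients_from_roots(eigs)
rhs = coefficients_from_invariants(len(eigs), u)
print(u, lhs == rhs)
```

The vanishing $x^{n-1}$ coefficient is the tracelessness condition, which is why the sum over invariants starts at $u_2$.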

Now let us compare the complex structure of SU(2) with that of SU(n). In
the case of SU(2), we recall that the classical moduli space consisted of the
complex $u$ plane minus the origin, where there was a singularity associated
with massless particles. Now, the classical moduli space for SU(n) consists
of the complex spaces of all the $u_k$, minus the singularities (where two of the
eigenvalues collapse into the same point, i.e., $e_{\lambda_i} = e_{\lambda_j}$). At the singularities
where these points coincide, the gauge group is enhanced and certain particles
become massless.

Now let us analyze the generalization of the elliptic curve in the presence of
quantum corrections. For the case of SU(n), the genus $g = n - 1$ hyperelliptic
curve corresponds to

$$y^2 = (W_{A_{n-1}})^2 - \Lambda^{2n}. \tag{13.6.13}$$

When $\Lambda = 0$, we recover the classical theory. This hyperelliptic curve defines,
as before, a double-sheeted complex $x$ plane. When we add the point at infinity
to both sheets, we have two spheres. These spheres are connected by $g + 1$
Riemann cuts. The branch points of these cuts correspond to the zeros of the
above expression

$$y^2 = \prod_{i=1}^{n} \prod_{\pm} (x - e_{\lambda_i}^{\pm}). \tag{13.6.14}$$

Notice that the eigenvalues have now split into two $e_{\lambda_i}^{\pm}$, which form pairs of
branch points on the $x$ plane. We can then form $g + 1$ branch cuts joining these
pairs of branch points. When we join these two spheres along their cuts, we have
a genus $g$ torus.
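The counting in this construction can be summarized in a few lines. This is only a bookkeeping sketch with a hypothetical helper name, assuming (as in the text) that the curve for SU(n) has $2n$ distinct branch points that are paired into cuts.

```python
def genus_for_su(n: int) -> int:
    """Genus of the two-sheeted surface built from the SU(n) hyperelliptic curve.

    Assumes y^2 is a degree-2n polynomial in x with 2n distinct roots, so the
    branch points pair up into n cuts and two spheres glued along n cuts give
    a surface of genus n - 1.
    """
    branch_points = 2 * n
    cuts = branch_points // 2
    return cuts - 1


# SU(2) gives the torus of the previous section; SU(3) gives genus 2.
print(genus_for_su(2), genus_for_su(3))
```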

The whole point of this discussion is to find a single one-form $\lambda$ out of which
we can construct all the holomorphic functions, e.g., for $a_i$ and $a_{D,j}$. Now that
we have an explicit representation of the hyperelliptic curve, we can introduce
$n - 1$ holomorphic differentials (abelian differentials of the first kind) defined
on the $x$ plane

$$\omega_{n-i} = \frac{x^{i-1}\, dx}{y}, \qquad i = 1, \ldots, n - 1. \tag{13.6.15}$$

Out of these abelian integrals, we can form the generalized period matrix

$$A_{ij} = \oint_{\alpha_j} \omega_i = \frac{\partial a_j}{\partial u_{i+1}}, \qquad B_{ij} = \oint_{\beta_j} \omega_i = \frac{\partial a_{D,j}}{\partial u_{i+1}}, \tag{13.6.16}$$

where $\alpha_i$ and $\beta_j$ are cycles defined on the genus $g$ torus, such that the $\alpha_i$ cycles
intersect the $\beta_j$ cycles according to $(\alpha_i, \beta_j) = \delta_{ij}$. However, the $\alpha_i$ cycles
never intersect each other, and neither do the $\beta_i$ cycles, i.e., $(\alpha_i, \alpha_j) = 0$ and
$(\beta_i, \beta_j) = 0$.

Now assume that a one-form $\lambda$ exists such that $a_i$ and $a_{D,i}$ can be written in
the form

$$a_{D,i} = \oint_{\beta_i} \lambda, \qquad a_i = \oint_{\alpha_i} \lambda. \tag{13.6.17}$$

Our goal is to find $\lambda$. To find such a one-form, we take the previous equation
and take its derivative with respect to $u_{i+1}$. This then gives us a relationship
between the abelian differential $\omega_i$ and $\lambda$. We find, up to a constant
normalization, the consistency constraint

$$\frac{\partial \lambda}{\partial u_{i+1}} \propto \omega_i = \frac{x^{n-i-1}\, dx}{y}. \tag{13.6.18}$$

Fortunately, it is easy to integrate the previous equation. We find

$$\lambda = \frac{dx}{4\sqrt{2}\,\pi} \log\left[\frac{W_{A_{n-1}} + \sqrt{W_{A_{n-1}}^2 - \Lambda^{2n}}}{W_{A_{n-1}} - \sqrt{W_{A_{n-1}}^2 - \Lambda^{2n}}}\right]. \tag{13.6.19}$$

Now that we have an explicit representation of the one-form $\lambda$, we can
calculate all the holomorphic functions of the exact theory. This completes our
discussion of N = 1 and N = 2 SUSY gauge theory.

Lastly, we mention that all this has direct bearing on string theory. We will 
find that many of the basic features of N = 1 and N = 2 SUSY gauge 
theory carry over to string theory. In fact, supersymmetric gauge theory is a 
“laboratory” in which we can test many of the theoretical concepts of string 
theory. 


• For example, we will find that the duality group SL(2, Z) is the same
S duality group found in IIB superstrings, allowing us to analyze the 
nonperturbative region of IIB strings. 

• The singularities of moduli space, where the symmetry is enhanced, are 
also found in string theory, where they are called conifold singularities. 
(This enhancement of symmetry will prove crucial in string theory, since it 
will allow us to show the duality between two very different string theories 
with different gauge groups. The discrepancy in the spectrum is resolved 
once we have enhanced symmetry.) 

• The elliptic curve becomes generalized to a Calabi-Yau manifold in string 
theory. 

• The complex quantum moduli space, which was a torus of genus g, 
becomes generalized to the moduli of Calabi-Yau spaces in string theory. 

• The $\Gamma_M$ monodromy group becomes generalized to T duality in string
theory.

• The duality found in N = 2 supersymmetric theories has a direct analog
in the duality between Type IIA strings compactified on K3, and heterotic
strings compactified on a torus $T^4$.


13.7 Summary 

Duality has emerged as perhaps our most powerful way in which to analyze
the nonperturbative behavior of both supergauge theories and superstrings.
In particular, holomorphic superpotentials and prepotentials that appear in
supersymmetric theories can often be obtained exactly by determining their
singularities and their asymptotic behavior.

The Wess-Zumino model, for example, has a potential which can be
determined by demanding that it be holomorphic and that it conserve certain
quantum numbers. In this way, we can derive the nonperturbative behavior of
the theory.

Similarly, for N = 1 supergauge theory, with $N_c$ colors and $N_f$ flavors,
we can use a combination of: (a) holomorphic superpotentials; (b) asymptotic
freedom; (c) superconformal algebra; and (d) 't Hooft anomaly matching
to determine the nonperturbative superpotential

$$W_{\rm eff} = (N_c - N_f)\, \frac{\Lambda^{(3N_c - N_f)/(N_c - N_f)}}{(\det Q\tilde{Q})^{1/(N_c - N_f)}}, \tag{13.7.1}$$

as well as the nonperturbative bound states that emerge in the theory. In
particular, we find that superpotentials have “flat directions,” parametrized by
moduli, which describe inequivalent vacua.

In this way, we can categorize the N = 1 supergauge theory into several 
phases according to the values of Nf and N c : 

• The Coulomb phase, in which the quarks and gluons are free and the 
potential goes as 1/r. 

• The free electric phase, where the potential goes as $1/(r \log r\Lambda)$.

• The free magnetic phase, which is dual to the free electric phase, where
the potential goes as $\log (r\Lambda)/r$.

• The Higgs phase, where the potential goes as a constant. 

• The confinement phase, which is dual to the Higgs phase, where the 
potential goes as r. 

More interesting, however, is the N = 2 supergauge theory, where we see
the ideas of Olive and Montonen being realized.

The N = 2 theory is constructed out of a superfield $\Psi$, which contains both
the vector superfield $V$ (which contains the Yang-Mills potential), and the
chiral matter superfield $\Phi$.

The N = 2 action is given as

$$\mathrm{Im} \int d^4x\, d^2\theta\, d^2\tilde{\theta} \left(\frac{\tau}{32\pi}\, \mathrm{Tr}\, \Psi^2\right), \tag{13.7.2}$$

whose expansion gives

$$S = \mathrm{Im}\, \mathrm{Tr} \int d^4x\, \frac{\tau}{16\pi} \left[\int d^2\theta\, W^\alpha W_\alpha + \int d^2\theta\, d^2\bar{\theta}\, \Phi^\dagger e^{-2gV} \Phi\right]. \tag{13.7.3}$$

If we integrate out the auxiliary fields, we find the scalar potential

$$V(\phi) = \frac{1}{g^2}\, \mathrm{Tr}\, [\phi, \phi^\dagger]^2. \tag{13.7.4}$$





Notice that if $\phi$ belongs to the Cartan subalgebra of SU(2), i.e., if $\phi = \tfrac{1}{2} a \sigma_3$,
then the potential vanishes along the flat directions parametrized by $a$. We could
also have used the invariant variable $u = \mathrm{Tr}\, \langle\phi^2\rangle$.

The key point here is that quantum corrections will alter the above formula
to the following:

$$\frac{1}{16\pi}\, \mathrm{Im} \int d^4x\, d^4\theta\, \mathcal{F}(\Psi), \tag{13.7.5}$$

where $\mathcal{F}$ is an undetermined holomorphic function of $\Psi$, which can be
determined if we know the singularity structure of this holomorphic function and
its asymptotic values.

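If we integrate out all massive fields, leaving a Wilsonian effective action,
then the action becomes

$$\frac{1}{16\pi}\, \mathrm{Im} \int d^4x \left[\int d^4\theta\, \Phi^\dagger \frac{\partial \mathcal{F}(\Phi)}{\partial \Phi} + \int d^2\theta\, \frac{\partial^2 \mathcal{F}(\Phi)}{\partial \Phi^2}\, W^\alpha W_\alpha\right]. \tag{13.7.6}$$

In the Wilsonian effective action, only U(1) fields survive, and $\mathcal{F}$ can be
expressed as a function of $a$ and $u$.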

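The key is to exploit the analytic properties of the holomorphic potential. For
example, we know the asymptotic behavior of $a$ and $a_D$ near infinity because
of asymptotic freedom

$$a_D = \frac{i}{\pi}\, a \left(\ln \frac{a^2}{\Lambda^2} + 1\right), \qquad a = \sqrt{2u}. \tag{13.7.7}$$

If we move around the point at infinity via $u \to e^{2\pi i} u$, then $a$ and $a_D$
transform into each other via the transformation

$$\begin{pmatrix} a_D \\ a \end{pmatrix} \to M_\infty \begin{pmatrix} a_D \\ a \end{pmatrix}, \qquad M_\infty = \begin{pmatrix} -1 & 2 \\ 0 & -1 \end{pmatrix}. \tag{13.7.8}$$

Similarly, one can show that there are three singularities in the $u$ plane, at
$u = u_0, -u_0, \infty$, which allows us to define the monodromy matrices $M_{u_0}$ and
$M_{-u_0}$:

$$M_{u_0} = \begin{pmatrix} 1 & 0 \\ -2 & 1 \end{pmatrix}, \qquad M_{-u_0} = \begin{pmatrix} -1 & 2 \\ -2 & 3 \end{pmatrix}. \tag{13.7.9}$$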

Since the system has SL(2, Z) symmetry, our goal is to find a torus, with
period matrix $\tau$, which precisely matches the $\tau$ found for the N = 2 supergauge
theory.
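The three monodromies are not independent: a loop around infinity can be deformed into a loop around the two finite singularities. The sketch below checks this consistency condition in integer arithmetic. The matrix entries are the standard SU(2) Seiberg-Witten values, written here as assumptions, and the helper names and the sample vector are ours.

```python
def matmul(a, b):
    """Product of two 2x2 integer matrices stored as nested tuples."""
    return tuple(
        tuple(sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2))
        for i in range(2)
    )


def det(m):
    """Determinant of a 2x2 matrix."""
    return m[0][0] * m[1][1] - m[0][1] * m[1][0]


def act(m, v):
    """Action of a 2x2 matrix on the column vector v = (a_D, a)."""
    return tuple(sum(m[i][k] * v[k] for k in range(2)) for i in range(2))


# Assumed monodromies around u = infinity, u = +u0, and u = -u0.
M_INF = ((-1, 2), (0, -1))
M_PLUS = ((1, 0), (-2, 1))
M_MINUS = ((-1, 2), (-2, 3))

# Consistency: the loop at infinity equals the product of the finite loops.
product = matmul(M_PLUS, M_MINUS)

# Around infinity, (a_D, a) -> (-a_D + 2a, -a); try it on a sample vector.
sample = act(M_INF, (7, 3))
print(product == M_INF, [det(m) for m in (M_INF, M_PLUS, M_MINUS)], sample)
```

All three matrices have integer entries and unit determinant, so they lie in SL(2, Z), as the duality argument requires.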

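To do this, we introduce an elliptic curve

$$y^2 = (x - 1)(x + 1)(x - u), \tag{13.7.10}$$

where the $x$ plane has the cut structure mentioned before. Then there is a
theorem which states that the $\tau$ of the elliptic curve can be written as

$$\tau(u) = \frac{\oint_{\gamma_1} \lambda_1}{\oint_{\gamma_2} \lambda_1}, \tag{13.7.11}$$

where

$$\lambda_1 = \frac{dx}{y}. \tag{13.7.12}$$

Now we can calculate the form $\lambda$, whose line integrals around the torus give
us $a$ and $a_D$:

$$a_D = \oint_{\gamma_1} \lambda, \qquad a = \oint_{\gamma_2} \lambda. \tag{13.7.13}$$

By definition, we know that

$$\tau(u) = \frac{\partial^2 \mathcal{F}}{\partial a^2} = \frac{d a_D}{d a} = \frac{d a_D/du}{d a/du}. \tag{13.7.14}$$

If we compare the two expressions for $\tau$, we easily find that, up to a
constant normalization,

$$\frac{\partial \lambda}{\partial u} \propto \lambda_1. \tag{13.7.15}$$

This means that we can write

$$\lambda \propto \frac{(x - u)\, dx}{y}. \tag{13.7.16}$$

Finally, this allows us to write the answer we desired

$$a_D(u) = \frac{\sqrt{2}}{\pi} \int_{1}^{u} \frac{dx\, \sqrt{x - u}}{\sqrt{x^2 - 1}}, \qquad a(u) = \frac{\sqrt{2}}{\pi} \int_{-1}^{1} \frac{dx\, \sqrt{x - u}}{\sqrt{x^2 - 1}}. \tag{13.7.17}$$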


Similarly, the techniques that we have used can be generalized in the case
of SU(n), where we now define a torus with genus $g = n - 1$. As before, we
can write a form $\lambda$ whose integrals over cycles give us $a_i$ and $a_{D,j}$.

The point of investigating Seiberg-Witten theory is that many of the
techniques pioneered here, such as duality, modular invariance, etc., carry over to
superstring theory, allowing us for the first time to probe the nonperturbative
region of string theory.




References 


1. For a review, see K. Intriligator and N. Seiberg, Nucl. Phys. Proc. Suppl. 45BC, 1
(1996).

2. N. Seiberg, Nucl. Phys. B435, 129 (1995).

3. N. Seiberg, Phys. Lett. 318B, 469 (1993).

4. I. Affleck, M. Dine, and N. Seiberg, Nucl. Phys. B241, 493 (1984); Nucl. Phys.
B256, 557 (1985).

5. N. Seiberg and E. Witten, Nucl. Phys. B426, 19 (1994).

6. N. Seiberg and E. Witten, Nucl. Phys. B431, 484 (1994).

7. C. Montonen and D. Olive, Phys. Lett. 72B, 117 (1977).

8. E. B. Bogomol’nyi, Sov. J. Nucl. Phys. 24, 449 (1976); M. K. Prasad and C. M.
Sommerfield, Phys. Rev. Lett. 35, 760 (1975).

9. A. Klemm, W. Lerche, S. Theisen, and S. Yankielowicz, Phys. Lett. B344, 169
(1995).

10. P. Argyres and A. Faraggi, Phys. Rev. Lett. 74, 3931 (1995).

11. M. Douglas and S. Shenker, Nucl. Phys. B447, 271 (1995).



CHAPTER 14 


M-Theory and Duality 


14.1 Introduction 

Einstein once wondered whether God had any choice in creating the universe. 
He suspected that the constraints on creating the universe were so stringent 
that there might only be a unique solution. 

Therefore, one of the persistent mysteries of string theory is why there should
be five finite and totally self-consistent superstring theories. If string theory is
to fulfill the dream of a theory of all quantum forces, then one suspects that it
should be unique, rather than suffering from a five-fold multiplicity. Although
the heterotic $E_8 \otimes E_8$ string is the leading candidate to explain the low-energy
spectrum of particles, one wonders what roles, if any, are played by the other
four superstring theories.

The lesson from the previous chapter was that duality in supergauge theories
[1,2] was a powerful constraint on supersymmetric theories, allowing one
to show the equivalence between two seemingly different theories. We
suspect, therefore, that duality might show that two superstring theories, which
have entirely different low-energy spectra, may be identical when one includes
nonperturbative effects.

In fact, we will show that duality is powerful enough to show the following: 


• Perturbative duality relations, called T duality, can show the equivalence
between Type IIA and IIB strings, as well as the two heterotic strings.

• More important, nonperturbative duality relations, called S duality, can
reveal the strong coupling behavior of the other string theories, showing
that the IIB string is self-dual, and that the Type I string is dual to the
SO(32) heterotic string.

• Surprisingly, nonperturbative S duality relations can be established
between the Type IIA string and a new type of 11-dimensional theory, called
M-theory [3-8]. Very little is known about M-theory, except that it
reduces to 11-dimensional supergravity in the low-energy limit and contains
both membranes and 5-branes. This allows us to unify all five superstring
theories into a single master theory.

• Entirely new, soliton-like objects, called D-branes, will play an essential
role in completing this nonperturbative picture. D-branes will have a
variety of important applications, such as deriving the Bekenstein-Hawking
formula for black hole entropy via statistical mechanics.

• When N parallel D-branes become coincident, the world volume theory
reduces to the action of U(N) supergauge theory.

• Dual relations can also be established between four-dimensional SU(N)
supergauge theories and 10-dimensional anti-de Sitter space. This means
that superstring theory can provide persuasive arguments that
four-dimensional gauge theories, like QCD, exhibit confinement and a mass
gap.

In summary, we find that duality is emerging as perhaps the most powerful 
tool in our arsenal to explore the previously forbidden nonperturbative region 
of superstring theory. 


14.2 Unifying the Five Superstring Theories 

Before duality was introduced, it was puzzling that string perturbation theory 
could be written self-consistently via five superstring theories, each with a 
different low-energy supergravity spectrum: 

• Type IIA string theory reduces to N = 2A (nonchiral) supergravity. 

• Type IIB string theory reduces to N = 2B (chiral) supergravity. 

• $E_8 \otimes E_8$ heterotic string theory reduces to N = 1 supergravity coupled to
an $E_8 \otimes E_8$ Yang-Mills multiplet.

• SO(32) heterotic string theory reduces to N = 1 supergravity coupled to
an SO(32) Yang-Mills multiplet.

• Type I string theory, which contains both open and closed strings, reduces
to N = 1 supergravity coupled to an SO(32) Yang-Mills multiplet.

Each theory is quite distinct from the others. For example, Type I strings
are unoriented, breakable, and come with both open and closed strings. All the
others are based on oriented, closed strings and are unbreakable. Also, both
the SO(32) Type I string and the SO(32) heterotic string have the same
low-energy spectrum, so it was thought that they may somehow be identical. But
the Type I string has both open and closed strings, while the SO(32) heterotic
string has only closed strings.





Similarly, the first quantized action for each of these theories is quite distinct.
Consider the Green-Schwarz formalism, which is invariant under space-time
supersymmetry. In 10 dimensions, the smallest spinor representation of the
Lorentz group, the Majorana-Weyl spinor, is 16 dimensional. In Type IIA and
IIB strings, we have two sets of these spinors. In the Type IIA(B) theory, the
two spinors $\theta^1$ and $\theta^2$ are of opposite (identical) chiralities. This will allow
us to write N = 2 supersymmetry. Likewise, the heterotic string is based on
a single chiral spinor with only N = 1 supersymmetry. For each of these
theories, we can write the global invariant $\Pi_i^\mu$:

$$\Pi_i^\mu = \begin{cases} \partial_i X^\mu - i \bar{\theta}_+ \Gamma^\mu \partial_i \theta_+, & \text{Heterotic}, \\ \partial_i X^\mu - i \bar{\theta} \Gamma^\mu \partial_i \theta, & \text{IIA}, \\ \partial_i X^\mu - i \delta_{jk} \bar{\theta}^j \Gamma^\mu \partial_i \theta^k, & \text{IIB}. \end{cases} \tag{14.2.1}$$


Then the Nambu-Goto part of the first quantized action is given by

$$g_{ij} = \Pi_i^\mu \Pi_{j\mu}, \qquad S_1 \propto \int d^2\sigma\, \sqrt{-\det g_{ij}}. \tag{14.2.2}$$


The above action is only invariant under global supersymmetry, not local
supersymmetry. The complete locally supersymmetric action is obtained by
adding the Wess-Zumino-like term:

$$h_{WZ} = \begin{cases} \Pi^\mu\, d\bar{\theta}_+ \Gamma_\mu\, d\theta_+, & \text{Heterotic}, \\ \Pi^\mu\, d\bar{\theta}\, \Gamma_{11} \Gamma_\mu\, d\theta, & \text{IIA}, \\ s_{ij}\, \Pi^\mu\, d\bar{\theta}^i\, \Gamma_\mu\, d\theta^j, & \text{IIB}, \end{cases} \tag{14.2.3}$$

where $\Gamma_{11}$ is the product of all $\Gamma$ matrices and $s_{ij}$ is a 2 × 2 matrix which
equals the Pauli matrix $\sigma_z$. $h_{WZ}$ can be expressed as $h_{WZ} = db$. If one carefully
extracts $b$ from $h_{WZ}$, then the Wess-Zumino-like part of the action is given by

$$S_2 \propto \int d^2\sigma\, \epsilon^{ij} b_{ij}. \tag{14.2.4}$$

Then the string action equals $S_1 + S_2$.

If we generalize this construction by adding (unoriented) open and closed
strings, we obtain the Type I string. Similarly, we can write heterotic strings
which are free of anomalies only for gauge groups $E_8 \otimes E_8$ and SO(32). This
gives us five superstrings.


14.3 T Duality 

The first duality that was discovered was T duality [9-11], which is perturbative
in nature. Let us begin with a 10-dimensional field theory and compactify it
on $S_1$. As usual, the Kaluza-Klein modes are quantized according to

$$p = \frac{n}{R} \tag{14.3.1}$$

for a circle of radius $R$ and an integer $n$.

For strings, however, we have additional modes, corresponding to a closed
string wrapping itself $m$ times around $S_1$:

$$(p_L, p_R) = \left(\frac{n}{2R} + mR,\ \frac{n}{2R} - mR\right), \tag{14.3.2}$$

where the $n$, as usual, arises from the Kaluza-Klein excitations of the circle,
but $m$ labels the number of times the string winds around the circle.

Notice that the mass spectrum for $M^2$ is invariant under [9]:

$$R \leftrightarrow \frac{1}{2R} \tag{14.3.3}$$

when we interchange $n \leftrightarrow m$.
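The invariance is easy to verify with exact rational arithmetic. The sketch below assumes the momenta take the form $(p_L, p_R) = (n/2R + mR,\ n/2R - mR)$ and that $M^2$ depends on them only through $p_L^2$ and $p_R^2$ (plus oscillator terms that the duality does not touch); the function names and sample values are illustrative.

```python
from fractions import Fraction


def momenta(n, m, R):
    """Left and right momenta (p_L, p_R) = (n/2R + mR, n/2R - mR)."""
    kaluza_klein = Fraction(n) / (2 * R)
    winding = m * R
    return kaluza_klein + winding, kaluza_klein - winding


def t_dual(n, m, R):
    """T duality: exchange n and m, and send R to 1/(2R)."""
    return m, n, 1 / (2 * R)


R = Fraction(3, 2)
n, m = 2, 5
p_left, p_right = momenta(n, m, R)
p_left_dual, p_right_dual = momenta(*t_dual(n, m, R))

# p_L is unchanged while p_R flips sign, so p_L^2 and p_R^2 are both invariant.
print(p_left, p_right, p_left_dual, p_right_dual)
```

Only the right-moving momentum changes sign, which anticipates the statement that the duality acts on one set of movers.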

This is a highly unusual symmetry, one which links the large-scale behavior
of string theory to its small-scale structure. It means that a string theory defined
on a compactified radius $R$ is, perturbatively, indistinguishable from its dual
theory compactified on radius $1/(2R)$. Unlike point-particle field theory, the
string cannot differentiate between these two regions. Notice that this duality
symmetry, which interchanges winding modes with Kaluza-Klein modes, is
strictly a result of the geometry of string theory and its winding modes and
does not appear in point-particle theories.

If we reexpress this duality in terms of conformal field theory, we can write
it as

$$\bar{\partial} X \to \bar{\partial} X, \qquad \partial X \to -\partial X. \tag{14.3.4}$$

Notice the signs change for one set of movers.

Now the interesting part comes when we introduce fermions. In the
Neveu-Schwarz-Ramond formalism, this duality generalizes to

$$\psi \to -\psi, \qquad \tilde{\psi} \to \tilde{\psi}. \tag{14.3.5}$$

Now here is the key point. This dual transformation reverses the sign of
the 10-dimensional left-moving chirality operator, constructed from fermionic
zero modes

$$\Gamma_{11} = \psi_0^0 \psi_0^1 \cdots \psi_0^9. \tag{14.3.6}$$

We conclude that the T duality transformation flips the chirality of the
strings. We can show that this symmetry persists to all orders in perturbation
theory. Thus, the nonchiral Type IIA string compactified on a circle of radius
$R$ is indistinguishable perturbatively from the chiral IIB string compactified
on a circle of radius $1/(2R)$. We denote this by [10]:

$$T: \text{IIA} \leftrightarrow \text{IIB}. \tag{14.3.7}$$

In other words, the Type IIA and IIB theories are really the same theory.
They are just two extreme points along a continuum of vacua created by the
compactification process. If we take $R \to 0$ or $\infty$, we can recover one theory
or its dual.

Similarly, this T duality can be extended to the case of the heterotic string.
As we recall, the vacua of the heterotic string can be expressed in terms of the
Narain lattice [12]. If we compactify the heterotic string down to $d$ dimensions,
this means that the left-moving sector has $26 - d$ dimensions that have curled
up, and the right-moving sector has $10 - d$ dimensions compactified. These
dimensions can be compactified onto a lattice $\Gamma_L$ and $\Gamma_R$.

If we construct the single-loop amplitude for heterotic strings, we find that
modular invariance forces us to have lattices which are even and self-dual. In
addition, it forces us to impose the constraint

$$\Gamma_L \cdot \Gamma_L - \Gamma_R \cdot \Gamma_R \in Z, \tag{14.3.8}$$

where $Z$ refers to the integers. Notice that the minus sign indicates that this
is a Lorentzian lattice. There is, however, an additional degree of freedom
which is not fixed by modular invariance. We can always rotate the lattice by
$SO(26 - d, 10 - d)$ and still satisfy modular invariance, with each configuration
representing a different vacuum [11]. But there are still redundancies within
$SO(26 - d, 10 - d)$. For example, the mass operator $M^2$ is invariant under
separate $SO(26 - d) \otimes SO(10 - d)$ rotations acting separately on the left-
and right-moving modes. Finally, we must divide out by the action of the T
duality, which is a discrete group which merely reshuffles the points on the
lattice without changing the physics

$$T = SO(26 - d, 10 - d, Z). \tag{14.3.9}$$

This is a generalization of the T duality we found earlier which mixes discretely
$R \leftrightarrow 1/(2R)$.

In summary, the moduli space of inequivalent vacua can be represented by

$$\text{Moduli Space} = \frac{SO(26 - d, 10 - d)}{SO(26 - d) \otimes SO(10 - d) \otimes T}, \tag{14.3.10}$$

where the number of independent parameters in the moduli space $\mathcal{M}_{m,n}$ is $mn$.
(In the literature, $\mathcal{M}_{26-d,10-d}$ is often written as

$$SO(26 - d, 10 - d, Z) \backslash SO(26 - d, 10 - d) / SO(26 - d) \otimes SO(10 - d), \tag{14.3.11}$$

but for clarity, we shall use the notation which simply divides the naive moduli
space by all the redundant factors.)
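The parameter count $mn$ follows from subtracting group dimensions, since dividing by the discrete group does not change the dimension. A minimal sketch, with hypothetical helper names and using the fact that $SO(m, n)$ has the same dimension as $SO(m + n)$:

```python
def dim_so(k: int) -> int:
    """Dimension of SO(k), which is also the dimension of SO(p, q) for p + q = k."""
    return k * (k - 1) // 2


def moduli_dimension(m: int, n: int) -> int:
    """Dimension of SO(m, n) / (SO(m) x SO(n)); the discrete quotient adds nothing."""
    return dim_so(m + n) - dim_so(m) - dim_so(n)


def heterotic_moduli_dimension(d: int) -> int:
    """Number of moduli for the heterotic string compactified down to d dimensions."""
    return moduli_dimension(26 - d, 10 - d)


# Compactifying to d = 9 and d = 4 dimensions.
print(heterotic_moduli_dimension(9), heterotic_moduli_dimension(4))
```

For a single compactified dimension this gives 17 parameters, and for a compactification to four dimensions it gives 132.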

We find that the T-duality group $SO(26 - d, 10 - d, Z)$ contains both the
$E_8 \otimes E_8$ heterotic string as well as the SO(32) heterotic string as extremal
points in the same space. Symbolically, we find [12]:

$$T: E_8 \otimes E_8 \leftrightarrow SO(32). \tag{14.3.12}$$

Thus, we now have gone from four superstrings down to three.


14.4 S Duality

14.4.1 Type IIA and M-Theory 

To explore the implications of duality for the Type IIA string, it will be helpful
to remind ourselves of the history of the old D = 11 supergravity theory [13].

This theory is the largest supergravity action compatible with a maximum
of spin-2 particles. Originally, it held great hope of being a unified field theory.
However, it suffered from a series of fatal flaws. First, it was probably
nonrenormalizable. Using superspace methods, one could show that supersymmetric
counterterms could be written for the theory. Unless a miracle cancelled the
coefficients of these counterterms, the theory was nonrenormalizable. Second,
the theory, when compactified on a manifold, could not yield the chirality found
in the Standard Model. As a consequence, this theory was largely ignored for
many years.

However, duality forces us to reconsider both of these fatal objections. First, we
will find that the 11-dimensional supergravity theory is just the low-energy
sector of a higher theory, whose higher-order corrections may cancel the
divergences. Second, if one compactifies on nonmanifolds (e.g., line segments)
then one can introduce chirality.

We begin by writing the bosonic action for D = 11 supergravity, based on
a metric tensor $g_{MN}$ and on an antisymmetric third-rank tensor $A_{MNP}$:

S=-!t/>,| V z J [k + J.| F |2] 

'Ml J 

+ Mn ^’’ (14.4.1) 

where $F_{M_1 \cdots M_4}$ is the field tensor constructed out of antisymmetrized derivatives
of $A_{M_1 M_2 M_3}$.

Our goal is to compare this 11-dimensional action with the 10-dimensional
action for IIA strings. In addition to the graviton $g_{\mu\nu}$ and dilaton $\phi$, we have
an antisymmetric, second-rank tensor $B_{\mu\nu}$ coming from the product of two
Neveu-Schwarz fields. (We recall that the bosonic sector of the closed string
in the Neveu-Schwarz-Ramond formalism comes from the product of two
NS operators or two R operators, coming from the left and right movers, i.e.,
the bosonic spectrum is spanned by the states $NS_L \otimes NS_R$ or $R_L \otimes R_R$. R-R
states exist for Type IIA, IIB, and I strings, but there are no R-R states for the
heterotic string.)

In the NS-NS sector, Type IIA and Type IIB have the same massless fields

$$\text{NS-NS:} \quad \{g_{\mu\nu}, B_{\mu\nu}, \phi\}. \tag{14.4.2}$$

But the R-R sector for the Type IIA theory contains the additional fields:

$$\text{R-R:} \quad \{C_\mu, A_{\mu\nu\rho}\}. \tag{14.4.3}$$

The low-energy action for IIA strings is just the 10-dimensional nonchiral
supergravity with N = 2 supersymmetry. If we let $K = dC$, $H = dB$, and
$G = dA$, then the action for the massless fields (to lowest order) is

S = j d w x{J=ie- 2<l> [R + 4\d<t>\ 2 -\\H\ 2 ] 

- V=g[\K \ 2 + nld 2 ]} + ^JGaGaB. (14.4.4) 

Years ago, it was noticed that the D = 11 supergravity theory, when
compactified on a circle, reduced to 10-dimensional supergravity if we threw
away all the Kaluza-Klein modes. We find the following breakdown of the
11-dimensional fields into 10-dimensional fields:

$$g_{MN} \to (g_{\mu\nu}, C_\mu, \phi), \qquad A_{MNP} \to (A_{\mu\nu\rho}, B_{\mu\nu}), \tag{14.4.5}$$

where the radius of the eleventh dimension is given by:

$$R_{11} = e^{2\phi/3}. \tag{14.4.6}$$

More precisely, the 11-dimensional metric tensor and antisymmetric field
can be decomposed as

$$ds^2 = g_{MN}\, dx^M dx^N = e^{-2\phi/3} g_{\mu\nu}\, dx^\mu dx^\nu + e^{4\phi/3} (dy - dx^\mu C_\mu)^2,$$

$$A = \tfrac{1}{6}\, dx^\mu \wedge dx^\nu \wedge dx^\rho A_{\mu\nu\rho} + \tfrac{1}{2}\, dx^\mu \wedge dx^\nu \wedge dy\, B_{\mu\nu}. \tag{14.4.7}$$

So far, we have done nothing new. But the reanalysis of Witten and Townsend
has revolutionized our understanding of this well-known correspondence [14,
15].

Before, we threw away the Kaluza-Klein modes. But now let us retain these
modes and keep the compactification radius $R_{11}$ finite. If we analyze the
relationship between the D = 11 supergravity action compactified on a circle
of radius $R_{11}$ and the D = 10 Type IIA action, we see a relationship between
$R_{11}$ and $e^\phi$. But $e^\phi$ is equal to the string coupling constant. More precisely, we
have

$$R_{11} = (g_s)^{2/3}. \tag{14.4.8}$$

This is a remarkable relationship, because it means that the strong coupling
region of Type IIA string theory corresponds to a D = 11 theory, whose
low-energy action is given by D = 11 supergravity. Also, the Kaluza-Klein modes,
which we previously threw away, can now be reinterpreted as bound states of
the Type IIA string. (We will return to this question of bound states and BPS
states shortly.)

From this relationship, we see clearly why this correspondence between
D = 10 superstrings and a hidden, D = 11 theory, called M-theory, was
missed. To any finite order in perturbation theory, we would never see this
11-dimensional theory, which only emerges fully in the $g_s \to \infty$ limit. Plus,
bound states are notoriously difficult to analyze, and hence the Kaluza-Klein
modes (reinterpreted as BPS states) were also missed.

Furthermore, we can also see how the $E_8 \otimes E_8$ heterotic string fits into
this picture [16]. Since this theory is chiral, we cannot expect to find it via a
compactification on a manifold. However, $S_1/Z_2$ is not a manifold. It is a circle
with points identified under a reflection, i.e., it is a line segment. This discrete
symmetry acts on $S_1$ via $x^{11} \to -x^{11}$. This additional constraint places a restriction on
the fermions of the theory, making them chiral and reducing N = 2 to N = 1
supersymmetry.

In general, this process of introducing chirality introduces anomalies.
However, since M-theory itself is supposedly anomaly-free, this means that the
compactification process introduces new terms, at the endpoints of the line
segment, which cancel these anomalies. Since the only groups which are
anomaly-free are $E_8 \otimes E_8$ and SO(32), and since we have to introduce these
groups at the ends of the line segment, we have the freedom to place an $E_8$
gauge theory at each endpoint. This, in turn, yields the chiral, N = 1, $E_8 \otimes E_8$
heterotic string.

In summary, we have shown, at least to lowest order, that Type IIA string
theory is S dual to a new, D = 11 theory called M-theory, whose lowest-order
term is given by D = 11 supergravity. Also, M-theory, when compactified on
a line segment, is dual to the $E_8 \otimes E_8$ string

$$S: \ \text{M-theory on } S_1 \leftrightarrow \text{IIA},$$

$$S: \ \text{M-theory on } S_1/Z_2 \leftrightarrow E_8 \otimes E_8. \tag{14.4.9}$$


14.4.2 Type IIB 

S duality was first conjectured in string theory along the lines found in
supergauge theories, i.e., as a symmetry of the torus [17-20]. Applying the same
techniques to the Type IIB string, it was found that it is self-dual.

The Type IIB theory has, in addition to the fields coming from the NS-NS
sector, fields coming from the R-R sector: another antisymmetric, second-rank
tensor $B'_{\mu\nu}$, a scalar field $l$, and a fourth-rank, antisymmetric tensor
$C_{\mu\nu\sigma\rho}$:

$$\text{R-R}:\ \{l,\ B'_{\mu\nu},\ C_{\mu\nu\sigma\rho}\}. \tag{14.4.10}$$

The low-energy action for Type IIB theory is

$$
S = \int d^{10}x\,\sqrt{g}\,\Big\{ e^{-2\phi}\big[R + 4|d\phi|^2 - \tfrac{1}{3}|H|^2\big] - 2|dl|^2
 - \tfrac{1}{3}|H' - lH|^2 - \tfrac{1}{60}|M^+|^2 \Big\} - \tfrac{1}{48}\int C^+ \wedge H \wedge H',
\tag{14.4.11}
$$

where we have dropped all fermion terms and all higher terms in the curvature,
where $H = dB$ and $H' = dB'$, and where $M = dC$ and $M$ is self-dual. (In
this action, we have deliberately neglected the rather subtle point that there
exists no simple covariant action of a self-dual antisymmetric field.)

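To simplify matters, we let $g_{\mu\nu} \to e^{\phi/2} g_{\mu\nu}$, which gives us

$$
S = \int d^{10}x\,\sqrt{g}\,\Big\{ R - 2\big[|d\phi|^2 + e^{2\phi}|dl|^2\big] - \tfrac{1}{60}|M^+|^2 - \tfrac{1}{3}e^{-\phi}|H|^2
 - \tfrac{1}{3}e^{\phi}|H' - lH|^2 \Big\} - \tfrac{1}{48}\int C^+ \wedge H \wedge H'.
\tag{14.4.12}
$$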
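The powers of $e^{\phi}$ in the rescaled action can be checked by simple bookkeeping. The following is a minimal sketch, assuming the rescaling is $g_{\mu\nu} \to e^{\phi/2} g_{\mu\nu}$ in D = 10 and tracking only the overall scaling of each term (the derivative-of-$\phi$ corrections generated by $R$ are not tracked). The function name and the dictionary of terms are illustrative choices, not notation from the text.

```python
from fractions import Fraction

D = 10                  # space-time dimension
WEYL = Fraction(1, 2)   # assumed rescaling: g -> exp(WEYL * phi) * g

def einstein_frame_power(string_frame_power, inverse_metrics):
    """Power of e^phi multiplying a term after the metric rescaling.

    sqrt(g) carries D/2 powers of the metric, and each index contraction
    carries one inverse metric.
    """
    return (Fraction(string_frame_power)
            + WEYL * Fraction(D, 2)
            - WEYL * inverse_metrics)

# (power of e^phi in the string-frame action, number of inverse metrics)
terms = {
    "R":         einstein_frame_power(-2, 1),
    "|dl|^2":    einstein_frame_power(0, 1),
    "|H|^2":     einstein_frame_power(-2, 3),
    "|H'-lH|^2": einstein_frame_power(0, 3),
    "|M+|^2":    einstein_frame_power(0, 5),
}

for name, power in terms.items():
    print(f"{name:10s} -> exp({power} * phi)")
```

Under these assumptions the curvature and five-form terms lose their dilaton prefactor, the axion kinetic term acquires $e^{2\phi}$, and the two three-form terms acquire $e^{-\phi}$ and $e^{\phi}$, respectively.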

We recall that the Type IIA theory is M-theory compactified on a circle
of radius $R_{11}$. We also recall that the Type IIA theory is T dual to Type IIB
strings when compactified on a circle of radius $R_{10}$. Therefore, we expect
that, to lowest order, supergravity compactified on a torus $S_1 \times S_1$ should be
equivalent to Type IIB strings compactified on a circle. In particular, we expect

$$g_{\mathrm{IIB}} = \frac{R_{11}}{R_{10}}. \tag{14.4.13}$$

But notice that $S_1 \otimes S_1$ forms a torus, and the modular group of the torus
is $SL(2, Z)$. In particular, a torus described by $R_{11}/R_{10}$ is related to a torus
described by $R_{10}/R_{11}$. But this means that the Type IIB theory described by
coupling constant $g$ is equivalent to the Type IIB theory described by $1/g$, i.e.,
the theory is self-dual. This is a nontrivial prediction of M-theory. To check
this prediction, we first notice that the Type IIB action is indeed invariant under
an $SL(2, R)$ symmetry given by


$$\tau \to \frac{a\tau + b}{c\tau + d}, \tag{14.4.14}$$

where

$$\tau = l + i e^{-\phi}, \tag{14.4.15}$$

where $a, b, c, d$ are real numbers obeying $ad - bc = 1$, where $l$ is the axion
field, and where we simultaneously make the transformation on the two-forms:



More specifically, we can show this invariance by introducing the matrix 



(14.4.17) 


which transforms under $SL(2, R)$ as

$$M \to \Lambda M \Lambda^{T}, \tag{14.4.18}$$

where $\Lambda$ is the $SL(2, R)$ transformation matrix.





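We can also put $H$ and $H'$ into the column matrix $\mathbf{H}$, such that

$$\mathbf{H} = \begin{pmatrix} H \\ H' \end{pmatrix}, \qquad \mathbf{H} \to (\Lambda^{T})^{-1}\mathbf{H}. \tag{14.4.19}$$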

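Then the Type IIB action (after a simple rescaling), which is manifestly
invariant under $SL(2, R)$, is given by

$$S = \int d^{10}x\,\sqrt{g}\,\Big(R - \tfrac{1}{12}\mathbf{H}^{T}_{\mu\nu\rho} M \mathbf{H}^{\mu\nu\rho} + \tfrac{1}{4}\,\mathrm{Tr}\big(\partial^{\mu}M\,\partial_{\mu}M^{-1}\big)\Big), \tag{14.4.20}$$

where we have dropped the fermionic terms, the self-dual tensor, and the higher
interactions.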

When this classical symmetry of the lowest-order action is quantized,
$SL(2, R)$ reduces to the subgroup $SL(2, Z)$, so S duality is given by

$$S = SL(2, Z). \tag{14.4.21}$$


If we set $l = 0$, this $SL(2, Z)$ symmetry contains, as a special case, the
important invariance

$$\phi \to -\phi. \tag{14.4.22}$$

In other words, the strong coupling of Type IIB theory is revealed to be another
Type IIB theory! In summary,

$$S:\ \text{IIB} \leftrightarrow \text{IIB}. \tag{14.4.23}$$
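The inversion of the dilaton can be checked numerically. The sketch below assumes the parametrization $\tau = l + i e^{-\phi}$ and the fractional linear action $\tau \to (a\tau + b)/(c\tau + d)$ quoted above; the function names are illustrative and not taken from the text. It applies the two standard generators of $SL(2, Z)$, $\tau \to -1/\tau$ and $\tau \to \tau + 1$, to a configuration with vanishing axion.

```python
import math

def tau_from_fields(l, phi):
    """Build tau = l + i*exp(-phi) from the axion l and the dilaton phi."""
    return complex(l, math.exp(-phi))

def fields_from_tau(tau):
    """Invert the parametrization: return (l, phi)."""
    return tau.real, -math.log(tau.imag)

def sl2_action(a, b, c, d, tau):
    """Fractional linear transformation; requires ad - bc = 1."""
    if a * d - b * c != 1:
        raise ValueError("need ad - bc = 1")
    return (a * tau + b) / (c * tau + d)

phi0 = 0.7
tau0 = tau_from_fields(0.0, phi0)

# tau -> -1/tau: with l = 0 this flips the sign of the dilaton.
l_s, phi_s = fields_from_tau(sl2_action(0, -1, 1, 0, tau0))
# tau -> tau + 1: shifts the axion by one unit, dilaton unchanged.
l_t, phi_t = fields_from_tau(sl2_action(1, 1, 0, 1, tau0))

print("inversion:", l_s, phi_s)
print("shift:    ", l_t, phi_t)
```

Since the string coupling is $g = e^{\phi}$, the first transformation is the map $g \to 1/g$ discussed above.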


14.4.3 Type I Strings 

In summary, we have shown that Type IIA and IIB strings are T dual to each
other, as are the two heterotic strings. Also, Type IIB string theory is
self-dual under S duality given by $SL(2, Z)$, and Type IIA theory is S dual to a
mysterious D = 11 theory called M-theory. Lastly, we wish to see how Type I
strings fit into this picture [21-22].

Type I theory is quite different from the other theories. It is based on
unoriented strings that can break, and hence it consists of both open and closed
strings. Gauge symmetry is introduced into Type I strings via Chan-Paton factors,
which are traces over isospin matrices multiplying the amplitudes.

Let us use M-theory to make some predictions for Type I strings. We will
take M-theory and compactify it in two different ways, on either
$[S_1/Z_2] \otimes S_1$ or $S_1 \otimes [S_1/Z_2]$, i.e., we can always reverse the
order of compactification. At the end, we will compare the two theories we obtain
by two different methods of compactification, and identify them. This will reveal
the dual nature of the Type I theory.

In the first method, we compactify M-theory on $S_1/Z_2$ with length $L$ and
arrive at the $E_8 \otimes E_8$ heterotic string. If we compactify again on $S_1$,
with radius given by $R$, which breaks the symmetry down to
$SO(16) \otimes SO(16)$, then we have compactified M-theory on a cylinder, with
radius $R$ and length $L$.

Now let us compare this resulting theory with Type I and the $SO(32)$ theories
compactified on a circle of radius $R$. The effective low-energy bosonic Type I
action is given by

$$S = \int d^{10}x\,\sqrt{g}\,\Big\{ e^{-2\phi}\big[R + 4|d\phi|^2\big] - e^{-\phi}\,\mathrm{Tr}|F|^2 - \tfrac{1}{3}|H'|^2 \Big\}. \tag{14.4.24}$$

The $R$ and $d\phi$ terms appear multiplied by $e^{-2\phi}$, as expected, since they
are defined on the sphere. (However, the $H = dB$ field is missing.) The $F$ term is
accompanied by a factor of $e^{-\phi}$ because the Yang-Mills terms are associated
with the open string sector, which is defined on the disk (with Euler number
1) rather than the sphere. And the $H'$ term comes from the R-R sector, and
hence has no $\phi$ dependence. So all factors in the Type I action have the correct
worldsheet structure and $\phi$ dependence.

The $SO(32)$ heterotic action is given by

$$S = \int d^{10}x\,\sqrt{g}\,e^{-2\phi}\Big[R + 4|d\phi|^2 - \tfrac{1}{3}|H|^2 - \alpha'\,\mathrm{Tr}|F|^2\Big]. \tag{14.4.25}$$

Notice that this action has the correct $\phi$ dependence. Since the heterotic string
has no R-R sector, we find that the entire $SO(32)$ action is multiplied by a
factor of $e^{-2\phi}$, as expected.

In the second method, we reverse the order of compactifications: we first
compactify on $S_1$, which gives us Type IIA theory, and then compactify again
on $S_1/Z_2$, where we break N = 2 supersymmetry down to N = 1. By carefully
analyzing the structure of this N = 1 theory, we find that it is a T dual version
of Type I theory, with group structure $SO(16) \otimes SO(16)$. Since both theories
must be the same, we therefore have a relationship between Type I theories
and heterotic strings:

$$S:\ SO(32) \leftrightarrow \text{I}. \tag{14.4.26}$$

Comparing these two theories (which were derived by simply reversing the
order of the compactification of M-theory) we find

$$g_{\mathrm{I}} = 1/g_{SO(32)} = \frac{R}{L}. \tag{14.4.27}$$

When this technique was applied to the Type IIB theory, we found that it
was self-dual because we were compactifying on a torus with symmetry group
$SL(2, Z)$. Here, however, we are compactifying on a cylinder, and hence we
do not expect the resulting theory to be self-dual. In fact, what we find is that
Type I strings are S dual to $SO(32)$ strings.

More precisely, we have the following transformation which converts the
Type I theory into the $SO(32)$ theory:



468 14. M-Theory and Duality 


<t> -<f>, 

B' -> B, 

A -*■ a!A. (14.4.28) 


14.5 BPS States 

Our brief introduction to dualities raises many deep questions. First, we know
very little about M-theory, other than the fact that it contains 11-dimensional
supergravity and reduces to Type IIA theory when compactified. Also, the
dualities we found were discovered by analyzing the low-energy actions of
superstrings, and hence may not hold up nonperturbatively.

To remedy this problem, we will introduce the formalism of BPS states. Because
of the nonrenormalization theorems found in supersymmetric theories,
we do not expect these BPS states to be renormalized. Thus, we expect
these BPS states to persist nonperturbatively, and hence allow us to link two
seemingly different theories.

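For example, let us analyze the two-dimensional supersymmetric action

$$L = \tfrac{1}{2}(\partial_\mu\phi)^2 + \tfrac{1}{2}\bar\psi\, i\gamma^\mu\partial_\mu\psi - \tfrac{1}{2}V^2(\phi) - \tfrac{1}{2}V'(\phi)\bar\psi\psi, \tag{14.5.1}$$

where $V(\phi) = \lambda(\phi^2 - a^2)$ and $\psi$ is a Majorana fermion. This theory is
supersymmetric, with two chiral supercharges given by

$$Q_\pm = \int dx\,\big[(\dot\phi \pm \phi')\psi_\pm \mp V(\phi)\psi_\mp\big], \tag{14.5.2}$$

where $\psi_\pm$ are the left- and right-handed components of $\psi$. The supersymmetry
algebra is given by: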

$$
\begin{aligned}
Q_+^2 &= P_+, \\
Q_-^2 &= P_-, \\
\{Q_+, Q_-\} &= T,
\end{aligned}
\tag{14.5.3}
$$

where $P_\pm = P_0 \pm P_1$, and $T$ is the central charge. From the algebra, we find
the relationship

$$2P_0 = P_+ + P_- = (Q_+ + Q_-)^2 - T = (Q_+ - Q_-)^2 + T. \tag{14.5.4}$$

Now sandwich this relationship between eigenstates of $P_0$, where the particle
is at rest. We find

$$M \geq \tfrac{1}{2}|T|. \tag{14.5.5}$$

This is the Bogomol'nyi bound, which establishes a relationship between the
masses and charges of these states. Those states which saturate this bound
are the BPS states. We find the equality is satisfied when we sandwich this
equation between two states $|s\rangle$ which obey $(Q_+ \pm Q_-)|s\rangle = 0$, i.e.,
for states which are annihilated by precisely half of the supercharges. These are
the BPS saturated states, which are not renormalized in perturbation theory.
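The step from the algebra to the bound can be spelled out as follows. This is a sketch under stated assumptions: the state $|s\rangle$ is normalized and at rest, so that $P_0|s\rangle = M|s\rangle$; the supercharges $Q_\pm$ are Hermitian, so that $(Q_+ \pm Q_-)^2$ has non-negative expectation values; and the central charge $T$ acts as a number on $|s\rangle$.

```latex
\begin{align*}
2M &= \langle s|(Q_+ + Q_-)^2|s\rangle - T \;\geq\; -T, \\
2M &= \langle s|(Q_+ - Q_-)^2|s\rangle + T \;\geq\; +T, \\
\Longrightarrow\quad M &\geq \tfrac{1}{2}|T| .
\end{align*}
```

Equality requires one of the two expectation values to vanish, which for a Hermitian operator means that the corresponding combination $Q_+ + Q_-$ or $Q_+ - Q_-$ annihilates the state.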

This trivial two-dimensional example can be generalized to the four-dimensional
case

$$\{Q^i_\alpha, Q^j_\beta\} = \delta^{ij}(\gamma^\mu C)_{\alpha\beta}P_\mu + U^{ij}C_{\alpha\beta} + V^{ij}(C\gamma_5)_{\alpha\beta} \tag{14.5.6}$$

for $i = 1, 2, \ldots, N$, which contains $N(N - 1)$ central charges contained
within the antisymmetric matrices $U$ and $V$. In particular, for N = 4 and
N = 8 superalgebras, we have 12 and 56 central charges, respectively. If we
sandwich this algebra between eigenstates of $P_0$, where we assume that the
state is at rest, then we have

$$M \geq k|Z| \tag{14.5.7}$$

for some constant $k$, where $|Z|$ symbolically represents the charges that we
obtain from $U$ and $V$. In other words, the masses and charges are now related
by this condition, which we expect will survive the process of renormalization.

Let us now determine the size of the representations of these BPS states. Let
us, for the moment, diagonalize the anticommutator $\{Q_\alpha, Q_\beta\}$ so we bring
it into the form

$$\{Q_\alpha, Q_\beta\} = f_{\alpha\beta}, \tag{14.5.8}$$

where $f_{\alpha\beta}$ is diagonal. By rescaling, this can be brought into the form of a
Kronecker delta.

Notice that this defines a familiar Clifford algebra (if we replace the
supercharges with Dirac matrices $\gamma_\mu$, except that $\alpha$ is a spinor index, not
a vector one). We use the familiar result that for an N-dimensional Clifford
algebra, the representations are $2^{N/2}$ dimensional (for N even). In general,
these $2^{N/2}$-dimensional representations of the Clifford algebra are non-BPS.

Now assume that $f_{\alpha\beta}$ has some zero eigenvalues. Let M represent the
number of nonzero eigenvalues. Then we can rewrite the superalgebra such that
the Clifford algebra holds for $\alpha, \beta \leq M$, i.e.,
$\{Q_\alpha, Q_\beta\} = \delta_{\alpha\beta}$, and the superalgebra becomes Grassmann
for $\alpha, \beta > M$, i.e., $\{Q_\alpha, Q_\beta\} = 0$.

If we diagonalize the nonzero components of $f_{\alpha\beta}$, then the Clifford algebra
is M dimensional, and its representations are $2^{M/2}$ dimensional. These are the
BPS states. For $\alpha > M$, we can define the states of this representation such that
$Q_\alpha|s\rangle = 0$. In other words, the BPS representation breaks supersymmetry for
those indices $\alpha \leq M$.

We saw that IIA and IIB strings have 32 supersymmetry generators. Thus,
if M = 32, the “long” multiplet is $2^{16} = 256^2$ dimensional and is non-BPS.
But if M = 16, then the representation of the reduced Clifford algebra is
$2^8 = 256$ dimensional. This is called the “ultra-short” multiplet, and it is BPS.
Notice that it preserves half of the original supersymmetries. We can also have
M = 24, which breaks 3/4 of the space-time supersymmetry. This forms
a $2^{12}$-dimensional representation, called the “short” multiplet.

For the heterotic string, we have only 16 supercharges. The long multiplet
(which is non-BPS) is therefore 256 dimensional. If M = 8, we break half
of the supersymmetries, and we have the short BPS multiplet which is 16
dimensional. We can also have M = 12, which is 64 dimensional and is called
the intermediate representation.
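The multiplet sizes quoted above all follow from the single rule that M nonvanishing supercharges generate a $2^{M/2}$-dimensional Clifford representation. A minimal sketch of that counting follows; the function names are illustrative and not from the text.

```python
from fractions import Fraction

def multiplet_dimension(nonzero_eigenvalues):
    """Dimension 2^(M/2) of the Clifford representation for even M."""
    if nonzero_eigenvalues % 2:
        raise ValueError("M must be even")
    return 2 ** (nonzero_eigenvalues // 2)

def preserved_fraction(total_supercharges, nonzero_eigenvalues):
    """Fraction of the supercharges that annihilate the multiplet."""
    return Fraction(total_supercharges - nonzero_eigenvalues,
                    total_supercharges)

# Type II strings: 32 supercharges (long, short, ultra-short multiplets).
type_ii = {m: multiplet_dimension(m) for m in (32, 24, 16)}
# Heterotic string: 16 supercharges (long, intermediate, short multiplets).
heterotic = {m: multiplet_dimension(m) for m in (16, 12, 8)}

print(type_ii)
print(heterotic)
print(preserved_fraction(32, 16), preserved_fraction(32, 24))
```

With 32 supercharges, M = 16 leaves half of the supersymmetry unbroken and M = 24 leaves one quarter, in agreement with the statement that the latter breaks 3/4 of the supersymmetry.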

Originally, these states were largely ignored. These BPS states were dropped
entirely when discussing the superalgebras of many theories, since they indicated
the presence of membranes and higher soliton-like states, which were
missing in the original founding papers on supersymmetry. To fully appreciate
the power of these terms, we must now introduce the theory of p-branes.


14.6 Supersymmetry and p-Branes 

We recall that in ordinary electrodynamics, a point particle couples to the
Maxwell field $A_\mu$ via the interaction

$$\int d^D x\, A_\mu j^\mu, \tag{14.6.1}$$

where the source is given by

$$j^\mu(x) = \int d\tau\, \delta^D\big(x_\mu - X_\mu(\tau)\big)\, \partial_\tau X^\mu(\tau), \tag{14.6.2}$$

where $X_\mu(\tau)$ labels the location of a point particle. This reproduces the familiar
formalism of Maxwell's theory of point particles. We call a point particle a
0-brane.

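A string (a 1-brane), labeled by $X_\mu(\tau, \sigma)$, couples to a massless, background
second-rank tensor field $B_{\mu\nu}$ as follows:

$$\int d^D x\, B_{\mu\nu} j^{\mu\nu}, \tag{14.6.3}$$

where

$$j^{\mu\nu}(x) = \int d\tau\, d\sigma\, \delta^D\big(x_\mu - X_\mu(\tau, \sigma)\big)\, \epsilon^{ij}\, \partial_i X^\mu\, \partial_j X^\nu. \tag{14.6.4}$$

Inserting the current into the coupling, we find the usual term coupling the
string to the massless background field:

$$\int d\tau\, d\sigma\, \epsilon^{ij}\, \partial_i X^\mu\, \partial_j X^\nu\, B_{\mu\nu}. \tag{14.6.5}$$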

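For a general p-brane labeled by $X_\mu(\sigma_i)$, we couple it to a massless,
background (p + 1)-rank antisymmetric gauge potential
$A_{\mu_1 \cdots \mu_{p+1}}$ via

$$\int d^D x\, A_{\mu_1 \mu_2 \cdots \mu_{p+1}}\, j^{\mu_1 \mu_2 \cdots \mu_{p+1}}, \tag{14.6.6}$$

where

$$j^{\mu_1 \mu_2 \cdots \mu_{p+1}} = \int d^{p+1}\sigma\, \delta^D\big(x_\mu - X_\mu(\sigma_i)\big)\, \epsilon^{j_1 \cdots j_{p+1}}\, \partial_{j_1}X^{\mu_1} \cdots \partial_{j_{p+1}}X^{\mu_{p+1}}. \tag{14.6.7}$$

Now construct the field tensor associated with the p-brane potential:

$$F_{\mu_1 \mu_2 \cdots \mu_{p+2}} = \partial_{\mu_1}A_{\mu_2 \cdots \mu_{p+2}} + \text{permutations}. \tag{14.6.8}$$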

The key is to construct the dual theory. We therefore introduce another field
tensor $F'$ such that it equals the dual to $F$:

$${}^{*}\!\big(F_{\mu_1 \cdots \mu_{p+2}}\big) = F'_{\mu_1 \cdots \mu_{q+2}}. \tag{14.6.9}$$

The left-hand side is a [D − (p + 2)]-rank tensor. The right-hand side is a
(q + 2)-rank tensor. Since these two numbers must be equal, we therefore have
the relation

$$p + q = D - 4, \tag{14.6.10}$$

which gives us the dimension q of the dual of a p-brane. For example, in four
dimensions, the dual of an electron (0-brane) is another 0-brane, the monopole.
In 10 dimensions, the dual of a string is a 5-brane. Likewise, in 11 dimensions,
the dual of a membrane is a 5-brane.
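The three examples just quoted are direct instances of the relation p + q = D − 4. A minimal sketch follows; the function name is an illustrative choice.

```python
def dual_brane_dimension(p, spacetime_dim):
    """Return q with p + q = D - 4, the dimension of the dual brane."""
    return spacetime_dim - 4 - p

examples = {
    "electron in D = 4":  dual_brane_dimension(0, 4),
    "string in D = 10":   dual_brane_dimension(1, 10),
    "membrane in D = 11": dual_brane_dimension(2, 11),
}

for name, q in examples.items():
    print(f"dual of the {name}: {q}-brane")
```

The relation is symmetric in p and q, so applying the function twice returns the original dimension.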

Lastly, we can construct the electric and magnetic charges corresponding to
these p-branes. We find

$$Q_e \sim \int_{S^{D-p-2}} {}^{*}F, \qquad Q_m \sim \int_{S^{p+2}} F, \tag{14.6.11}$$

where we have encircled the p-brane by a hypersphere. (The definition of
electric versus magnetic charge, as we see here, is a bit arbitrary, since we can
reverse them by taking the dual.)

If we actually construct the p-brane actions and their superalgebras, we
find, in general, that a p-brane is associated with a pth-rank tensor
$Z_{\mu_1 \cdots \mu_p}$ which appears on the right-hand side of the superalgebra.
This is the key point. By analyzing the tensors which are found on the right-hand
side of the superalgebra, we can trivially read off which p-branes exist
nonperturbatively in the theory, without ever constructing them!

To see the power of this method, consider the 11-dimensional superalgebra.
In 11 dimensions, one can construct Majorana spinors with 32 components.
There should be 528 = (33 × 32/2) possible terms on the left- and right-hand
sides of the algebra. The algebra, including the central terms, is given by

$$
\{Q_\alpha, Q_\beta\} = (\Gamma^M C)_{\alpha\beta}P_M + (\Gamma^{MN}C)_{\alpha\beta}Z_{MN}
 + (\Gamma^{MNPQR}C)_{\alpha\beta}Z_{MNPQR}.
\tag{14.6.12}
$$





Each central charge term on the right-hand side corresponds to a p-brane.
Counting states, we find that they contribute

$$11 + 55 + 462 = 528 \text{ states}, \tag{14.6.13}$$

as expected. Thus, we expect to find a membrane and its dual, a 5-brane, in
11 dimensions. This is a powerful result. Without ever constructing them, we
know that M-theory must contain at least a membrane and a 5-brane.
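The counting can be reproduced with binomial coefficients: a symmetric 32 × 32 matrix has 32 · 33/2 independent entries, and an antisymmetric rank-k tensor in D dimensions has "D choose k" components. A minimal sketch follows; the variable names are illustrative.

```python
from math import comb

D = 11
SPINOR_COMPONENTS = 32

# Independent entries of the symmetric anticommutator {Q_a, Q_b}.
lhs = SPINOR_COMPONENTS * (SPINOR_COMPONENTS + 1) // 2

# Components of P_M (rank 1), Z_MN (rank 2) and Z_MNPQR (rank 5).
rhs_terms = {rank: comb(D, rank) for rank in (1, 2, 5)}

print(lhs)
print(rhs_terms, sum(rhs_terms.values()))
```

The rank-2 and rank-5 charges are the ones associated with the membrane and the 5-brane, respectively.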

Now let us apply the BPS algebra to 10-dimensional string theory, where
we have two Majorana-Weyl spinors, each with 16 components. We now have
two spinors $Q^i_\alpha$, one for each chirality. If they have opposite chiralities in 10
dimensions, then they can be recombined into a single spinor, giving us the
algebra for the Type IIA superstring

$$
\begin{aligned}
\{Q_\alpha, Q_\beta\} ={}& (\Gamma^M C)_{\alpha\beta}P_M + (\Gamma_{11}C)_{\alpha\beta}Z \\
&+ (\Gamma^M \Gamma_{11} C)_{\alpha\beta}Z_M + (\Gamma^{MN}C)_{\alpha\beta}Z_{MN} \\
&+ (\Gamma^{MNPQ}\Gamma_{11}C)_{\alpha\beta}Z_{MNPQ} \\
&+ (\Gamma^{MNPQR}C)_{\alpha\beta}Z_{MNPQR}.
\end{aligned}
\tag{14.6.14}
$$

Counting states, this reduces to

$$10 + 1 + 10 + 45 + 210 + 252 = 528 \text{ states}, \tag{14.6.15}$$

as expected. These, in turn, correspond to p = 0, 1, 2, 4, 5 branes.

This algebra gives us a wealth of unexpected information: 

• The algebra shows that there are 1-branes, which correspond to the original
Type IIA string, as expected. However, we also find a series of other
states.

• The original $P_M$ of the 11-dimensional theory has split off a singlet,
corresponding to the scalar $P_{10}$, and a 10-dimensional vector. This $P_{10}$
corresponds to Kaluza-Klein states created by compactifying the 11-dimensional
theory. But from the 10-dimensional perspective, these states occur as
nonperturbative bound states.

• This explains the mystery found before when showing the equivalence
between compactified M-theory and Type IIA strings. The Kaluza-Klein
modes of M-theory are dual to the bound states of the Type IIA string.

• Notice that the 5-brane of the 11-dimensional theory reduces to the 4-brane
and 5-brane of the 10-dimensional theory, and that the 2-brane of
the 11-dimensional theory reduces to the 1-brane and 2-brane of the
10-dimensional theory. We will have a better understanding of these
odd-looking states when we introduce D-branes in the next chapter.

Similarly, for the Type IIB string, we have two chiral spinors $Q^i_\alpha$, where the
algebra is given by

$$
\begin{aligned}
\{Q^i_\alpha, Q^j_\beta\} ={}& \delta^{ij}(\mathcal{P}\Gamma^M C)_{\alpha\beta}P_M + (\mathcal{P}\Gamma^M C)_{\alpha\beta}\tilde{Z}^{ij}_M \\
&+ \epsilon^{ij}(\mathcal{P}\Gamma^{MNP}C)_{\alpha\beta}Z_{MNP} + \delta^{ij}(\mathcal{P}\Gamma^{MNPQR}C)_{\alpha\beta}Z^{+}_{MNPQR} \\
&+ (\mathcal{P}\Gamma^{MNPQR}C)_{\alpha\beta}\tilde{Z}^{+\,ij}_{MNPQR},
\end{aligned}
\tag{14.6.16}
$$

where $\mathcal{P}$ is a chiral projection operator, and the tilde refers to traceless
symmetric SO(2) tensors. Counting states, this gives us

$$10 + (2 \times 10) + 120 + 126 + (2 \times 126) = 528 \text{ states}, \tag{14.6.17}$$

as expected. This, in turn, corresponds to p = 1, 3, 5 (and also p = −1, which
corresponds to an instanton). In contrast to Type IIA strings, Type IIB strings
are associated with odd p-branes.
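Both 10-dimensional countings can be checked the same way as in 11 dimensions. The sketch below assumes the following bookkeeping, which is our reading of the two algebras rather than a statement from the text: in Type IIA each term is an antisymmetric tensor of the indicated rank; in Type IIB a traceless symmetric SO(2) tensor carries a multiplicity of 2, and a self-dual five-form carries half of the 252 components of a generic five-form.

```python
from math import comb

D = 10
TOTAL = 32 * 33 // 2   # independent entries of the symmetric anticommutator

# Type IIA: P_M, Z, Z_M, Z_MN, Z_MNPQ, Z_MNPQR.
type_iia = [comb(D, 1), 1, comb(D, 1), comb(D, 2), comb(D, 4), comb(D, 5)]

# Type IIB: P_M, traceless symmetric Z_M, Z_MNP,
# self-dual Z+_MNPQR, traceless symmetric self-dual Z+_MNPQR.
self_dual_five_form = comb(D, 5) // 2
type_iib = [comb(D, 1), 2 * comb(D, 1), comb(D, 3),
            self_dual_five_form, 2 * self_dual_five_form]

print(type_iia, sum(type_iia))
print(type_iib, sum(type_iib))
```

Both lists add up to the same 528 entries as the 11-dimensional algebra, as they must, since all three describe 32 supercharges.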


14.7 Compactification 

Now let us use this machinery developed in the last few sections to make 
statements about duality in lower dimensions. Our goal is to find dual relations 
in four dimensions, which will allow us to explore the nonperturbative structure 
of field theories, one of which may describe our physical universe. 

Since the lowest-order approximation to superstrings is just supergravity, 
it will be useful to reexamine the nature of D = 11 supergravity when it is 
compactified down to lower dimensions. When the heterotic string is examined 
in the low-energy sector, we find supergravity coupled to various Yang-Mills 
fields via the action 

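$$L = \sqrt{-g}\,\Big(R - \tfrac{1}{2}g_{ij}(\phi)\,\partial_\mu\phi^i\,\partial^\mu\phi^j - \tfrac{1}{4}m_{IJ}(\phi)F^I_{\mu\nu}F^{J\mu\nu} + \cdots\Big), \tag{14.7.1}$$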

where $\phi^i$ are various scalar fields which take values in a target space $\mathcal{M}$
with metric $g_{ij}(\phi)$, and $F^I_{\mu\nu}$ represent the abelian field strengths. We will be
interested in the equations of motion for the scalar fields $\phi^i$, which are invariant
under some symmetry group G, such that $\mathcal{M}$ is the homogeneous space G/H,
where H is the maximal compact subgroup of G. When examined in detail,
we find the following chart for the low-energy heterotic string, which is just
supergravity coupled to certain Yang-Mills fields (see Table 14.1), where D
represents the number of uncompactified dimensions and G is the symmetry
group of the equations of motion for the scalar fields. (The superscript (1)
appearing for the heterotic string compactified down to D = 2 refers to the
affine group.)

Originally, the meaning of these groups (representing the moduli space of
vacua of the scalar fields of supergravity theory) was rather obscure. Now, we
can reinterpret these symmetries in light of superstring theory. We find that
the group G, when discretized, becomes the T-duality group. Also, when we
combine the S-duality group with the T-duality group, we find that there is
actually a much larger group which contains both groups as subgroups, and
this is called U duality, which is also nonperturbative in nature.





TABLE 14.1.

D     Supergravity Group G     T Duality       U Duality
10    O(16) × SO(1,1)          O(16, Z)        O(16, Z) × Z_2
9     O(1,17) × SO(1,1)        O(1,17, Z)      O(1,17, Z) × Z_2
8     O(2,18) × SO(1,1)        O(2,18, Z)      O(2,18, Z) × Z_2
7     O(3,19) × SO(1,1)        O(3,19, Z)      O(3,19, Z) × Z_2
6     O(4,20) × SO(1,1)        O(4,20, Z)      O(4,20, Z) × Z_2
5     O(5,21) × SO(1,1)        O(5,21, Z)      O(5,21, Z) × Z_2
4     O(6,22) × SL(2, R)       O(6,22, Z)      O(6,22, Z) × SL(2, Z)
3     O(8,24)                  O(7,23, Z)      O(8,24, Z)
2     O(8,24)^(1)              O(8,24, Z)      O(8,24)^(1)(Z)


For the low-energy behavior of Type II strings, we find supergravity coupled 
to gauge theory in Table 14.2. 


TABLE 14.2.

D     Supergravity Group G     T Duality       U Duality
10A   SO(1,1)/Z_2              1               1
10B   SL(2, R)                 1               SL(2, Z)
9     SL(2, R) × O(1,1)        Z_2             SL(2, Z) × Z_2
8     SL(3, R) × SL(2, R)      O(2,2, Z)       SL(3, Z) × SL(2, Z)
7     SL(5, R)                 O(3,3, Z)       SL(5, Z)
6     O(5,5)                   O(4,4, Z)       O(5,5, Z)
5     E_6(6)                   O(5,5, Z)       E_6(6)(Z)
4     E_7(7)                   O(6,6, Z)       E_7(7)(Z)
3     E_8(8)                   O(7,7, Z)       E_8(8)(Z)
2     E_9(9)                   O(8,8, Z)       E_9(9)(Z)
1     E_10(10)                 O(9,9, Z)       E_10(10)(Z)





14.8 Example: D = 6 

In searching for dualities, there are many ways in which to approach the 
problem: 

• we can match the low-energy supergravity theories between two string 
theories; 

• we can compare the moduli spaces of possible vacua between two string 
theories; 

• we can match the BPS states and spectra of the two string theories; and 

• for chiral string theories, we can demand that all anomalies vanish. 

Yet another way is to start with 11-dimensional M-theory, and exploit the
duality between the membrane and the 5-brane. If we simultaneously compactify
the membrane and the 5-brane on the same manifold, then we arrive at
two theories which appear to be quite different, but are actually dual to each
other. In this way, starting with the membrane/5-brane duality in M-theory, we
can derive a web of dualities in lower dimensions.

For example, we can wrap the membrane around some one-dimensional
manifold called $M_1$, and obtain a string theory. We can subsequently compactify
the remaining theory around some four-manifold called $M_4$. Thus, we have
compactified the membrane on the manifold $M_1 \otimes M_4$.

We can also reverse this process. We can wrap the 5-brane around $M_4$, in
which case we obtain a string theory, and then compactify it again on $M_1$. Thus,
we have compactified the 5-brane on the manifold $M_4 \otimes M_1$. If we compare
the two resulting theories, they should be dual to each other.

We can, in turn, choose $M_1$ to be either $S_1$ or a line segment $S_1/Z_2$. In
addition, $M_4$ can be either $K_3$ or $T^4$.

To see how this works, let us wrap the membrane around $S_1$, so we arrive
at Type IIA strings. Then we compactify on $K_3$. The resulting theory must be
dual to compactifying the 5-brane first on $K_3$ and then $S_1$.

The result of this sequence of compactifications is that we can show that the
Type IIA string compactified on $K_3$ is dual to the heterotic string compactified
on $T^4$.

Let us summarize this by the following chart.

(N+, N-)   M_1        M_4    Fundamental String   Dual String
(1,0)      S_1/Z_2    K_3    heterotic            heterotic
(1,1)      S_1        K_3    Type IIA             heterotic
(1,1)      S_1/Z_2    T^4    heterotic            Type IIA
(2,2)      S_1        T^4    Type IIA             Type IIA

Let us take $M_1 = S_1$ or $S_1/Z_2$, and $M_4 = T^4$ or $K_3$. Then we arrive at
four possible string theories by compactifying the membrane on $M_1 \otimes M_4$,
which should be dual to the four string theories we obtain by compactifying
the 5-brane on $M_4 \otimes M_1$.

The chart above collects the various D = 6 dualities [1-2]; there
$N_\pm$ represents the number of chiral supersymmetries that survive in six
dimensions.

Let us now analyze some of these dualities for the case of D = 6, which is
a laboratory for the much more difficult case of D = 4.


14.8.1 D = 6, N = (2, 2) Theory

Often when we analyze the descending web of dualities between different
string theories, we find that there are some basic discrepancies, such as a
mismatch between the two symmetry groups. Upon closer examination, however,
we find that these mismatches actually reveal hidden symmetries and novel
features which are not immediately obvious, such as enhanced symmetries
and U duality.

Consider the case D = 6, N = (2, 2), which, according to the chart, yields 
a self-duality of the Type IIA theory in six dimensions. 

First, we can compactify the membrane of M-theory on $T^5$, or Type IIA
theory on $T^4$. The Type IIA theory compactified on $T^4$ has the standard
T-duality group $SO(4, 4, Z)$.

Second, we can compactify the 5-brane of M-theory on $T^5$. This, in turn,
yields a moduli space with the toroidal modular group $SL(5, Z)$.

At this point, we seem to have a contradiction, since the two groups are quite
different. The duality relationship seems to be broken. However, to reconcile
these two facts, we must consider the fact that the smallest group which contains
both groups as subgroups is $SO(5, 5, Z)$. It can be shown that the real symmetry
group of both theories is this larger group, $SO(5, 5, Z)$, which we call U duality
(which contains the T- and S-duality groups as subgroups).

We can use U duality to explain the discrepancy in other compactifications.
For example, if we compactify 11-dimensional supergravity on a (c + 1)-dimensional
torus, with c > 5, we get the toroidal moduli space of $SL(c + 1, Z)$.
This should be dual to compactifying the Type IIA theory on a c-dimensional
torus, yielding the T-duality group $SO(c, c, Z)$. Again, we have a discrepancy
between these two groups. To reconcile these two facts, we note that they are
subgroups of the noncompact form of $E_{c+1}(Z)$, which is the U-duality group.
Thus, we find that

$$U = E_{c+1}(Z), \qquad c > 5. \tag{14.8.2}$$


14.8.2 D = 6, N = (1, 1) Theories

Now let us analyze the case of D = 6, N = (1, 1). This yields the highly
nontrivial duality between Type IIA theory compactified on $K_3$ and the heterotic
$E_8 \otimes E_8$ string compactified on $T^4$ [23].

We suspect that they might be dual, since they have the same low-energy
limit: D = 6, N = 2 supergravity coupled to 20 abelian super-Yang-Mills
multiplets. This six-dimensional theory has 4 × 20 = 80 scalar fields and four
vector fields contained within the supergravity multiplet, so there are 20 + 4
vector U(1) fields altogether.

Now let us compare the moduli spaces of the two theories. The heterotic
string compactified down to six dimensions has the moduli space

$$\frac{SO(4, 20)}{SO(4) \otimes SO(20) \otimes T}, \tag{14.8.3}$$

which is 4 × 20 = 80 dimensional, as expected. The six-dimensional theory
also has 80 scalar fields. (The metric $g_{\mu\nu}$ and $B_{\mu\nu}$ yield 4 × 4 = 16 scalar
fields. The $E_8 \otimes E_8$ Yang-Mills field $A^a_\mu$ gives us 4 × 16 = 64 scalar fields.
The sum yields 80 scalar fields, as expected.)

We have therefore the following scalar states:

$$
\begin{aligned}
g_{\mu\nu} &:\ 10 \text{ scalars}, \\
B_{\mu\nu} &:\ 6 \text{ scalars}, \\
A^a_\mu &:\ 64 \text{ scalars}.
\end{aligned}
\tag{14.8.4}
$$

Also, $A^a_\mu$ yields 16 vector fields. When we combine this with eight vector
fields contained within $g_{\mu\nu}$ and $B_{\mu\nu}$, we wind up with 24 U(1) vector fields,
as expected.

We can also analyze the fields in terms of representations of the little group
in six dimensions, which is $SU(2) \otimes SU(2)$. If we label the multiplicities of the
various states under the little group, we find the following group representations
for the bosonic states:

$$
\begin{aligned}
\text{graviton} &:\ (3, 3), \\
\text{tensor} &:\ (3, 1) \text{ or } (1, 3), \\
\text{vector} &:\ (2, 2), \\
\text{scalar} &:\ (1, 1).
\end{aligned}
\tag{14.8.5}
$$

If we decompose the various 10-dimensional fields according to this
six-dimensional classification by the little group, we find

$$
\begin{aligned}
g_{\mu\nu} &:\ (3, 3) + 4(2, 2) + 10(1, 1), \\
B_{\mu\nu} &:\ (3, 1) + (1, 3) + 4(2, 2) + 6(1, 1), \\
A^a_\mu &:\ 16(2, 2) + 64(1, 1), \\
\phi &:\ (1, 1),
\end{aligned}
\tag{14.8.6}
$$

which yields 80 scalar fields described by (1, 1), 24 vector fields described by
(2, 2), and one dilaton, as expected.

Now let us compare this with the Type IIA theory compactified on $K_3$. We
find the same number of vector fields. $A_\mu$ from the 10-dimensional theory
contributes one vector field. The $C_{\mu\nu\rho}$ tensor contributes 22 vectors. There is
also one extra vector because $C_{\mu\nu\rho}$ in six-dimensional space is dual to a vector.
There are thus 1 + 1 + 22 = 24 U(1) vector fields, as expected.

However, the heterotic and Type IIA theories have vastly different fields:
the heterotic theory possesses $E_8 \otimes E_8$ Yang-Mills fields, whereas the Type IIA
theory has nothing comparable, so we seem to have a discrepancy.
But this is resolved once we realize that the moduli space of the Type IIA
theory receives contributions from several different sources, including the $K_3$
manifold and other scalar fields, so that the two theories in six dimensions
actually have precisely the same 80 scalar fields.

The moduli space of $K_3$ is

$$R^+ \otimes \frac{SO(3, 19)}{SO(3) \otimes SO(19)}, \tag{14.8.7}$$

which is 3 × 19 = 57 dimensional plus one additional dimension for the
volume of the space. Thus, the 10-dimensional metric $g_{\mu\nu}$ can be decomposed
into a six-dimensional metric plus an additional 58 scalar fields corresponding
to the moduli of $K_3$.

The true moduli space of Type IIA theory compactified on $K_3$ receives
contributions from the bosonic fields $\phi$, $B_{\mu\nu}$, $A_\mu$, $C_{\mu\nu\rho}$ in addition to the
graviton.

To calculate the number of fields that can live on $K_3$, we note that the number
of massless bosons on $K_3$ is equal to the number of harmonic p-forms on $K_3$.
This, in turn, is given by the Betti numbers for the manifold, which are: $b_0 = 1$,
$b_1 = b_3 = 0$, $b_2^+ = 3$, $b_2^- = 19$, and $b_4 = 1$, i.e., there are three self-dual
two-forms and 19 anti-self-dual two-forms that can be defined on this space. There
are thus 3 + 19 = 22 generators. These 22 two-forms living on $K_3$ transform
as scalars under the six-dimensional Lorentz group. In other words, the $B_{\mu\nu}$
field reduces to the six-dimensional tensor plus an additional 22 scalar fields,
while the other fields contribute nothing.

Thus, altogether, Type IIA string theory compactified on $K_3$ has 58 + 22 =
80 scalar fields, the same as in the previous case.

We find therefore the following number of states:

$$
\begin{aligned}
g_{\mu\nu} &:\ (3, 3) + 58(1, 1), \\
B_{\mu\nu} &:\ (3, 1) + (1, 3) + 22(1, 1), \\
C_{\mu\nu\rho} &:\ 23(2, 2), \\
A_\mu &:\ (2, 2), \\
\phi &:\ (1, 1).
\end{aligned}
\tag{14.8.8}
$$
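The matching of the two spectra can be tallied explicitly. The sketch below uses only the multiplicities quoted in this section, together with the standard fact that the coset SO(p, q)/[SO(p) ⊗ SO(q)] has dimension pq; the function and variable names are illustrative, and the dilaton is counted separately on both sides.

```python
def dim_so(n):
    """Dimension of the orthogonal group SO(n) (or of SO(p, q), p + q = n)."""
    return n * (n - 1) // 2

def coset_dimension(p, q):
    """Dimension of SO(p, q) / [SO(p) x SO(q)], which equals p * q."""
    return dim_so(p + q) - dim_so(p) - dim_so(q)

# Heterotic string on T^4: contributions of the metric, B field, gauge field.
heterotic_scalars = 10 + 6 + 64
heterotic_vectors = 4 + 4 + 16

# Type IIA on K3: metric moduli (shape plus volume) and B-field two-forms.
betti_k3 = {"b0": 1, "b1": 0, "b2_plus": 3, "b2_minus": 19, "b3": 0, "b4": 1}
k3_metric_moduli = coset_dimension(3, 19) + 1
b_field_scalars = betti_k3["b2_plus"] + betti_k3["b2_minus"]
iia_scalars = k3_metric_moduli + b_field_scalars
iia_vectors = 1 + 22 + 1   # one-form, three-form on two-cycles, dual three-form

print(heterotic_scalars, iia_scalars, coset_dimension(4, 20))
print(heterotic_vectors, iia_vectors)
```

Both sides give 80 scalars and 24 vectors, and the scalar count agrees with the dimension of the heterotic moduli space.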

In conclusion, we find that the moduli spaces of the two theories are actually
the same, once we factor in the contributions coming from fields other than the
metric. We find therefore that

$$\text{IIA on } K_3 \leftrightarrow E_8 \otimes E_8 \text{ on } T^4. \tag{14.8.9}$$






14.8.3 Deletions and Fibrations 

A number of simple devices have been devised to derive more classes of
dualities. Of course, these tricks have to be rigorously proven, but these devices
have so far survived all the usual consistency checks.

If $S_1$ appears on both sides of a dual relationship, one may, for example,
“delete” or “lift” this $S_1$ on both sides of the duality, yielding yet another dual
relationship.

For example, Type IIA string theory compactified on $K_3$ can be written as
M-theory compactified on $S_1 \otimes K_3$. This, in turn, is dual to the heterotic string
compactified on $T^4$. If we “delete” $S_1$ on both sides of the dual relationship,
we find

$$
\begin{aligned}
\text{IIA on } K_3 &\leftrightarrow E_8 \otimes E_8 \text{ on } T^4, \\
\text{M-theory on } K_3 &\leftrightarrow E_8 \otimes E_8 \text{ on } T^3.
\end{aligned}
\tag{14.8.10}
$$

(Notice that both have the same T-duality group $SO(3, 19, Z)$ and the same
moduli space $SO(3, 19)/[SO(3) \otimes SO(19) \otimes T]$, as expected.)

Yet another method of deriving new dual relationships is by “fiberwise”
dualities. In this chapter, we have analyzed manifolds which are simple products
of two submanifolds. However, it is possible to combine two submanifolds in
a different way, by fibration. Let $A_1$ and $A_2$ be two manifolds. At each point
on $A_2$, we define a distinct manifold $A_1$. Thus, $A_2$ will serve as a base
manifold, and $A_1$ will serve as a fiber. We will denote this fibration by
$M_A = A_1$ over $A_2$.

Now let theory A compactified on $A_1$ be dual to theory B compactified on
a manifold $B_1$. Then we suspect that the following dual relationship also holds:

$$
\text{A on } A_1 \leftrightarrow \text{B on } B_1 \;\Rightarrow\; \text{A on } [A_1 \text{ over } A_2] \leftrightarrow \text{B on } [B_1 \text{ over } A_2]. \qquad (14.8.11)
$$

This relationship, of course, still must be rigorously checked by comparing
BPS states, moduli spaces, etc.

This fiberwise duality is useful when the manifold $K_3$ appears, since a
certain class of $K_3$ manifolds can be written as fibrations over $P_1$, the complex
projective line. In particular, a class of $K_3$ manifolds are elliptic fibrations
over the complex projective space $P_1$, i.e.,

$$
K_3 \sim T^2 \text{ over } P_1. \qquad (14.8.12)
$$


14.9 F-Theory 

We now mention briefly F-theory [24]. Ideally, F-theory is to Type IIB strings 
as M-theory is to Type IIA strings. Although this is a fertile source of new dual 
relationships, we will also find some conceptual obstacles to this construction. 

We begin by noticing that the SL(2, Z) modular symmetry of Type 
IIB strings can be viewed as arising from the compactification of some 
12-dimensional theory on a torus. 





To establish this duality, let Type IIB theory be compactified on a manifold
$M$. Then we have the following:

$$
\text{F on } (T^2 \text{ over } M) \leftrightarrow \text{IIB on } M, \qquad (14.9.1)
$$

i.e., F-theory compactified over an elliptic fibration of a manifold $M$ is
equivalent to IIB theory compactified over $M$. One can take this as the definition
of F-theory. This construction is possible because the dilaton-axion field (which
is usually taken to be a constant over the compactification manifold) is now
taken to be a function of every point on the manifold.

Also, a large class of dualities for F-theory can be derived by noticing that

$$
\text{F on } S_1 \otimes M_A \leftrightarrow \text{M on } M_A \qquad (14.9.2)
$$

for some manifold $M_A$.

Although the construction of the elliptic fibrations seems rather complicated,
there is a straightforward way in which to manipulate such spaces.

We recall from the previous chapter that the equation

$$
y^2 = x^3 + f x + g \qquad (14.9.3)
$$

defines a torus, where $x$ and $y$ are each complex variables. For each value of
the complex parameters $f$ and $g$, we have a torus. Now let $f$ and $g$ be defined
as functions of the complex variable $z$, which is defined over a base manifold,
such as $P_1$. Then for every point in the base manifold, we have a torus. This
is precisely the construction that we desire, i.e., a torus $T^2$ defined over every
point on some base manifold.

A large class of interesting manifolds can be constructed in this fashion.
For example, if $f(z)$ is a polynomial of degree 8, and $g(z)$ is a polynomial of
degree 12 in $z$, then the elliptic fibration of $P_1$ yields a $K_3$ manifold.
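
As a small illustration of the degrees quoted above, the following Python sketch counts where the torus fiber of $y^2 = x^3 + f(z)x + g(z)$ degenerates. It relies on a standard fact that is not stated in the text: the fiber becomes singular where the discriminant $\Delta = 4f^3 + 27g^2$ vanishes. The helper names (`poly_mul`, `discriminant`, `random_poly`) and the random integer coefficients are illustrative assumptions only.

```python
from random import Random

def poly_mul(a, b):
    """Multiply two polynomials given as coefficient lists (lowest degree first)."""
    out = [0] * (len(a) + len(b) - 1)
    for i, ai in enumerate(a):
        for j, bj in enumerate(b):
            out[i + j] += ai * bj
    return out

def poly_add(a, b):
    """Add two polynomials given as coefficient lists."""
    n = max(len(a), len(b))
    a = a + [0] * (n - len(a))
    b = b + [0] * (n - len(b))
    return [x + y for x, y in zip(a, b)]

def degree(a):
    """Degree of a polynomial (index of the highest nonzero coefficient)."""
    for k in range(len(a) - 1, -1, -1):
        if a[k] != 0:
            return k
    return -1

def discriminant(f, g):
    """Delta(z) = 4 f(z)^3 + 27 g(z)^2 (assumed standard form, not from the text)."""
    f3 = poly_mul(poly_mul(f, f), f)
    g2 = poly_mul(g, g)
    return poly_add([4 * c for c in f3], [27 * c for c in g2])

def random_poly(deg, rng):
    """Random integer polynomial of exact degree `deg` with positive leading term."""
    coeffs = [rng.randint(-5, 5) for _ in range(deg)]
    coeffs.append(rng.randint(1, 5))
    return coeffs

rng = Random(0)
f = random_poly(8, rng)    # deg f = 8, as in the text
g = random_poly(12, rng)   # deg g = 12, as in the text
delta = discriminant(f, g)
print(degree(delta))       # number of zeros of Delta on the base, counted with multiplicity
```

With these degrees the discriminant has degree 24, which is the familiar count of degenerate fibers for an elliptically fibered $K_3$.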

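Using the various tricks that we have compiled, it is now straightforward to
prove the following identities:

$$
\begin{aligned}
\text{F-theory on } T^2 &\leftrightarrow \text{IIB}, \\
\text{F-theory on } A &\leftrightarrow \text{IIB on } B, \\
\text{F-theory on } T^2/Z_2 &\leftrightarrow SO(32), \\
\text{F-theory on } K_3 &\leftrightarrow E_8 \otimes E_8 \text{ on } T^2,
\end{aligned}
\qquad (14.9.4)
$$

for $A$ being an elliptic fibration of $B$.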

The last identity, for example, can be obtained by deleting $T^2$ from both
sides of the duality between IIB strings on $K_3$ and the heterotic string
on $T^4$. This last identity, in turn, can be transformed using fiberwise duality
via $P_1$. We note that one type of $K_3$ can be expressed as an elliptic fibration
$K_3 \sim T^2$ over $P_1$. On the left-hand side, we therefore want to compactify over
$K_3$ over $P_1$. On the right-hand side, we want to compactify over $T^2$ over $P_1$.
The resulting duality is then

$$
\begin{aligned}
& \text{F on } K_3 \leftrightarrow E_8 \otimes E_8 \text{ on } T^2 \\
\Rightarrow\; & \text{F on } [K_3 \text{ over } P_1] \leftrightarrow E_8 \otimes E_8 \text{ on } [T^2 \text{ over } P_1] \\
\Rightarrow\; & \text{F on CY} \leftrightarrow E_8 \otimes E_8 \text{ on } K_3,
\end{aligned}
\qquad (14.9.5)
$$

where the CY is a Calabi-Yau manifold given by $K_3$ over $P_1$. Clearly, we can
derive a large number of dual relations with F-theory.

F-theory has already proven its usefulness in generating a large class of dual 
relationships. However, it is not clear if this 12-dimensional theory is really a 
fundamental theory or not. 


14.10 Summary 

Duality, when applied to string theory, can be used to show that all five string 
theories are different phases of the same theory. In particular, in 11 dimensions, 
the Type IIA theory can be shown to be dual to a new theory, called M-theory, 
which reduces to D = 11 supergravity in the low-energy limit. 

If we summarize the main dualities, we can show the link between all five
superstring theories. $T$ duality allows us to make the following links:

$$
\begin{aligned}
\text{IIA} &\leftrightarrow \text{IIB}, \\
E_8 \otimes E_8 &\leftrightarrow SO(32),
\end{aligned}
\qquad (14.10.1)
$$

while $S$ duality relations can be written between the following theories:

$$
\begin{aligned}
\text{M on } S_1 &\leftrightarrow \text{IIA}, \\
\text{IIB} &\leftrightarrow \text{IIB}, \\
\text{M on } [S_1/Z_2] &\leftrightarrow E_8 \otimes E_8, \\
SO(32) &\leftrightarrow \text{I}.
\end{aligned}
\qquad (14.10.2)
$$

Because the $S$-duality relations are expressed as $\phi \leftrightarrow -\phi$,
the strong coupling region of one theory is mapped into the weak
coupling region of the other theory. In particular, when we compactify
M-theory on a circle of radius $R_{11}$, we find the relationship
$R_{11} = (g_s)^{2/3}$, which shows that we would never have seen the relationship
between M-theory in 11 dimensions and IIA theory in 10 dimensions to any finite
order in perturbation theory.

Although the above dual relationships were originally found by equating the 
low-energy theories of the various string theories, we can also show that these 
dual relationships persist nonperturbatively by comparing their BPS states, 
which are not renormalized because of the nonrenormalization theorem. 

The key to understanding these BPS states is the p-branes. Consider a
rank-(p + 1) antisymmetric tensor gauge potential coupled to a p-brane. The
coupling is given by

j d D x (14.10.3) 

where 

r(x) = f dr S D ( Xfi - X„(T» d T X»(r), (14.10.4) 





where X M (o/) is a generalization of the string variable usually found in string 
theory. 

Now construct the field tensor associated with the p-brane potential

$$
F_{\mu_1 \cdots \mu_{p+2}} = \partial_{\mu_1} A_{\mu_2 \cdots \mu_{p+2}} + \text{permutations}. \qquad (14.10.5)
$$

We can introduce another field tensor $F'$ such that the dual of $F$ is identified
with $F'$, i.e.,

$$
{}^*F_{\mu_1 \cdots \mu_{p+2}} = F'_{\mu_1 \cdots \mu_{q+2}}. \qquad (14.10.6)
$$

Since $F'$ is the field tensor corresponding to yet another tensor potential
corresponding to a q-brane, we then have the condition

$$
p + q = D - 4, \qquad (14.10.7)
$$

which gives us the dimension of the dual of a p-brane.
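
The relation $p + q = D - 4$ is easy to tabulate. The following is a minimal Python sketch; the function name `dual_brane_dimension` and the sample list of $(D, p)$ pairs are illustrative choices, not taken from the text.

```python
def dual_brane_dimension(p: int, D: int) -> int:
    """Return q such that a q-brane is dual to a p-brane in D dimensions (p + q = D - 4)."""
    q = D - 4 - p
    if p < 0 or q < 0:
        raise ValueError(f"no dual brane for p={p} in D={D}")
    return q

# A few sample cases: the membrane and 5-brane in D = 11,
# the string in D = 10, and a point particle in D = 4.
for D, p in [(11, 2), (11, 5), (10, 1), (10, 3), (4, 0)]:
    print(f"D={D}: {p}-brane <-> {dual_brane_dimension(p, D)}-brane")
```

In particular the membrane and the 5-brane are dual to each other in 11 dimensions, and a 3-brane in 10 dimensions is dual to another 3-brane.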

Now consider the full 11-dimensional superalgebra, including the central
terms

$$
\{Q_\alpha, Q_\beta\} = (\Gamma^M C)_{\alpha\beta} P_M + (\Gamma^{MN} C)_{\alpha\beta} Z_{MN}
+ (\Gamma^{MNPQR} C)_{\alpha\beta} Z_{MNPQR}. \qquad (14.10.8)
$$

Each central charge term on the right-hand side corresponds to a p-brane.
Counting states, we find that each term contributes:

$$
11 + 55 + 462 = 528 \text{ states}, \qquad (14.10.9)
$$

where the 55 states correspond to a membrane, and the 462 states correspond
to a 5-brane.

What is rather remarkable about this simple analysis is that we can determine
the existence of the complete set of BPS p-branes for the theory without ever
having to construct them!

For example, consider the 10-dimensional Type IIA algebra

$$
\begin{aligned}
\{Q_\alpha, Q_\beta\} = {} & (\Gamma^M C)_{\alpha\beta} P_M + Z + (\Gamma^M C)_{\alpha\beta} Z_M + (\Gamma^{MN} C)_{\alpha\beta} Z_{MN} \\
& + (\Gamma^{MNPQ} C)_{\alpha\beta} Z_{MNPQ} \\
& + (\Gamma^{MNPQR} C)_{\alpha\beta} Z_{MNPQR}.
\end{aligned}
\qquad (14.10.10)
$$

Counting states, this reduces to

$$
10 + 1 + 10 + 45 + 210 + 252 = 528 \text{ states}. \qquad (14.10.11)
$$
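
The two counts can be checked mechanically, since an antisymmetric rank-$r$ tensor in $D$ dimensions has $\binom{D}{r}$ independent components. The sketch below does this in Python; the function name `central_charge_components` is hypothetical, and the comparison with $32 \cdot 33/2$ (the number of independent entries of a symmetric $32 \times 32$ matrix, as one would expect for the anticommutator of 32 supercharges) is an added observation rather than a statement from the text.

```python
from math import comb

def central_charge_components(D, ranks):
    """Number of independent components of antisymmetric tensors of the given ranks in D dimensions."""
    return [comb(D, r) for r in ranks]

# D = 11: P_M, Z_MN, Z_MNPQR
m_theory = central_charge_components(11, [1, 2, 5])

# D = 10 Type IIA: P_M, Z, Z_M, Z_MN, Z_MNPQ, Z_MNPQR
type_iia = central_charge_components(10, [1, 0, 1, 2, 4, 5])

# Independent entries of a symmetric 32 x 32 matrix.
symmetric_32 = 32 * 33 // 2

print(m_theory, sum(m_theory))
print(type_iia, sum(type_iia))
print(symmetric_32)
```

Both sums agree with each other and with the size of the symmetric matrix, so the terms on the right-hand side saturate the algebra.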

By analyzing these equations, we see that there should be BPS states for even 
p-branes. Similarly, for the Type IIB superalgebra, we find BPS states for odd 
p-branes. 

Of what use are these p-branes? We will find that they play a crucial role 
in defining the nonperturbative structure of the theory. For example, there was 
the long-standing puzzle of what were the sources for the Ramond-Ramond 
fields. We recall that the Type IIA theory had massless Ramond-Ramond fields 





given by {C p , A pvp }, while the Type IIB theory had Ramond-Ramond fields 
given by {/, B' MV , C^ v<jp }. But out of the string variable it was impossible 
to construct a source for these fields. Thus, the string had a net charge under 
NS-NS fields, but had zero charge under the R-R fields. Now we see that the 
Type IIA(B) theory actually has nonperturbative states given by even (odd) 
p-branes which can act as sources for the R-R fields. 

This is important for the Type IIB theory, for example, because the $SL(2, Z)$
symmetry rotates the tensor field $B$ into $B'$, so there must be new objects which
carry the charge associated with $B'$.

In this chapter, we have seen how the dualities found in higher dimensions
naturally lead us to dualities in lower dimensions. In particular, we find that
the nonperturbative region of one compactified string theory often yields yet
another apparently unrelated string theory. For example, Type IIA string theory
compactified on $K_3$ is dual to the heterotic string compactified on $T^4$.

There are many ways to see how this duality works. The simplest way is
to compare two seemingly unrelated string theories which have the same
low-energy structure. Specifically, we shall analyze the supersymmetry generators
which survive the compactification process. If we begin with a supersymmetry
generator $Q_\alpha$ in 10 or 11 dimensions and then compactify it down
to lower dimensions, we find that it decomposes into $Q_{\alpha i}$, where $\alpha$
labels the spinor index in a lower dimension and $i$ labels the number of
supersymmetry generators. In this way, once we know the holonomy group of the
manifold on which we are compactifying a theory, we can calculate the number of
supersymmetry generators $N$ that survive the compactification process.

In particular, the case $D = 6$ has been analyzed extensively. We begin
with the fact that M-theory in 11 dimensions contains both a membrane
and its dual, a 5-brane (which is given to us by analyzing the BPS
supersymmetry algebra). We can compactify both the membrane and 5-brane on
a five-dimensional space, given by the product of a one-dimensional space
$M_1$ and a four-dimensional space $M_4$. The resulting theories in six
dimensions should be dual to each other, since they were dual to each other in 11
dimensions.

We can let $M_1 = S_1$ or $M_1 = S_1/Z_2$. Also, we can let $M_4 = K_3$ or $M_4 = T^4$.
We therefore have four ways in which to compactify the membrane and
its dual, and hence four possible dualities.

Perhaps the most interesting case is D = 6 and N = (1, 1), which establishes 
a heterotic/Type II duality. Evidence for this duality is given by analyzing the 
low-energy structure of both theories, which are not obviously the same. 

The moduli space of the heterotic string compactified on $T^4$ is given by the
Narain lattice

$$
\frac{SO(4, 20)}{SO(4) \otimes SO(20) \otimes \Gamma}, \qquad (14.10.12)
$$

which is 4 x 20 = 80 dimensional. This, in turn, must match the number of
scalar fields of the supergravity theory. To see this, we note that the metric
$g_{\mu\nu}$ and $B_{\mu\nu}$ yield 4 x 4 = 16 scalar fields after compactification.
Similarly, the $E_8 \otimes E_8$ Yang-Mills field $A_\mu$ gives us 4 x 16 = 64 scalar
fields. The sum yields 80 scalar fields, as expected:

$$
\begin{aligned}
g_{\mu\nu} &: \quad 10, \\
B_{\mu\nu} &: \quad 6, \\
A_\mu &: \quad 64.
\end{aligned}
\qquad (14.10.13)
$$

Also, we can calculate the gauge group which survives the compactification
process. $A_\mu$ yields 16 vector fields. When we combine this with the eight vector
fields contained within $g_{\mu\nu}$ and $B_{\mu\nu}$, we wind up with 24 $U(1)$ vector
fields, as expected.

Now compare this with the Type IIA theory compactified on $K_3$. The counting
of scalar states is much trickier, because we must carefully analyze the
moduli space of $K_3$, given by

$$
R^+ \otimes \frac{SO(3, 19)}{SO(3) \otimes SO(19)}, \qquad (14.10.14)
$$

which is 3 x 19 = 57 dimensional plus one additional dimension for the
volume of the space. Thus, the 10-dimensional metric $g_{\mu\nu}$ can be decomposed
into a six-dimensional metric plus an additional 58 scalar fields.

Similarly, $A_\mu$ contributes one vector field. The $C_{\mu\nu\rho}$ tensor
contributes 22 vectors. There is also one extra vector because the three-form in
six-dimensional space is dual to a vector. There are thus 1 + 1 + 22 = 24 $U(1)$
vector fields, as expected.

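We find therefore the following number of states:

$$
\begin{aligned}
g_{\mu\nu} &: \quad (3,3) + 58(1,1), \\
\phi &: \quad (1,1), \\
A_\mu &: \quad (2,2), \\
B_{\mu\nu} &: \quad (3,1) + (1,3) + 22(1,1), \\
C_{\mu\nu\rho} &: \quad 23(2,2),
\end{aligned}
\qquad (14.10.15)
$$

so there are 80 scalar states described by (1,1) and 24 vector states described
by (2,2).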
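
The bookkeeping on both sides of the heterotic/Type II duality can be collected in a few lines of Python. This is only an arithmetic sketch of the counts quoted in this section; the helper names `so_dim` and `coset_dim` are hypothetical, and the dimension formula $\dim SO(n) = n(n-1)/2$ is a standard fact that is not spelled out in the text.

```python
def so_dim(n):
    """Dimension of SO(n), or of any real form SO(p, q) with p + q = n."""
    return n * (n - 1) // 2

def coset_dim(p, q):
    """Dimension of SO(p, q) / (SO(p) x SO(q)); this always equals p * q."""
    return so_dim(p + q) - so_dim(p) - so_dim(q)

# Heterotic string on T^4
heterotic_moduli = coset_dim(4, 20)              # Narain moduli space
heterotic_scalars = 4 * 4 + 4 * 16               # (g, B) plus Yang-Mills scalars
heterotic_vectors = 16 + 8                       # A_mu plus vectors from g and B

# Type IIA on K3
k3_metric_scalars = coset_dim(3, 19) + 1         # 57 moduli plus the volume
b_field_scalars = 3 + 19                         # self-dual plus anti-self-dual two-forms
type_iia_scalars = k3_metric_scalars + b_field_scalars
type_iia_vectors = 1 + 22 + 1                    # A_mu, C on two-forms, dual of the three-form

print(heterotic_moduli, heterotic_scalars, type_iia_scalars)
print(heterotic_vectors, type_iia_vectors)
```

Both theories give 80 scalars and 24 vectors, matching the table above.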

Lastly, we observe that $K_3$ preserves half the supersymmetries after
compactification, so we have (1,1) instead of (2,2), which agrees with the previous
(1,1) supersymmetry we found for the heterotic string compactified on $T^4$.
Thus, the fermionic fields of the two theories match.

In summary, we find that the low-energy actions of the two theories have
the same number of fields, the same moduli space, and are related by the
nonperturbative relationship $\phi \to -\phi$.

This result can be generalized by analyzing the supergravity moduli spaces
for various dimensions spanned by the scalar fields. We expect that these
moduli spaces for supergravity theories can be generalized to the moduli spaces
for superstrings by making the duality groups discrete. Fortunately, the moduli
spaces for 11-dimensional supergravity compactified down to various
dimensions were cataloged long ago, and represent the first step in establishing
dual relationships between two dissimilar string theories. By comparing the
moduli space, the supersymmetry group, and BPS states, one can therefore
establish a number of dual relationships.

Much more difficult (and more interesting) are the compactifications to
$D = 6$, $N = 1$ or $D = 4$, $N = 2$, which represent the cutting edge of research.
Results for these two cases will shed much light on the physically relevant case
of $D = 4$, $N = 1$. Not surprisingly, we find that these compactifications are
highly nontrivial because of the introduction of Calabi-Yau manifolds.

A large web of dualities can also be derived by using certain tricks. For
example, if the circle $S_1$ is present on both sides of a dual relationship, we
can simply “delete” it, and find a new dual relationship. (Of course, this new
relationship still has to survive other consistency checks.) We can also derive
new dual relationships via “fiberwise” duality. If we have two manifolds $A_1$
and $A_2$, we can combine them via a fibration by defining a manifold $A_1$ at each
point on the manifold $A_2$. Thus, $A_1$ becomes the fiber, and $A_2$ becomes the
base. If theory A compactified on manifold $A_1$ is dual to theory B compactified
on manifold $B_1$, then we are led to believe that theory A on manifold [$A_1$ over
$A_2$] is dual to theory B on manifold [$B_1$ over $A_2$].

Many of these results can, in turn, be derived by postulating the existence of
a 12-dimensional theory which, when compactified, becomes Type IIB string
theory. We recall that we can define an $SL(2, Z)$ symmetry at each point of
the manifold, which defines a fiber bundle. We therefore define F-theory as the
theory which, when compactified on an elliptic fibration, yields the Type IIB
theory.

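We can summarize many of these relationships via

$$
\begin{aligned}
\text{F-theory on } T^2 &\leftrightarrow \text{IIB}, \\
\text{F-theory on } A &\leftrightarrow \text{IIB on } B, \\
\text{F-theory on } T^2/Z_2 &\leftrightarrow SO(32), \\
\text{F-theory on } K_3 &\leftrightarrow E_8 \otimes E_8 \text{ on } T^2,
\end{aligned}
\qquad (14.10.16)
$$

for $A$ being an elliptic fibration of $B$. Whether F-theory is a genuine,
fundamental theory remains to be seen.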


References 


1. C. Montonen and D. Olive, Phys. Lett. B72, 117 (1977).

2. E. Witten and D. Olive, Phys. Lett. 78B, 97 (1978).

For reviews of M-theory, duality, and membranes, see Refs. 3-8.

3. J. H. Schwarz, Lectures on Superstring and M-Theory Dualities, TASI Summer
School, World Scientific, Singapore (1996).

4. J. Polchinski, TASI Lectures on D-Branes, TASI Summer School, World Scientific,
Singapore (1996).

5. P. K. Townsend, Proceedings of the 1996 ICTP Summer School in High Energy
Physics and Cosmology, June 10-26 (1996).

6. M. J. Duff, Supermembranes, TASI Summer School, World Scientific, Singapore
(1996).

7. C. Vafa, Lectures on Strings and Dualities, Feb. 1997, hep-th/9702201.

8. A. Sen, An Introduction to Non-perturbative String Theory, hep-th/9802051.

9. K. Kikkawa and M. Yamasaki, Phys. Lett. B149, 357 (1984).

10. M. Dine, P. Huet, and N. Seiberg, Nucl. Phys. B322, 301 (1989).

11. For references, see A. Giveon, M. Porrati, and E. Rabinovici, Phys. Rep. 244, 77
(1994).

12. K. Narain, Phys. Lett. B169, 41 (1986); K. Narain, H. Sarmadi, and E. Witten,
Nucl. Phys. B279, 369 (1987).

13. E. Cremmer and B. Julia, Phys. Lett. 80B, 48 (1978); Nucl. Phys. B159, 141
(1979).

14. E. Witten, Nucl. Phys. B433, 85 (1995).

15. P. K. Townsend, Phys. Lett. B350, 184 (1995).

16. P. Horava and E. Witten, Nucl. Phys. B460, 506 (1996); Nucl. Phys. B475, 94
(1996).

17. A. Font, L. Ibanez, D. Lust, and F. Quevedo, Phys. Lett. B249, 35 (1990).

18. S. J. Rey, Phys. Rev. D43, 526 (1991).

19. J. H. Schwarz and A. Sen, Nucl. Phys. B411, 35 (1994); Phys. Lett. 312, 105
(1993).

20. A. Sen, Int. J. Mod. Phys. A9, 3707 (1994); Phys. Lett. B329, 217 (1994).

21. E. Witten, Nucl. Phys. B433, 85 (1995).

22. J. Polchinski and E. Witten, Nucl. Phys. B460, 525 (1996).

23. C. M. Hull and P. K. Townsend, Nucl. Phys. B438, 109 (1995).

24. C. Vafa, Nucl. Phys. B469, 403 (1996).



CHAPTER 15 


D-Branes and 
CFT/ADS Duality 


15.1 Solitons 

In the previous chapters, we introduced soliton and soliton-like objects, e.g.,
the BPS states. These states, in fact, are often the key to establishing
nonperturbative dualities, since it is believed that these BPS states are not
renormalized. However, we have not actually constructed these objects and explored
their dual properties. This will be the subject of this chapter [1-4].

We begin by noting that we can construct the equations of motion of the
superstring moving in a background metric, given by ordinary supergravity.
By solving these equations for the supergravity fields, we can find classical
configurations which correspond to the various p-branes.

Let us start with a generic action often found in string theory, defined in
D-dimensional space-time:

s = ^/ (1511) 

where $F_{p+1}$ is the usual antisymmetric field strength corresponding to the field
which couples to the p-brane, and $\phi$ is the usual dilaton.

This can be coupled to the p-brane action. We introduce the p-brane variable
$X^M(\xi)$, which is now a function of the variables $\xi$ which parametrize the
p-brane world volume:

$$
\begin{aligned}
S_p = T \int d^{p+1}\xi \, \Big( & -\tfrac{1}{2} \sqrt{-g}\, g^{ij}\, \partial_i X^M \partial_j X^N g_{MN}\, e^{a\phi/(p+1)} + \tfrac{p-1}{2} \sqrt{-g} \\
& - \frac{1}{(p+1)!}\, \epsilon^{i_1 i_2 \cdots i_{p+1}}\, \partial_{i_1} X^{M_1} \cdots \partial_{i_{p+1}} X^{M_{p+1}} A_{M_1 \cdots M_{p+1}} \Big),
\end{aligned}
\qquad (15.1.2)
$$

where $g^{ij}$ is the metric on the world volume of the p-brane. The first term
represents the generalization of the Nambu-Goto action for a p-brane, and the
last term represents the coupling of the massless rank-(p + 1) antisymmetric
field to the p-brane variables.

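Although these coupled equations seem formidable, we can solve them by
assuming a simple ansatz, $\phi = \phi(y)$, where $x^M = (x^\mu, y^m)$ and

$$
\begin{aligned}
A_{\mu_1 \mu_2 \cdots \mu_{p+1}} &= -\frac{1}{\det(g_{\mu\nu})}\, \epsilon_{\mu_1 \mu_2 \cdots \mu_{p+1}}\, e^{C}, \\
ds^2 &= e^{2A}\, dx^\mu dx_\mu + e^{2B}\, dy^m dy^m,
\end{aligned}
$$

where $\mu = 0, 1, 2, \ldots, p$ label the world volume directions, and
$m = p + 1, p + 2, \ldots, D - 1$. Similarly, we split the p-brane coordinates as
follows: $X^M = (X^\mu, Y^m)$, and choose $X^\mu = \xi^\mu$ and $Y^m = $ constant.

If we insert the ansatz into the coupled equations, we find that there is a
solution if we make the following identifications:

$$
\begin{aligned}
A &= \frac{D - p - 3}{2(D - 2)}\, (C - a\phi_0/2), \\
B &= -\frac{p + 1}{2(D - 2)}\, (C - a\phi_0/2), \\
\frac{a}{2}\phi &= \frac{a^2}{4}\, (C - a\phi_0/2) + a\phi_0/2, \\
a^2 &= 4 - 2(p + 1)(D - p - 3)/(D - 2), \\
e^{-C} &= \begin{cases}
e^{-a\phi_0/2} + k/y^{d}, & D - p - 3 > 0, \\
e^{-a\phi_0/2} - (\kappa^2 T/\pi) \ln y, & D - p - 3 = 0,
\end{cases}
\end{aligned}
$$

where $d = D - p - 3$, $k = 2\kappa^2 T/[(D - p - 3)\,\Omega_{D-p-2}]$, $\Omega$ is the
volume of a hypersphere, and $\phi_0$ is the vacuum expectation value of $\phi$.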
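
To get a feel for the relation $a^2 = 4 - 2(p + 1)(D - p - 3)/(D - 2)$, the following Python sketch evaluates it exactly for a few familiar branes. The function names are hypothetical. The second helper, `metric_exponents`, encodes an assumption read off from the usual form of this class of solutions when the dilaton coupling vanishes: the harmonic function $1 + k/y^{D-p-3}$ enters the world volume part of the metric with power $-(D-p-3)/(D-2)$ and the transverse part with power $(p+1)/(D-2)$.

```python
from fractions import Fraction

def dilaton_coupling_squared(D, p):
    """a^2 = 4 - 2 (p + 1)(D - p - 3) / (D - 2), evaluated exactly."""
    return 4 - Fraction(2 * (p + 1) * (D - p - 3), D - 2)

def metric_exponents(D, p):
    """Assumed powers of (1 + k / y^(D-p-3)) multiplying dx^2 and dy^2 when a = 0."""
    return (-Fraction(D - p - 3, D - 2), Fraction(p + 1, D - 2))

for D, p in [(11, 2), (11, 5), (10, 1), (10, 3), (10, 5)]:
    print(f"D={D}, p={p}: a^2 = {dilaton_coupling_squared(D, p)}")

print(metric_exponents(11, 2))   # membrane in D = 11
print(metric_exponents(11, 5))   # 5-brane in D = 11
```

The coupling vanishes for the membrane and 5-brane in 11 dimensions, consistent with the absence of a dilaton in $D = 11$ supergravity, while the string and 5-brane in 10 dimensions both give $a^2 = 1$.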

It is now useful to write the specific solution for various p-branes. We recall
that M-theory has both membranes and 5-branes in 11 dimensions. Inserting
these values into the ansatz, we find that the membrane solution is given by
the supergravity metric

$$
ds^2 = \left(1 + \frac{k}{y^6}\right)^{-2/3} dx^\mu dx_\mu + \left(1 + \frac{k}{y^6}\right)^{1/3} \left(dy^2 + y^2\, d\Omega_7^2\right), \qquad (15.1.6)
$$

where $d\Omega_7$ is the volume form for the $S^7$ sphere, and the four-form field
strength is proportional to the dual of the volume form on $S^7$. (This solution, it
can be shown, is actually divergent at the origin, meaning that it probably
corresponds to a fundamental solution.)

Similarly, the 5-brane soliton of the $D = 11$ supergravity theory is given
by splitting the vector $x^M = (x^\mu, y^m)$, where $\mu = 0, 1, 2, 3, 4, 5$ and
$m = 6, \ldots, 10$. Then the metric tensor is given by

$$
ds^2 = \left(1 + \frac{k}{y^3}\right)^{-1/3} dx^\mu dx_\mu + \left(1 + \frac{k}{y^3}\right)^{2/3} \left(dy^2 + y^2\, d\Omega_4^2\right), \qquad (15.1.7)
$$

and the four-form field strength is proportional to the volume form on $S^4$.
(Unlike the membrane solution, the 5-brane solution is finite at the origin, so
it is probably not fundamental.)

Since we can construct these p-brane solutions to the equations of motion,
we find that string theory necessarily contains these p-brane states. In other
words, even if we start with a theory defined purely on strings, we will
necessarily have to introduce these p-brane states. Moreover, dual relations can be
written between the various p-brane states, so the definition of what is
“fundamental” and what is “composite” is rather obscure. One viewpoint is to adopt
p-democracy, i.e., all p-branes are of equal importance, since via duality we
can convert them into other p-branes. String theory, however, may be slightly
“more equal” than the other p-branes, in the sense that string theory has a
well-defined perturbation theory, while membranes, as we shall see, do not.


15.2 Supermembrane Action 

Not only can we construct these p-brane states corresponding to classical
solutions to the supergravity equations, we can also write their supersymmetric
actions in the Green-Schwarz formalism [5].

To construct these actions, let us first count the number of physical states in
the action. To have a supersymmetric action, the number of physical bosonic
and fermionic modes must be equal to each other. The p-brane coordinate
has $D$ degrees of freedom, but we have to subtract the degrees of freedom
corresponding to reparametrization invariance in (p + 1)-dimensional space.
Thus, if we only have an action with $X^\mu$ and $\theta$ as the variables, then we
must have

$$
D - p - 1 = \tfrac{1}{4} M N, \qquad (15.2.1)
$$

where $M$ equals the dimension of the spinor and $N$ is the number of
supersymmetries in the theory. We have to divide by 4, since, by local kappa
invariance, we halve the number of fermion fields, which is halved again when going
on-shell. This relation is easily satisfied for the string (in 10 dimensions) and the
membrane (in 11 dimensions) where the right-hand side is equal to 8.
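
The matching condition $D - p - 1 = MN/4$ can be checked for the two cases mentioned above. In this Python sketch the spinor dimensions are assumptions that the text does not state explicitly: $M = 16$ with $N = 2$ for the 10-dimensional string (Majorana-Weyl spinors) and $M = 32$ with $N = 1$ for the 11-dimensional membrane (Majorana spinor). The function names are hypothetical.

```python
from fractions import Fraction

def bosonic_dof(D, p):
    """Transverse degrees of freedom of a p-brane embedded in D dimensions."""
    return D - p - 1

def fermionic_dof(M, N):
    """Spinor components left after kappa symmetry and the on-shell condition."""
    return Fraction(M * N, 4)

# (D, p, M, N); the values of M and N are assumed, see the lead-in.
cases = {
    "string in D=10": (10, 1, 16, 2),
    "membrane in D=11": (11, 2, 32, 1),
}

for name, (D, p, M, N) in cases.items():
    b, f = bosonic_dof(D, p), fermionic_dof(M, N)
    print(f"{name}: bosonic = {b}, fermionic = {f}, match = {b == f}")
```

Both cases give eight bosonic and eight fermionic degrees of freedom, in agreement with the value 8 quoted for the right-hand side.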

Now let us write the action for the membrane in the Green-Schwarz
formalism. Let us introduce

$$
\Pi_i^\mu = \partial_i X^\mu - i \bar\theta\, \Gamma^\mu \partial_i \theta, \qquad (15.2.2)
$$

where $\theta$ is a spinor defined in D-dimensional space. Notice that this
combination is invariant under the global supersymmetry transformation

$$
\delta X^\mu = i \bar\epsilon\, \Gamma^\mu \theta, \qquad \delta\theta = \epsilon. \qquad (15.2.3)
$$

Then the first part of the action is given by a simple generalization of the
Nambu-Goto action

$$
S_1 = -T \int d^{p+1}\sigma\, \sqrt{-\det\, \Pi_i \cdot \Pi_j}. \qquad (15.2.4)
$$

Although $S_1$ is trivially invariant under a global supersymmetry
transformation, it fails to transform correctly under a local supersymmetry
transformation, and hence the numbers of fermionic and bosonic degrees of freedom do
not match.

To correct this problem, we introduce a second contribution to the action, 
which is a Wess-Zumino term. We begin by introducing an invariant form h 
defined by 


h = 2^! n ^- nA “ de ' (15-2.5) 

Notice that this form $h$ is invariant under the previous global supersymmetry
transformation. The point of introducing $h$ is that we can now introduce a form
$b$, where

$$
h = db, \qquad (15.2.6)
$$

where we demand that $dh = 0$. Because $d\Pi^\mu = i\, d\bar\theta\, \Gamma^\mu d\theta$, we have

$$
(d\bar\theta\, \Gamma_\mu\, d\theta)(d\bar\theta\, \Gamma^{\mu \mu_1 \cdots \mu_{p-1}}\, d\theta) = 0. \qquad (15.2.7)
$$

This, in turn, forces us to have

$$
(\Gamma_\mu P)_{(\alpha\beta} (\Gamma^{\mu \mu_1 \cdots \mu_{p-1}} P)_{\gamma\delta)} = 0, \qquad (15.2.8)
$$

where $P$ is the chirality projection operator, if $\theta$ is a chiral spinor. For the
case $p = 1$, this yields the well-known constraint that $D = 3, 4, 6, 10$, which gives
us the Green-Schwarz string. This new identity, however, forces us to obey a
new constraint, which is given by contracting the identity with
$(\Gamma_\nu)^{\alpha\beta}$. After a bit of work, we find once again that
$D - p - 1 = MN/4$, as before.

Then the Wess-Zumino action is given by 

S 2 = -2 T f *b = - {p l^ f dP+l ° (15.2.9) 

If we write this out in detail for the supermembrane, we find for the Wess- 
Zumino term 

5 2 = -f/ d 3 a {{€ iik 6T^ did) [n?ri£ + d k d 

- \{er P dje){er d k e)]). 


(15.2.10) 





Now let us check that this action is locally supersymmetric. Let $\delta\theta$ be
undetermined at this point. Then we find


&b = ...Wider (15.2.11) 

For the case $p = 2$, we find

55 = 2iT j d 3 a 80 ( v '=£* <t r i - 3,0 

= iT j d 3 a 50(1 - cOV^'r, 3,0, (15.2.12) 

where 


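$$
\Gamma_i = \Pi_i^\mu \Gamma_\mu, \qquad \Gamma_{ij} = \Pi_i^\mu \Pi_j^\nu \Gamma_{\mu\nu}, \qquad
\Gamma = \frac{1}{6\sqrt{-g}}\, \epsilon^{ijk}\, \Pi_i^\mu \Pi_j^\nu \Pi_k^\rho\, \Gamma_{\mu\nu\rho}. \qquad (15.2.13)
$$

We see therefore that $\delta S = 0$ if we choose

$$
c = \pm 1, \qquad \delta\theta = (1 \pm \Gamma)\, \kappa(\sigma), \qquad (15.2.14)
$$

where $\frac{1}{2}(1 \pm \Gamma)$ is a projection operator.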

With the kappa symmetry, we can reduce the 32 components of the spinors
down to 16. By going on shell, we then have eight degrees of freedom, which
matches the degrees of freedom from the bosonic sector.

We should also point out that the membrane action can be reformulated in a
curved superspace background. Then we must introduce a classical
supervierbein $E_M^A$ as well as a third-rank super tensor $B_{ABC}$, where $A, B, C$
represent the supertangent space indices and $M$ represents the super space-time
index of the base manifold. When we impose kappa symmetry on this supermembrane
action, we find that $E_M^A$ and $B_{ABC}$ obey the standard torsion constraint
equations found in $D = 11$ supergravity theory. In other words, the supermembrane
action is only self-consistent when it moves in a background given by classical
$D = 11$ supergravity. This, in turn, provides a powerful consistency check on
this formalism.


15.3 5-Branes and D-Branes 

Originally, when the theory of membranes was being explored, the possibility
of higher bosonic fields within the p-brane action besides $X^\mu$ was largely
ignored. However, because of the large number of BPS states that must be
accounted for, we will introduce tensor and vector states as well on the world
volume.

For example, if we introduce a tensor field into our counting, we now have 
(D-p- \)+(^ P ~ 2 ^ = \MN, 


(15.3.1) 





where the first two terms represent the string and two-form degrees of freedom, 
respectively. For D = 11, the solution for this is p = 5, giving us a 5-brane. 

The world volume action for such a 5-brane, however, posed serious
problems. The action must introduce a propagating self-dual antisymmetric tensor
field. But various no-go theorems have been proposed which forbid a naive
covariant action describing such fields. This means, in general, that we must
introduce an infinite number of auxiliary fields. However, recently a proposal
has been made for a much simpler action, involving only a single auxiliary field
[6, 7]. In this chapter, we will be more interested in constructing the action
for D-branes [8, 9], which have proven to be pivotal to solving a number of
problems.

To see the essential role that D-branes play in string theory, consider the
Type IIB string, which has two sets of second-rank massless tensor fields $B$
and $B'$. The second tensor field $B'$ comes from the Ramond-Ramond sector
of the theory. The Type IIB string has a charge only coming from $B$, since the
coupling to the source is given by

$$
\epsilon^{ij}\, \partial_i X^\mu \partial_j X^\nu B_{\mu\nu}, \qquad (15.3.2)
$$

but there is no coupling of $B'$ to the world volume in the same fashion. This
means that the Type IIB string has no charge under $B'$.

Let us describe a state by $(n, m)$, where $n$ and $m$ represent the charges
under $B$ and $B'$. The fundamental Type IIB string, for example, is denoted by
(1, 0). In general, a duality transformation transforms a BPS state with charges
$(n, m)$ into another state via an $SL(2, Z)$ transformation. Since the Type IIB
string does not carry a charge under $B'$, we must introduce another object,
the D-string, to carry this charge. Not surprisingly, this D-string was missed
in earlier investigations of string theory because it is a purely nonperturbative
object, since the $SL(2, Z)$ symmetry which turns string states into D-string states
is nonperturbative.
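
The action of $SL(2, Z)$ on the charge pair $(n, m)$ can be sketched in a few lines of Python. The convention used here is an assumption for illustration only: charges are treated as a column vector acted on from the left by an integer matrix of unit determinant, and the two generators are taken to be $S = \begin{pmatrix} 0 & -1 \\ 1 & 0 \end{pmatrix}$ and $T = \begin{pmatrix} 1 & 1 \\ 0 & 1 \end{pmatrix}$. Other sign and ordering conventions appear in the literature.

```python
def is_sl2z(matrix):
    """True if `matrix` is a 2 x 2 integer matrix with determinant 1."""
    (a, b), (c, d) = matrix
    return all(isinstance(x, int) for x in (a, b, c, d)) and a * d - b * c == 1

def act(matrix, charges):
    """Apply an SL(2, Z) matrix to a charge pair (n, m), treated as a column vector."""
    if not is_sl2z(matrix):
        raise ValueError("matrix is not in SL(2, Z)")
    (a, b), (c, d) = matrix
    n, m = charges
    return (a * n + b * m, c * n + d * m)

S = ((0, -1), (1, 0))   # assumed convention for the S generator
T = ((1, 1), (0, 1))    # assumed convention for the T generator

fundamental_string = (1, 0)
d_string = act(S, fundamental_string)
bound_state = act(T, d_string)

print(d_string)      # the (1, 0) string is rotated into a (0, 1) state
print(bound_state)   # a further T transformation gives a (1, 1) state
```

In this convention the $S$ generator exchanges the fundamental string with a state carrying only $B'$ charge, which is the role played by the D-string in the discussion above.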

In addition, there is a surprisingly large family of such D-branes in lower
dimensions. The key point is that they are BPS states. Hence, they are not
renormalized by quantum corrections, and can be used to analyze the nonperturbative
nature of M-theory.

If we analyze the super translation algebra in 11 dimensions, we recall
that we only have $p = 2$ and $p = 5$ M-branes. But if we compactify the
super translation algebra, then the $p = 2$ and $p = 5$ states in 11 dimensions
decompose into a very large family of p-branes in lower dimensions, many
of which correspond to D-branes. In general, we will have even-dimensional
D-branes for the Type IIA string, and odd-dimensional D-branes for the Type
IIB string.

Now let us try to analyze the counting of states for a D-brane. In addition
to the string variable $X^\mu$ defining the (p + 1)-dimensional world volume,
we now introduce a vector field, which contributes $p - 1$ additional degrees
of freedom to the physical states. So we must modify our previous formula
$D - p - 1 = \frac{1}{4} MN$ to the following:

$$
\text{D-brane:} \quad D - 2 = \tfrac{1}{4} M N. \qquad (15.3.3)
$$

Notice that $p$ has dropped out of the calculation. Comparing this with
the R-R states generated by Type IIA and IIB strings, we find that D-branes
can match them perfectly. This allows us to complete the counting of BPS
states, which is a powerful check on the self-consistency of our nonperturbative
analysis of duality.
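
The statement that $p$ drops out can be verified directly. The short Python sketch below adds the $D - p - 1$ transverse scalars to the $p - 1$ physical components of a world volume vector and compares the sum with $MN/4$. The values $M = 16$, $N = 2$ for 10-dimensional Type II theories are assumptions not stated in this passage, and the function name is hypothetical.

```python
def d_brane_bosonic_dof(D, p):
    """Transverse scalars plus the physical components of a vector on the world volume."""
    scalars = D - p - 1
    vector = p - 1          # a vector in (p + 1) dimensions has (p + 1) - 2 physical components
    return scalars + vector

D, M, N = 10, 16, 2         # assumed values for Type II theories in 10 dimensions
fermionic = M * N // 4

counts = {p: d_brane_bosonic_dof(D, p) for p in range(1, 10)}
print(counts)
print(all(c == fermionic for c in counts.values()))
```

Every value of $p$ gives $D - 2 = 8$ bosonic degrees of freedom, which equals the fermionic count.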

Although D-branes have now proven their utility in many areas of M-theory,
D-branes were first introduced [8, 9] when studying the $T$ duality
of the ordinary bosonic string in 26 dimensions. We recall that a closed string,
under $T$ duality, transforms as

$$
X^\mu(z) + \bar X^\mu(\bar z) \to X^\mu(z) - \bar X^\mu(\bar z). \qquad (15.3.4)
$$

But a curious effect occurs when we make a T -duality transformation on an 
open bosonic string. The standard open string expansion of X^ transforms into 
X M as follows: 






- ia'pu, In zz 

(15.3.5) 

m^O 

(15.3.6) 

- ia'p „ In 

(15.3.7) 

1 ^ m^O 

(15.3.8) 


The key point is that the usual Neumann boundary condition for $X^\mu$ becomes
a Dirichlet boundary condition for $\tilde X^\mu$:

$$
\partial_\sigma X^\mu = 0 \;\to\; \partial_\tau \tilde X^\mu = 0, \qquad (15.3.9)
$$

so that $\tilde X^\mu$ is held fixed at the endpoints, where $\tilde X^\mu$ is the
dual string.

Notice that Dirichlet boundary conditions on the open string imply that the
endpoint is fixed in space-time along a hyperplane. For the 26-dimensional
bosonic string, this means that the endpoints of the dual open string lie on a
fixed (24 + 1)-dimensional hyperplane. But since string theory is a theory of
quantum gravity, this hyperplane must move with time, so we are therefore
forced to introduce a new object: a new type of p-brane on which strings can
end, the Dirichlet-brane or D-brane. This is often taken as the definition of the
D-brane: a p-brane upon which open strings can end. This can be generalized
to the arbitrary case. If $k$ out of $D$ total dimensions are compactified, then $T$
duality leads to a Dp-brane with $p = D - 1 - k$.

From this, we can also see that D-branes are BPS states. For example, in
a Type IIB theory, if the parameters of the supersymmetry transformation are
given by $\epsilon_L$ and $\epsilon_R$ for the left- and right-moving sectors, then they have the
same chirality

$$\Gamma^0 \cdots \Gamma^9 \epsilon_L = \epsilon_L, \qquad \Gamma^0 \cdots \Gamma^9 \epsilon_R = \epsilon_R. \tag{15.3.10}$$

Now impose the open string boundary conditions, i.e., Neumann
boundary conditions for $X^\mu$ when $0 \le \mu \le p$, and Dirichlet boundary
conditions for $p + 1 \le \mu \le 9$.

By supersymmetry, this condition is reflected in the fermionic modes as
follows:

$$\epsilon_L = \Gamma^{p+1} \cdots \Gamma^9 \epsilon_R. \tag{15.3.11}$$

But these two conditions on $\epsilon_L$ and $\epsilon_R$ can only be satisfied for half the
supersymmetric transformations and only for odd $p$. Hence, they are BPS
states.

Likewise, we can show that for Type IIA strings, $\epsilon_L$ and $\epsilon_R$ have different
chiralities, and both constraints can only be satisfied for half the supersymmetric
transformations and only for even $p$.

Let us now see how to introduce gauge symmetry into a theory with D-branes.
Let $\tilde X^{25}$ represent the T-dual string with Dirichlet boundary conditions
on the twenty-fifth coordinate. If we integrate, then we have

$$\int d\sigma\, \partial_\sigma \tilde X^{25} = \tilde X^{25}(\pi) - \tilde X^{25}(0) = 2\pi\alpha' p^{25} = 2\pi\alpha' n/R = 2\pi n \tilde R, \tag{15.3.12}$$

where we have used $\tilde R = \alpha'/R$. But since we have compactified the twenty-fifth
direction for the dual theory, the endpoints are separated by an integer
multiple of the dual circumference $2\pi\tilde R$, which means that both ends of the
string must lie on the same D-brane. Since each open string carries $U(1)$
symmetry, we can only introduce $U(1)^N$ symmetry in this way.

Non-Abelian gauge symmetry is introduced in Type I strings via Chan-Paton
factors. For example, we recall that the Type I string incorporates $O(32)$
symmetry because the open strings have isospin matrices attached at each end.
Vertex functions appear with an explicit Lie algebra matrix $\lambda^a_{ij}$ in them, which
gives rise to a trace over these matrices, $\mathrm{Tr}\, \lambda^a \lambda^b \lambda^c \lambda^d$, for scattering amplitudes.

For the Type I string with gauge symmetry, there is the Chan-Paton factor
$|ij\rangle$ at the end of the string, where these indices are in the fundamental
representation of $U(N)$, so $i = 1, 2, \ldots, N$. Consider a gauge transformation. The
$i$th element will pick up a phase $e^{-i\theta_i}$, while the $j$th element picks up the
opposite phase $e^{i\theta_j}$, so the vertex function picks up a phase

$$|ij\rangle \to e^{i(\theta_j - \theta_i)} |ij\rangle. \tag{15.3.13}$$

The vector state $|ij\rangle$ also has momentum $p^{25}$ associated with it, which, after
compactification, also picks up a phase factor under $X^{25} \to X^{25} + 2\pi R$. We
find, therefore, that the additional phase picked up by the gauge field yields an
extra $\theta_i/(2\pi R)$ to the momentum.





The possible momentum for the vertex function is now equal to

$$p^{25} = (2\pi n + \theta_j - \theta_i)/(2\pi R). \tag{15.3.14}$$

In order to determine where the open strings end, take the integral
$\int d\sigma\, \partial_\sigma \tilde X^{25}$. We now find

$$\tilde X^{25}(\pi) - \tilde X^{25}(0) = (2\pi n + \theta_j - \theta_i)\tilde R. \tag{15.3.15}$$

The point is that open strings can start and end on different D-branes.
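The relations (15.3.14) and (15.3.15) can be checked numerically with a short sketch. The function names, the choice $\alpha' = 1$, the sample phases, and the identification of the $i$th D-brane position with $\theta_i \tilde R$ are assumptions made here for illustration; they are not taken from the text.

```python
import math

def dual_radius(R, alpha_prime=1.0):
    """Dual radius R~ = alpha'/R, as used in (15.3.12)."""
    return alpha_prime / R

def momentum_25(n, theta_i, theta_j, R):
    """Quantized momentum of the |ij> state, following (15.3.14)."""
    return (2 * math.pi * n + theta_j - theta_i) / (2 * math.pi * R)

def endpoint_separation(n, theta_i, theta_j, R, alpha_prime=1.0):
    """Separation of the dual string endpoints, following (15.3.15)."""
    return (2 * math.pi * n + theta_j - theta_i) * dual_radius(R, alpha_prime)

R = 2.0
thetas = [0.0, 0.5, 1.2]                          # hypothetical phases, one per brane
positions = [t * dual_radius(R) for t in thetas]  # assumed brane positions theta_i * R~

# A string with n = 0 stretched from brane 0 to brane 2 spans their separation.
print(endpoint_separation(0, thetas[0], thetas[2], R), positions[2] - positions[0])
```

With equal phases the separation collapses to a multiple of $2\pi\tilde R$, which is the single-brane situation of (15.3.12).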

If we have $N$ distinct noninteracting D-branes, then we only have $U(1)^N$
symmetry, since there are only $N$ gauge fields. Our goal, however, is to restore
$U(N)$ invariance [10]. To do this, let the D-branes gradually merge into one
D-brane. Then we have $\theta_i \to \theta_j$, and the full $U(N)$ symmetry is restored. The
vector state $|ij\rangle$ no longer picks up a phase factor from the isospin
transformation. After the D-branes merge, we have $N^2$ gauge fields, since a gauge
field arises when an open string connects the $i$th and $j$th D-branes. For $N$
D-branes, we naturally have $N^2$ gauge fields, which is the desired number to
complete $U(N)$ symmetry.

In summary, we find that when the D-branes are separated, we have $N$
massless vector fields, one for each of the hyperplanes. But as the hyperplanes
converge, the number of massless vectors rises to $N^2$, which is sufficient to
generate a $U(N)$ symmetry. The lesson here is that the gauge group associated
with $N$ parallel and distinct D-branes is $U(1)^N$, which becomes $U(N)$ as the
D-branes become coincident. This will prove crucial when we discuss bound
states of $N$ D 0-branes.
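The counting of massless vectors can be made concrete with a minimal sketch. It assumes that a string stretched from brane $i$ to brane $j$ is massless only when the two branes sit at the same transverse position; the function name and the tolerance parameter are hypothetical.

```python
def massless_vector_count(positions, tol=1e-9):
    """Count oriented open strings (i, j) joining coincident branes.

    Assumption: the lightest state of the string from brane i to brane j
    is massless only if the two branes are at the same position.
    """
    return sum(1 for xi in positions for xj in positions if abs(xi - xj) < tol)

print(massless_vector_count([0.0, 1.0, 2.0]))  # three separated branes
print(massless_vector_count([0.0, 0.0, 0.0]))  # three coincident branes
```

Separated branes give $N$ states (only $i = j$ contributes), while coincident branes give $N^2$. A partially merged configuration interpolates between the two, with each cluster of $k$ coincident branes contributing $k^2$.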


15.4 D-Brane Actions 

The action for D-branes can also be constructed in several ways. 

In the open string case, the background fields include the $U(1)$ gauge field
$A_m$. The coupling of the string to the background field is given by

$$\int ds \sum_{m=0}^{p} A_m(X^0, \ldots, X^p)\, \partial_t X^m + \int ds \sum_{i=p+1}^{25} A_i(X^0, \ldots, X^p)\, \partial_n X^i, \tag{15.4.1}$$

where the massless vector field only depends on $x^0, \ldots, x^p$, the coordinates
describing the $p$-brane. We compactify the coordinates labeled by $i$. One
important point is that $A_i$ describes the fluctuations transverse to the brane.
Because of this, the $p$-brane is dynamical, rather than being a fixed hyperplane.
As before, one integrates out the higher string modes, imposing the beta
function relation $\beta = 0$. This, in turn, gives us the equations of motion of the
background fields, including the $U(1)$ field. By explicitly performing all these
steps, we find the D-brane action given by the Dirac-Born-Infeld action

$$T \int d^{p+1}\sigma \sqrt{-\det(G_{ij} + F_{ij} - B_{ij})}, \tag{15.4.2}$$

where $G$ and $B$ are the usual terms formed from the pull-back to the membrane
surface, i.e., $G_{ij} = \partial_i X^\mu \partial_j X^\nu g_{\mu\nu}$, and where $F_{ij}$ represents the value of the
field strength $F_{\mu\nu}$ at the boundary of the $p$-brane for the Dirichlet conditions.

The fact that $F_{ij} - B_{ij}$ occurs in this particular combination can be seen
from the fact that the gauge transformation of the $B$ field, which usually
cancels to zero, now picks up a boundary term, which must be cancelled by a
corresponding change in the vector field, so that $F_{ij} - B_{ij}$ is left unchanged:

$$B_{ij} \to B_{ij} + \partial_i \chi_j - \partial_j \chi_i, \qquad A_i \to A_i + \chi_i. \tag{15.4.3}$$


There is yet another way in which to derive the D-brane action, by compactifying
higher $p$-branes [11, 12]. We can start with the $p = 2$ M-brane action
of M-theory and dimensionally reduce it down one dimension [10, 11]. The
resulting 10-dimensional theory should contain both the usual Type IIA string
as well as the $p = 2$ membrane, which is the D-brane.

We recall that the supermembrane action in 11 dimensions in flat space is
built out of the generic term $g^{ij} \Pi_i^M \Pi_{jM}$, where $M$ is an 11-dimensional index.
If we separate out the eleventh dimension, and define $\phi = X^{11}$, then we get

$$g^{ij} \Pi_i^\mu \Pi_{j\mu} + g^{ij} (\partial_i \phi - i\bar\theta \Gamma^{11} \partial_i \theta)(\partial_j \phi - i\bar\theta \Gamma^{11} \partial_j \theta). \tag{15.4.4}$$

Now we wish to replace the $\phi$ field everywhere with a vector field $L_i$, such that
$\partial_i \phi = L_i$. This is accomplished by adding to the action a Lagrange multiplier
$A_i$:

$$\epsilon^{ijk} A_i \partial_j L_k. \tag{15.4.5}$$

If we eliminate the $A_i$ field from the action, then it enforces the constraint
$\partial_i L_j - \partial_j L_i = 0$, which is solved by setting $L_i = \partial_i \phi$, so we can replace $\partial_i \phi$
with $L_i$ everywhere.

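So far, we have done nothing, since by eliminating $A_i$ we retrieve the original
action. Now, let us take a different path and eliminate $L_i$ instead. Then the
relevant terms in the action are

$$\epsilon^{ijk} A_i \partial_j L_k + g^{ij} L_i L_j + \cdots. \tag{15.4.6}$$

Eliminating $L_i$ from the action yields the $U(1)$ Maxwell term

$$F(A)_{ij} F_{kl}(A)\, g^{ik} g^{jl}. \tag{15.4.7}$$

Putting everything together, we find

$$S = -\frac{1}{2} \int d^3\sigma \sqrt{-g} \left[ g^{ij} \Pi_i^\mu \Pi_{j\mu} + \tfrac{1}{2} g^{ik} g^{jl} \mathcal{F}_{ij} \mathcal{F}_{kl} - 1 \right] \tag{15.4.8}$$

$$\qquad - \frac{1}{6} \int d^3\sigma\, \epsilon^{ijk} \left[ b_{ijk} - 3i \bar\theta \Gamma^{11} \partial_i \theta\, \mathcal{F}_{jk} \right], \tag{15.4.9}$$

where

$$\mathcal{F}_{ij} = F_{ij} - b_{ij}, \tag{15.4.10}$$

and $b_{ijk}$ and $b_{ij}$ are complicated functions of $\Pi$ and $\theta$, arising from the original
$b_{MNP}$ found in the membrane action, in which the $\partial_i X^{11}$ components have been
explicitly removed.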

In summary, we find that the low-energy D-brane action is the usual Nambu-Goto
action with an additional $U(1)$ Maxwell action. It has the structure of a
Born-Infeld action.

Similarly, we can also calculate the coefficient that appears in front of the
D-brane action, i.e., the tension $T$ or energy per unit volume. We recall that the
tension of the fundamental string ($T_{F1}$) is given by $T_{F1} = 1/(2\pi\alpha') = 2\pi m_s^2$.
Let us denote the tension of the supermembrane by $T_{M2}$, which is equal to
$2\pi m_p^3$, where $m_p$ is the Planck mass. If we compactify the supermembrane on
a circle of radius $R$, this pulls an additional factor of $2\pi R$ out of the metric tensor.
Since M-theory compactified on a circle is Type IIA string theory, we can
derive the tension of the Type IIA string from the supermembrane

$$T_{F1} = 2\pi R\, T_{M2}. \tag{15.4.11}$$

We can also derive the tension of the 10-dimensional membrane (the D 2-brane)
by placing the 11-dimensional supermembrane in a background space
where a transverse dimension is circular. The two membranes have the same
tension

$$T_{D2} = 2\pi m_s^3/g_s = T_{M2}, \tag{15.4.12}$$

where we have used the fact, found earlier, that $2\pi R m_p = g_s^{2/3}$.

This is a special case of the tension of the D $p$-brane

$$T_{Dp} = 2\pi m_s^{p+1}/g_s. \tag{15.4.13}$$

Notice in particular that the tension of the D-brane goes as $1/g_s$, which differs
from the behavior of a soliton, which goes as $1/g_s^2$.

We can also check this formula for the higher D-branes. For the 5-brane, the
tension is given by $T_{M5} = 2\pi m_p^6$. But the D 4-brane found in Type IIA theory is
given by compactifying this 5-brane, so we have

$$T_{D4} = 2\pi R\, T_{M5}. \tag{15.4.14}$$

We also see that this is equal to $2\pi m_s^5/g_s$, as predicted.

Also, if the 5-brane is not wrapped around $S^1$, then we have a 5-brane in 10
dimensions, which is called the NS 5-brane. It must have the same tension as
the 5-brane

$$T_{NS5} = T_{M5} = 2\pi m_s^6/g_s^2. \tag{15.4.15}$$





(Notice that the 5-brane tension goes as $1/g_s^2$, which is typical of a soliton,
rather than a D-brane.)

Because there are several ways in which to obtain the action of a D-brane (by 
compactifying M-theory or by compactifying Type IIA or Type IIB theories) 
we have several ways in which to derive the tension of lower-dimensional 
D-branes. Each time, we find that they are all consistent. 
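The consistency of these tensions can be verified numerically. The sketch below takes as inputs only relations quoted in this section ($T_{M2} = 2\pi m_p^3$, $T_{M5} = 2\pi m_p^6$, $T_{F1} = 2\pi m_s^2$, and $2\pi R m_p = g_s^{2/3}$); the function name `tensions` and the choice of units $m_p = 1$ are assumptions for illustration.

```python
import math

def tensions(g_s, m_p=1.0):
    """Return brane tensions in units set by the Planck mass m_p."""
    R = g_s ** (2.0 / 3.0) / (2 * math.pi * m_p)   # from 2*pi*R*m_p = g_s^(2/3)
    T_M2 = 2 * math.pi * m_p ** 3
    T_M5 = 2 * math.pi * m_p ** 6
    T_F1 = 2 * math.pi * R * T_M2                  # (15.4.11)
    m_s = math.sqrt(T_F1 / (2 * math.pi))          # invert T_F1 = 2*pi*m_s^2

    def T_Dp(p):                                   # (15.4.13)
        return 2 * math.pi * m_s ** (p + 1) / g_s

    return {
        "R": R, "m_s": m_s, "T_M2": T_M2, "T_M5": T_M5, "T_F1": T_F1,
        "T_D2": T_Dp(2), "T_D4": T_Dp(4),
        "T_NS5": 2 * math.pi * m_s ** 6 / g_s ** 2,   # (15.4.15)
    }

for g_s in (0.1, 0.7, 2.0):
    t = tensions(g_s)
    # Each ratio compares the two sides of (15.4.12), (15.4.14), (15.4.15).
    print(g_s,
          t["T_D2"] / t["T_M2"],
          t["T_D4"] / (2 * math.pi * t["R"] * t["T_M5"]),
          t["T_NS5"] / t["T_M5"])
```

Each ratio comes out equal to 1 up to floating-point rounding for any coupling, because $m_s^2 = g_s^{2/3} m_p^2$ follows from the inputs.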

In this way, we can also derive the tension for the $(p, q)$ string found in Type
IIB theory. We can start with the usual supermembrane of M-theory, and then
wrap one of its cycles around a $(p, q)$ cycle of a torus. Since the minimum
length of this cycle is proportional to $|p + q\tau|$, the tension of the $(p, q)$ string
can be derived from the tension of the supermembrane

$$T_{p,q} = 2\pi |p + q\tau|\, m_s^2, \tag{15.4.16}$$

where $p$ and $q$ are relatively prime integers. This formula agrees with the
tension of the $(1, 0)$ string, which is the original Type IIB string. The tension
of the $(0, 1)$ string gives us the standard tension of the D 1-brane.
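A minimal sketch of (15.4.16) follows. It assumes the usual Type IIB parametrization $\tau = C_0 + i/g_s$, which is not spelled out in this section; the function name and the `axion` parameter (standing for $C_0$) are hypothetical.

```python
import math

def pq_string_tension(p, q, g_s, m_s=1.0, axion=0.0):
    """Tension 2*pi*|p + q*tau|*m_s^2 of a (p, q) string, as in (15.4.16).

    Assumption: tau = axion + i/g_s.
    """
    if math.gcd(p, q) != 1:
        raise ValueError("p and q must be relatively prime")
    tau = complex(axion, 1.0 / g_s)
    return 2 * math.pi * abs(p + q * tau) * m_s ** 2

g_s = 0.5
print(pq_string_tension(1, 0, g_s))  # fundamental string: 2*pi*m_s^2
print(pq_string_tension(0, 1, g_s))  # D 1-brane: 2*pi*m_s^2/g_s
```

With a vanishing axion the $(0, 1)$ tension reduces to $2\pi m_s^2/g_s$, matching (15.4.13) at $p = 1$.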

Now that we have derived the action for the D-brane, as well as the tension 
appearing in the action, the most immediate application of D-branes is to 
analyze the proof that M-theory compactified on a circle is equivalent to Type 
IIA string theory. Crucial to the proof was the statement that the Kaluza-Klein 
modes resulting from compactified M-theory correspond to BPS saturated 
modes in 10 dimensions. We saw earlier that the momentum of the eleventh 
dimension is quantized according to 


$$p_{11} = \frac{N}{R}. \tag{15.4.17}$$

The N = 0 term corresponds to the ordinary string, the N = 1 corresponds to 
a single D 0-brane, and the higher N modes correspond to bound states of D 
0-branes. 

Now consider the case of $N$ parallel and flat D-branes, with an open string
attached between each pair. Each end of the open string has a $U(1)$ gauge field
associated with it. Now let the $N$ parallel D-branes gradually merge into a single
hyperplane. Then we find that additional massless vector boson states emerge,
which yield a gauge theory based on $U(N)$ [12]. Now redo the calculation
of the D-brane action for coincident D-branes using the $\beta = 0$ condition.
If the branes are widely separated, then we arrive at the usual $U(1)$ Dirac-Born-Infeld
action. But now perform the same $\beta = 0$ calculation, letting the
D-branes slowly coincide. We then find that coincident D-branes are described
by 10-dimensional $U(N)$ super-Yang-Mills theory reduced down to the $p + 1$
space of the D-brane.

This action can also be simplified by taking the original $p$-brane action and
choosing the gauge $X^m = \sigma^m$, which creates a $\delta_{mn}$ in the Born-Infeld action. If
we power expand around this delta function, then we arrive at the Yang-Mills
action to lowest order, plus its coupling to the $p$-brane degrees of freedom.





Notice that the fields are only functions of $p + 1$ variables, not the entire
10-dimensional space. This means that many of the terms drop out

$$F_{mn} = \partial_m A_n - \partial_n A_m + i[A_m, A_n], \tag{15.4.18}$$

$$F_{mj} = \partial_m X_j + i[A_m, X_j], \tag{15.4.19}$$

$$F_{ij} = i[X_i, X_j], \tag{15.4.20}$$

where $i, j$ represent the modes $i = p + 1, p + 2, \ldots, 9$, and the $p$-brane modes
are represented by $m = 0, 1, 2, \ldots, p$.

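Consider the case $p = 0$, representing a point-particle in a weak field. For
the D 0-brane, this means that all the membrane coordinates have disappeared
except for one

$$\begin{aligned}
F_{ij} &= i[X^i, X^j], \\
F_{0j} &= D_0 X^j = \partial_0 X^j + i[A_0, X^j], \\
D_j \theta &= i[X_j, \theta], \\
D_0 \theta &= \partial_0 \theta + i[A_0, \theta].
\end{aligned} \tag{15.4.21}$$

So the action reduces to

$$S = T \int dt\, \mathrm{Tr}\left( \tfrac{1}{2} (D_0 X^i)^2 - i\theta^T D_0 \theta + \tfrac{1}{4} ([X^i, X^j])^2 + \theta^T \gamma^j [X_j, \theta] \right). \tag{15.4.22}$$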

Notice that we are now treating the p-brane variables as if they were space- 
time variables. (The action for the D 0-brane will play an essential role when 
we discuss M(atrix) theory in the next section.) 

For the more general case, we have the bosonic action for the D-brane

$$S \sim \int d^{p+1}\sigma\, \mathrm{Tr}\left( -\tfrac{1}{4} F_{mn} F^{mn} + \tfrac{1}{2} D_m X^i D^m X^i + \tfrac{1}{4} [X^i, X^j]^2 \right), \tag{15.4.23}$$

so the action resembles a $U(N)$ gauge field in $p + 1$ space-time dimensions
interacting with a scalar field $X^i$. When we add fermionic fields, the theory
becomes the standard supergauge theory. Notice that this action is just the 10- 
dimensional super-Yang-Mills action dimensionally reduced down to p + 1 
space-time coordinates. (In fact, this is how the super-Yang-Mills actions were 
originally calculated years ago by dimensionally reducing 10-dimensional 
supergauge theory.) 
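The commutator term in these actions has a simple consequence that a short sketch can illustrate: it vanishes whenever the matrices $X^i$ commute. The code assumes Hermitian matrices, for which the potential is $V = -\tfrac{1}{4} \sum_{i,j} \mathrm{Tr}\, [X^i, X^j]^2 \ge 0$; the helper names and the $2 \times 2$ test matrices are hypothetical choices, not taken from the text.

```python
def matmul(a, b):
    """Product of two square matrices given as nested lists."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def commutator(a, b):
    """[a, b] = ab - ba."""
    ab, ba = matmul(a, b), matmul(b, a)
    n = len(a)
    return [[ab[i][j] - ba[i][j] for j in range(n)] for i in range(n)]

def trace(a):
    return sum(a[i][i] for i in range(len(a)))

def potential(xs):
    """V = -(1/4) * sum_{i,j} Tr([X^i, X^j]^2) for Hermitian matrices X^i."""
    total = 0
    for xi in xs:
        for xj in xs:
            c = commutator(xi, xj)
            total += trace(matmul(c, c))
    return (-0.25 * total).real

sigma1 = [[0, 1], [1, 0]]
sigma2 = [[0, -1j], [1j, 0]]
diag_a = [[1, 0], [0, -1]]
diag_b = [[2, 0], [0, 5]]

print(potential([sigma1, sigma2]))  # noncommuting matrices cost energy
print(potential([diag_a, diag_b]))  # commuting (diagonal) matrices do not
```

Simultaneously diagonal matrices, whose eigenvalues can be read as the positions of separated D-branes, therefore span flat directions of the potential.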

Let us now briefly summarize some of the properties of D-branes: 

• By definition, D-branes are p-branes on which open strings can end. 

• They carry Ramond-Ramond charge. Thus, in Type IIB string theory, they
allow us to make an $SL(2, Z)$ transformation between 1-branes carrying
charges $(n, m)$.

• They are BPS states, and hence play an integral role in determining the 
nonperturbative nature of string theory. For Type IIA strings, they have 
even p. For Type IIB strings, they have odd p. 





• Because they are solitonic, there is a “no force” condition between pairs 
of D-branes. (The gravitational and dilatonic attraction is cancelled by the 
electrostatic repulsion.) 

• The absence of a force between D-branes means that they can easily be 
piled on top of each other. This, in turn, allows us to create D-branes of 
arbitrary charge. 

• The action of a D-brane is given by a Born-Infeld action with an effective
$U(1)$ gauge field, where the vector potential lives on the $p$-brane surface.
But one can also create $U(N)$ gauge theories by placing $N$ D-branes on top
of each other. When we add fermions, the action becomes a supergauge
theory in $p + 1$ dimensions.

• The dynamics of interacting D-branes is given by the excitations of the 
open string that connects them. Thus, a potentially intractable problem 
of determining the scattering of D-branes reduces to the almost trivial 
problem of analyzing conformal field theory. 

• The tension of the D-brane goes as $1/g_s$, while the more familiar soliton
mass goes as $g_s^{-2}$.

• The $SL(2, Z)$ symmetry found in the previous chapter on Seiberg-Witten
theory can now be reanalyzed. A stack of D 3-branes of Type IIB theory
is described by $N = 4$ supergauge theory in four dimensions, which is
known to have $SL(2, Z)$ dualities. But this is just a remnant of the $SL(2, Z)$
symmetry of the original Type IIB D 3-brane.


15.5 M(atrix)-Theory and Membranes 

One of the most important problems in M-theory is to find a proper definition 
of the theory. At present, all we really know about M-theory is that it contains 
$D = 11$ supergravity in the low-energy limit and it reduces to Type IIA string
theory when compactified on a circle.

Several attempts have been made at a definition of M-theory, with surprising
results. The first is matrix models [13, 14], in which M-theory in the infinite
momentum frame emerges in the $N \to \infty$ limit of 10-dimensional D 0-branes.
The second is CFT/ADS duality, in which M-theory and strings in anti-de Sitter
space emerge in the 't Hooft limit $g^2 N \to \infty$ of a conformal gauge theory in four
dimensions.

Let us begin with matrix models, based on the following conjecture: 

Conjecture. The infinite momentum limit of M-theory is equivalent to the
$N \to \infty$ limit of $N$ coincident D 0-branes, given by $U(N)$ super-Yang-Mills
theory.

This conjecture is a special case of a more powerful conjecture involving
finite $N$. If we take M-theory in the light cone gauge and then compactify
it in the light-like direction, then an integer $N$ naturally emerges labeling the
Kaluza-Klein modes. This is called the discrete light cone gauge. The second
conjecture states that $N$ D 0-branes, for finite $N$, approximate M-theory in
the discrete light cone gauge. What is unusual about this second conjecture is
that it works for finite $N$, without taking any limits.

At first, it may seem counterintuitive that a theory as rich and powerful
as M-theory, which contains membranes and 5-branes, could be reduced to a
simple theory of point-particles, i.e., $U(N)$ Yang-Mills theory. It furthermore
seems quite miraculous that the $N \to \infty$ limit of a 10-dimensional gauge
theory should reproduce a fully 11-dimensional theory. Somehow, the eleventh
dimension emerges out of this $N \to \infty$ limit.

However, we recall that the supersymmetric translation algebra in 11 dimensions
contains the term $P_M$, which decomposes into $P_\mu$ and $Z = p_{11}$ after
compactification on a circle. Notice that $Z$ corresponds to the charge of a Type
IIA D 0-brane, given by the Kaluza-Klein quantization condition

$$p_{11} = \frac{N}{R} \tag{15.5.1}$$

for integer $N$. As we saw earlier, the Kaluza-Klein states of the compactified
11-dimensional M-theory are equivalent to the nonperturbative 10-dimensional
D 0-brane states.

Now take the infinite momentum limit, i.e., let $Z = p_{11} \to \infty$. In this case,
intuitively the D 0-brane term dominates over all other terms in the
supertranslation algebra, so that the theory reduces to that of a D 0-brane. This
infinite momentum limit, in turn, can be realized by taking the limit $N \to \infty$
and $p_{11} \to \infty$.

Notice that a single D 0-brane has momentum given by $1/R$, and that a bound
state of $N$ D 0-branes (which coincide) has momentum $N/R$. Furthermore, one
can show that this bound state of $N$ D 0-branes is described by supersymmetric
quantum mechanics (not quantum field theory). The string state corresponds
to $N = 0$. This is because strings cannot be the source for the R-R fields,
which are represented by the central charge $Z$.

The essential reason why this conjecture works is that the infinite momentum
frame allows one to cancel infinite classes of Feynman diagrams as $p_{11} \to \infty$,
which is one of the reasons why it was introduced years ago. We will find that
the only diagrams which survive this limit, in fact, are the D 0-branes. This is
the key observation which makes the conjecture work.

To see this, let $p_a$ label the momenta of a collection of particles. Let the sum
of these vectors equal $P$. Then each momentum $p_a$ can be decomposed as

$$p_a = \eta_a P + p_{\perp a} \tag{15.5.2}$$

such that

$$\sum_a \eta_a = 1, \qquad P \cdot p_{\perp a} = 0. \tag{15.5.3}$$

Now let us boost the system of particles such that $P \to \infty$. For sufficiently
large $P$, we see that all $\eta_a$ are positive.





The point of taking this large $P$ limit is that we can write the energy as

$$E_a = \sqrt{p_a^2 + m^2} = \eta_a |P| + \frac{p_{\perp a}^2 + m^2}{2\eta_a |P|} + O(P^{-2}). \tag{15.5.4}$$

We see that the covariant propagators start to resemble nonrelativistic energy
denominators found in nonrelativistic perturbation theory. Let us focus on the
differences in energy.

If two particles have positive and equal $\eta_a$, we see that their energy difference
goes to zero as $1/P$. But now analyze what happens to the energy difference
between a particle with positive $\eta_a$ and another with negative $\eta_a$ (which occurs
because we must integrate over all momenta in field theory). The energy
difference grows as $P$. Thus, this energy denominator damps as $1/P \to 0$,
so that particles with $\eta_a < 0$ decouple from the theory. Thus, our conclusion
is that states with negative or vanishing $\eta_a$ have energy denominators which
make them decouple from the theory, leaving only the states with positive $\eta_a$.
But in matrix models, the only states with positive $\eta_a$ are those states with
momentum $p_{11} = N/R$, where $N > 0$. Thus, M-theory in the infinite momentum
limit reduces to a theory of particles with positive $\eta_a$, i.e., states with
positive $p_{11}$, which are the D 0-branes. These states, in turn, are described by
super-Yang-Mills theory defined on the world volume of the D-brane. The
states with vanishing or negative $\eta_a$ are the string states and anti-D-branes,
which decouple from the theory because of infinite energy denominators.
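The scaling of the energy denominators can be checked numerically from the exact energies. In the sketch below, the sample values of $\eta_a$, $p_{\perp a}^2$, and $m^2$ are arbitrary assumptions chosen so that $\sum_a \eta_a = 1$ in every state; the function names are hypothetical.

```python
import math

def energy(eta, P, p_perp_sq, m_sq):
    """Exact energy sqrt(p^2 + m^2) with p = eta*P + p_perp and P . p_perp = 0."""
    return math.sqrt((eta * P) ** 2 + p_perp_sq + m_sq)

def state_energy(particles, P):
    """Total energy of a state listed as (eta, p_perp^2, m^2) triples."""
    return sum(energy(eta, P, pp, mm) for eta, pp, mm in particles)

initial = [(1.0, 0.0, 1.0)]
all_positive = [(0.3, 0.25, 1.0), (0.7, 0.25, 1.0)]
one_negative = [(-0.2, 0.25, 1.0), (1.2, 0.25, 1.0)]

for P in (1e2, 1e3, 1e4):
    d_pos = state_energy(all_positive, P) - state_energy(initial, P)
    d_neg = state_energy(one_negative, P) - state_energy(initial, P)
    # d_pos * P and d_neg / P should each settle to a constant as P grows.
    print(P, d_pos * P, d_neg / P)
```

The intermediate state with only positive $\eta_a$ has an energy denominator that falls off as $1/P$, while the state containing a negative $\eta_a$ has one that grows linearly in $P$, which is the decoupling described above.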

We also find convincing proof of this conjecture when we investigate the 
scattering of D 0-branes, and find they exactly reproduce the scattering of 
11-dimensional supergravitons [15-18], which is a highly nontrivial result. 

Lastly, we also see evidence of the matrix model conjecture in the fact that 
11-dimensional membranes which appear in M-theory can also be viewed as 
collections of D 0-branes. This is a rather remarkable result, since the lack of 
a suitable quantization scheme for the 11-dimensional membrane was one of 
the many reasons why it was abandoned in the 1980s. The reasons included: 

• The action was highly nonlinear. The Hamiltonian contained a quartic 
term, and hence calculating the spectrum of states was intractable. Unlike 
the string Hamiltonian, which was quadratic and hence solvable, the free 
membrane Hamiltonian was highly nonlinear, making a rigorous analysis 
of its spectrum impossible. 

• The Hamiltonian was not bounded from below. This meant that a local¬ 
ized wavefunction was not stable, but leaked from the membrane via an 
infinite network of tiny threads. Hence, the theory was not stable quantum 
mechanically. 

• The world volume action was ultraviolet divergent, and hence a sigma 
model approach to quantization was impossible. 





• The interactions of membranes were totally unknown. We cannot, as in
string theory, simply sum over the set of all two-dimensional surfaces with
handles, since, in general, mathematicians have not been able to classify
all possible three-dimensional surfaces with handles.


For these reasons, the theory of supermembranes was largely abandoned. 
But the new interpretation of the 11-dimensional membrane theory is that the 
theory arises in the large N limit of D 0-branes, and hence all the problems of 
membrane theory can, in principle, be removed. For example, the instability of 
the membrane can now be explained because it is, in some sense, a collection 
of D 0-branes. 

Let us begin our analysis by taking the light cone gauge on the supermembrane
action. We will break up the world volume metric by separating out the
zeroth time index. Let $a, b = 1, 2$ represent the space-like indices on the
world volume, and let $i, j = 0, 1, 2, \ldots, p$ represent the usual world volume
indices. Then

$$g_{ij} = \begin{pmatrix} \omega^{-2}\left[-h + h_{ab} u^a u^b\right] & -\omega^{-1} h_{ab} u^b \\ -\omega^{-1} h_{ab} u^b & h_{ab} \end{pmatrix}, \tag{15.5.5}$$

where $h_{ab}$ represents the metric over the $2 \times 2$ spatial coordinates. Notice that we
have exchanged the $(p+1)(p+2)/2$ components of $g_{ij}$ for the $p + 1$ coordinates
$\omega$, $u^a$, and the $p(p+1)/2$ coordinates $h_{ab}$. (This particular parametrization
is taken from the ADM variables used to quantize ordinary gravity in the
canonical formalism. $u^a$ and $\omega$ are then the usual shift and lapse functions.)

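The equations simplify if we let $p = 2$. If we introduce the notation

$$\{f, g\} = \epsilon^{ab} \partial_a f\, \partial_b g, \tag{15.5.6}$$

then the gauge-fixed membrane action for the bosonic coordinates is

$$S = \frac{1}{2} \int d\tau \int d^2\sigma \left[ (\partial_0 X^I - \{\omega, X^I\})^2 - \tfrac{1}{2} \{X^I, X^J\}\{X^I, X^J\} \right]. \tag{15.5.7}$$

If we add in the fermionic variables, then the full Hamiltonian is

$$H = \int d^2\sigma \left[ \tfrac{1}{2} P_I^2 + \tfrac{1}{2} h^{ab}\, \bar\theta \gamma_a \gamma\, \partial_b \theta + \tfrac{1}{4} \{X^I, X^J\}\{X^I, X^J\} \right], \tag{15.5.8}$$

where $\gamma_a = \partial_a X^I \gamma_I$ and $\gamma = \tfrac{1}{2} \epsilon^{ab} \partial_a X^I \partial_b X^J \gamma_{IJ}$.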

In this equation, we see all the pathologies of the theory. The Hamiltonian
is quartic in the string variable $X^I$, meaning that it is extremely difficult to
quantize the theory. The free theory of membranes is thus highly nonlinear
and basically intractable. This alone has discouraged research on membranes.

Second, and more important, we see that there are directions in which this 
quartic term actually vanishes. Since this term corresponds to the surface area 
of a membrane, we see that it vanishes if the membrane degenerates into a 
long, infinitely thin line with zero area. 





If we imagine that the membrane looks like a porcupine with millions of 
tiny “quills” emanating from it, we see that the wave function of a membrane 
can “leak” out from these quills, and hence the wave function is not stable. 
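The vanishing of the quartic term along degenerate directions can be seen in a small numerical sketch of the bracket (15.5.6). It assumes the convention $\epsilon^{12} = +1$ and uses central finite differences; the function names and the two sample embeddings are hypothetical.

```python
def bracket(f, g, s1, s2, h=1e-5):
    """{f, g} = eps^{ab} d_a f d_b g at (s1, s2), with eps^{12} = +1 assumed."""
    def d1(F):
        return (F(s1 + h, s2) - F(s1 - h, s2)) / (2 * h)

    def d2(F):
        return (F(s1, s2 + h) - F(s1, s2 - h)) / (2 * h)

    return d1(f) * d2(g) - d2(f) * d1(g)

def quartic_density(xs, s1, s2):
    """(1/4) * sum_{I,J} {X^I, X^J}^2, the potential density at one point."""
    return 0.25 * sum(bracket(xi, xj, s1, s2) ** 2 for xi in xs for xj in xs)

# A flat membrane: X^1 = sigma_1, X^2 = sigma_2.
flat = [lambda s1, s2: s1, lambda s1, s2: s2]
# A collapsed, string-like configuration: both coordinates depend on sigma_1 only.
quill = [lambda s1, s2: s1, lambda s1, s2: 3.0 * s1 ** 2]

print(quartic_density(flat, 0.3, 0.7))
print(quartic_density(quill, 0.3, 0.7))
```

The flat membrane has a nonzero area density, while the string-like configuration costs no potential energy no matter how long it is, which is the origin of the "quills".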

We also have the problem that the theory is ultraviolet divergent on the world 
volume, which means that quantization of the theory is problematic. 

Some insight into this problem can be seen if we compare this theory with
the action of ordinary super-Yang-Mills theory defined with only one time
coordinate [19]. Let us introduce $A^I = \tau^a A^{aI}$, where $\tau^a$ is a generator of
$U(N)$. Then the action becomes

$$S = \frac{1}{2} \int d\tau\, \mathrm{Tr}\left\{ (D_0 A^I)^2 - \tfrac{1}{2} [A^I, A^J][A^I, A^J] + i\bar\psi D_0 \psi + i\bar\psi \gamma^I [A^I, \psi] \right\}. \tag{15.5.9}$$

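Remarkably, we see a strong resemblance between the two theories.
Basically, we wish to make the following correspondence between $U(N)$
Yang-Mills theory and membranes

$$A^I(\tau) \leftrightarrow X^I(\tau, \sigma_1, \sigma_2), \qquad [\cdot, \cdot] \leftrightarrow \{\cdot, \cdot\}, \qquad \mathrm{Tr} \leftrightarrow \int d^2\sigma. \tag{15.5.10}$$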


The key is that we have taken the limit as $N$ goes to infinity. In this limit,
$U(N)$ becomes $U(\infty)$, which (if we carefully take the limit in a certain way)
becomes $w(\infty)$, which corresponds to area-preserving diffeomorphisms for
spherical membranes and tori. The two-dimensional coordinate $(\sigma_1, \sigma_2)$ of
the membrane thus emerges out of the index $a$ of the $U(N)$ group.

In the matrix model interpretation, the membrane is seen as merely a certain 
collection of D 0-branes. The instability of the membrane is now seen as a 
simple consequence of the “no-force” condition usually found for solitons. 
Thus, the matrix model has given us a new interpretation of the membrane as 
a peculiar bound state of D 0-branes. 

This also solves a puzzle found earlier. In the matrix model approach, there
is the danger that, by singling out the D 0-branes and eliminating the supermembrane
contribution, we are accidentally dropping an essential contribution
from the supermembrane. But the fact that $U(N)$ supergauge theory, in the
$N \to \infty$ limit, becomes the supermembrane theory solves this puzzle.


15.6 Black Holes 

Lastly, we should mention briefly that string theory can also give us a statistical
derivation of the Bekenstein-Hawking entropy formula [20-24]. Previously,
this celebrated formula, relating the entropy of a black hole to its area, was
calculated using thermodynamical arguments. (We have the black hole equation
$dM = (1/8\pi G)\kappa\, dA$, where $M$ is the mass of the black hole, $A$ its
area, and $\kappa$ its surface gravity. This formula, in turn, is similar to the
thermodynamic equation relating entropy to energy, $dE = T\, dS$.) The
Bekenstein-Hawking relationship then states that the entropy $S$ of a black hole
is equal to one-fourth the area of its event horizon

$$S = \frac{A}{4}. \tag{15.6.1}$$

However, this was unsatisfactory, since ideally one would want to calculate
the entropy of a black hole by statistical mechanics, i.e., by counting its
microscopic states. We want to write $S = k \log W$, where $W$ counts the
quantum states. This, of course, requires a quantum theory
of gravity.

Because string theory can now give us a quantum theory of black holes,
one should be able to derive the Bekenstein-Hawking entropy formula by
counting black hole states. This is done by taking a Type IIB string theory and
compactifying it down on a five-torus to a five-dimensional manifold. A
Type IIB theory has both D 1-branes and D 5-branes. In particular, it can be
shown that the states of the black hole reduce to counting the states of a string
which is stretched between these two D-branes.

If we wrap the D 1-brane around a circle, the string carries a charge $Q_1 = 1$.
Likewise, wrapping a D 5-brane around a four-torus gives us a charge of
$Q_5 = 1$. Now it is easy to calculate the total number of states of a string
stretched between these two D-branes. We recall, for ordinary string theory,
that the degeneracy of harmonic oscillator states for a conformal field theory
at level $L_0 = n$ and central charge $c$ is given by

$$d(n, c) \to \exp\left(2\pi \sqrt{nc/6}\right) \tag{15.6.2}$$

for large $n$.

Thus, we need to calculate $c$ for these strings. In a simple conformal field
theory, with $N_B$ species of bosons and $N_F$ fermions, $c$ is given by

$$c = N_B + \tfrac{1}{2} N_F, \tag{15.6.3}$$

where $N_B = N_F = 4Q_1$, so that $c = 6Q_1$. We thus find that

$$d(n, c) \to \exp\left(2\pi \sqrt{Q_1 n}\right) \tag{15.6.4}$$

for $Q_5 = 1$.

But we know that the system is dual between the 1-brane and 5-brane. Thus,
the final result should be written in terms of the product $Q_1 Q_5$. Therefore we
can relax the condition $Q_5 = 1$ and write the answer in terms of this
product. Thus, the final answer is given by $S = \log d$, or

$$S = 2\pi \sqrt{Q_1 Q_5 n}. \tag{15.6.5}$$




If we rewrite the area of the black hole in terms of $Q_1$ and $Q_5$, then we find
precisely $S = A/4$.
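The asymptotic degeneracy (15.6.2) can be compared with an exact count in the simplest case. The sketch below uses the fact that the level-$n$ degeneracy of a single free boson ($c = 1$) is the number of integer partitions of $n$; the function names are hypothetical, and the assignment $N_B = N_F = 4 Q_1 Q_5$ in the last function is an assumption that reproduces (15.6.5).

```python
import math

def partitions(n_max):
    """Exact degeneracies d(n) of one free boson (c = 1): partition numbers."""
    d = [1] + [0] * n_max
    for part in range(1, n_max + 1):
        for n in range(part, n_max + 1):
            d[n] += d[n - part]
    return d

def cardy_log_degeneracy(n, c):
    """Leading large-n estimate log d(n, c) ~ 2*pi*sqrt(n*c/6), as in (15.6.2)."""
    return 2 * math.pi * math.sqrt(n * c / 6.0)

def d1_d5_entropy(Q1, Q5, n):
    """Entropy (15.6.5) from c = N_B + N_F/2 with N_B = N_F = 4*Q1*Q5 (assumed)."""
    c = 4 * Q1 * Q5 + 0.5 * (4 * Q1 * Q5)
    return cardy_log_degeneracy(n, c)

d = partitions(1000)
for n in (100, 1000):
    # The ratio of exact to asymptotic log-degeneracy approaches 1 from below.
    print(n, math.log(d[n]) / cardy_log_degeneracy(n, 1))
print(d1_d5_entropy(2, 3, 10), 2 * math.pi * math.sqrt(2 * 3 * 10))
```

The ratio approaches 1 only slowly because the exact count carries power-law prefactors that the leading exponential ignores, which is why the formula is stated for large $n$.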

We see that the key to this calculation was the fact that the counting of 
statistical states was dominated by the counting of the states of a string, which 
is well known. Although this result was derived for a five-dimensional black 
hole using Type IIB strings, the result is probably quite general and has been 
shown to hold down to four dimensions for a variety of compactifications and 
backgrounds. Similarly, the calculation can be carried out for rotating black 
holes. 


15.7 CFT/ADS Duality 

The matrix model approach, although it has survived many challenges, still
has conceptual problems. The fact that M-theory, with all its rich and diverse
structure, can be fundamentally defined in terms of a 10-dimensional,
noncovariant Yang-Mills action in the $N \to \infty$ limit is quite remarkable.
However, it is hard to believe that this noncovariant 10-dimensional Yang-Mills
action can provide a fundamental and essential definition of M-theory.
Only time will tell.

The second way in which a definition of M-theory can be written comes 
from CFT/ADS duality [25]. Unlike the matrix model approach, anti-de Sitter 
duality is covariant. (However, it is defined not in 11 dimensions, but in four!) 

Anti-de Sitter space first emerged in discussions of supersymmetry when 
it was found that supergravity with global 0(N ) symmetry became locally 
invariant under the O (N) group when defined in anti-de Sitter space. 

Furthermore, when the $D = 11$ supergravity theory was defined in anti-de
Sitter space, we could use the Freund-Rubin ansatz to find classical solutions
of the equations of motion of supergravity

$$F_{\mu\nu\rho\sigma} \sim \epsilon_{\mu\nu\rho\sigma}. \tag{15.7.1}$$

This was a simple but elegant way to break the 11-dimensional space down
to $AdS_4 \times S_7$.

Furthermore, one could also generalize the Freund-Rubin ansatz. One could
take the dual of the tensor and set that equal to the $\epsilon$ tensor, which breaks
space-time down to $AdS_7 \times S_4$. And lastly, one can also take the same type of
ansatz for the Type IIB superstring field tensor, giving us $AdS_5 \times S_5$.

However, one of the reasons for the recent intense interest in anti-de Sitter
spaces is the surprising observation made by Maldacena [25] that we can
write dual relationships between weakly coupled string theories and strongly
coupled four-dimensional QCD-type theories. One important class of dualities
is between Type IIB string theory defined in 10 dimensions and $N = 4$
four-dimensional superconformal gauge theory. Because this duality links weakly
coupled strings to strongly coupled field theory, this may give us deep insight
into the nonperturbative nature of gauge theories. Other dual relationships
can be found between M-theory membranes and three- and six-dimensional
superconformal theories.

What is surprising about this duality is that it links two theories with different
symmetry groups, different space-time dimensions, and different numbers of
supersymmetry generators. In particular, we can define a duality between a
10-dimensional supergravity theory defined in anti-de Sitter space, with 32
supersymmetric charges, and a four-dimensional $N = 4$ superconformal
Yang-Mills theory with $SU(N)$ symmetry and 16 supersymmetric charges.

It should be stressed that not all aspects of this strange duality have been 
worked out, and in fact there may be severe roadblocks that prevent us from 
probing the nonperturbative nature of QCD with quarks and gluons. Indeed, 
it may turn out that this duality allows us to make nonperturbative statements
about theories which resemble QCD, but differ from it in crucial ways.
However, the fact that dualities can be written at all is remarkable. 

These new developments make use of an observation, made years ago by ’t
Hooft, who analyzed gauge theories with $SU(N)$ symmetry and then calculated
their behavior for large $N$. He found that in the limit $N \to \infty$, where
$g_{YM}^2 N$ is kept fixed, the planar Feynman diagrams defined over
two-dimensional world sheets dominated the perturbation series. This gave rise
to speculation that gauge theory, in the ’t Hooft limit, could be approximated
by string theory. However, efforts over the years to prove or disprove this
conjecture were inconclusive. That is one reason why the CFT/ADS duality is so
interesting, because it may eventually resolve this long-standing question and
shed light on the confinement problem in QCD.

Specifically, one can show that Type IIB superstrings on the anti-de Sitter space
$AdS_5 \times S_5$, with string coupling constant $g_s$, appear to be dual to $N = 4$
superconformal Yang-Mills theory in the large $N$ limit, where $g_s = g_{YM}^2$,
and where $g_{YM}^2 N$ is large but held fixed. In the ’t Hooft limit, this means that
$g_{YM}$ and $g_s$ are both small. Thus, the weak coupling limit of string theory
(approximated by supergravity) allows us to probe the strong coupling region
of $N = 4$ superconformal gauge theory.

But what is especially interesting is that one can take different limits on 
M-string theory. By taking suitable limits on the M-theory 5-brane, and then 
compactifying two more dimensions, one can even yield dualities with non- 
supersymmetric theories in four dimensions, such as QCD (without quarks). 
The ability of superstring theory and supergravity to probe the nonperturbative 
region of QCD has thus excited much attention. 

We begin with the observation made earlier that D-branes have gauge fields
$A_\mu$ which live on them. If we have $N$ parallel D-branes which coincide, then
the $A_\mu$ field is elevated to a $U(N)$ gauge field. For a D3-brane, for example, we
find that, in the weak coupling regime, the $U(N)$ gauge field lives entirely on
the D3-brane itself, and hence is a function of only four-dimensional variables,
which are now interpreted to be space-time dimensions. In particular, if we
start with $N$ parallel Type IIB D3-branes and bring them together, we arrive
at the familiar four-dimensional $N = 4$ superconformal gauge theory.





Before, we wrote the general solutions to the supergravity equations of
motion involving p-branes in D dimensions. For N coincident IIB branes, we
find the 10-dimensional metric to be

$$ds^2 = f^{-1/2}(-dt^2 + dx\,dx) + f^{1/2}(dr^2 + r^2\,d\Omega_{8-p}^2), \tag{15.7.2}$$

where

$$f = 1 + \left(\frac{g_{YM}^2 N}{\alpha'^2}\right) U^{p-7}, \qquad U = \frac{r}{\alpha'}, \qquad e^{2\phi} = g_s^2 f^{(3-p)/2}, \qquad g_{YM}^2 = g_s(\alpha')^{(p-3)/2}. \tag{15.7.3}$$

The key variable $U$ is roughly equal to the energy of a string which is stretched a
distance $r$ away from the D-branes. Notice that we have a relationship between
the Yang-Mills coupling constant $g_{YM}$ (which we can calculate by analyzing
the gauge fields living on the world volume of the 3-brane), and the string
coupling constant $g_s$, which we can calculate by analyzing the dilaton field.
The horizon corresponds to the point $r = 0$.

Now, the crucial step in this analysis is to go to the near-horizon region of the
geometry. We can go to $r \sim 0$ by setting $\alpha' \to 0$ and holding $U$ fixed. When
this is done, the 1 appearing in $f$ can be removed, and the metric simplifies
considerably. By setting $p = 3$, we find


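$$ds^2 = \alpha'\left[\frac{U^2}{\sqrt{\lambda}}(-dt^2 + dx\,dx) + \sqrt{\lambda}\,U^{-2}\,dU^2 + \sqrt{\lambda}\,d\Omega_5^2\right], \tag{15.7.4}$$

where $\lambda = g_{YM}^2 N$. Notice that the string coupling is a constant: $e^{\phi} = g_s$.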

Now, let us make one last change of variables: $z = \sqrt{\lambda}/U$. The metric takes
the final form

$$ds^2 = \alpha'\sqrt{\lambda}\left[\frac{-dt^2 + dx\,dx + dz^2}{z^2} + d\Omega_5^2\right]. \tag{15.7.5}$$


The common factor in front of the metric, given by $\alpha'\sqrt{\lambda}$, means that the
radii of the two spaces are the same: $R = \lambda^{1/4}(\alpha')^{1/2}$. Notice that this is the
metric for $AdS_5 \times S_5$.
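The near-horizon step can be checked numerically. The following sketch takes $p = 3$, uses the harmonic function $f = 1 + \lambda/(\alpha'^2 U^4)$ with $U = r/\alpha'$ and $\lambda = g_{YM}^2 N$, and compares the exact metric coefficients with the ones obtained after dropping the 1 in $f$. The function names and the sample values are illustrative assumptions, and numerical factors such as $4\pi$ are ignored, as in the text.

```python
import math


def exact_coefficients(u, alpha_p, lam):
    """Coefficients of (-dt^2 + dx.dx), dU^2 and dOmega_5^2 with the full f."""
    r = alpha_p * u                       # U = r / alpha'
    f = 1.0 + lam / (alpha_p**2 * u**4)   # harmonic function for p = 3
    g_parallel = f**-0.5
    g_uu = f**0.5 * alpha_p**2            # dr^2 = alpha'^2 dU^2
    g_sphere = f**0.5 * r**2
    return g_parallel, g_uu, g_sphere


def near_horizon_coefficients(u, alpha_p, lam):
    """Same coefficients after dropping the 1 in f (AdS_5 x S_5 form)."""
    s = math.sqrt(lam)
    return alpha_p * u**2 / s, alpha_p * s / u**2, alpha_p * s


if __name__ == "__main__":
    for alpha_p in (1e-1, 1e-2, 1e-4):
        exact = exact_coefficients(1.3, alpha_p, 7.0)
        approx = near_horizon_coefficients(1.3, alpha_p, 7.0)
        print(alpha_p, [e / a for e, a in zip(exact, approx)])
```

As $\alpha'$ decreases at fixed $U$, the three ratios tend to 1, which is the sense in which the 1 in $f$ can be dropped.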

Several observations can be made at this point: 

• If we go to the ’t Hooft limit, where $N \to \infty$ while keeping $g_{YM}^2 N$ large but
fixed, we find that $g_s$ is vanishingly small. This means that we can show
the duality between free strings and the nonperturbative region of $N = 4$
superconformal Yang-Mills theory.

• The metric is now defined in $AdS_5 \times S_5$ space, with the radii of the AdS
space and the $S_5$ sphere being equal. The radius, which goes as $\lambda^{1/4}$, is
large in the ’t Hooft limit. Because the radii are large and equal, one can
show that the theory is actually conformally flat. The theory, in fact, is
superconformally invariant.

• The original Type IIB string had 32 supersymmetries. The N = 4 theory 
only has 16. The remaining 16 emerge nonlinearly because of the existence 
of superconformal invariance. 

• The isometry group of $AdS_5$ is $SO(4, 2)$, while the isometry group of $S_5$ is $O(6)$.
But $SO(4, 2)$ is the conformal group in four dimensions, which can also
be written locally as $SU(2, 2)$. Likewise, $O(6)$ can be written as $SU(4)$
locally. If we include the supersymmetric generators, then we have the
complete isometry group $SU(2, 2|4)$. Thus, the representation space of
states must fall into multiplets of $SU(2, 2|4)$.

• At first, it may seem strange that a duality may be defined between a
four-dimensional theory and a 10-dimensional one. This is resolved by
realizing that the $N = 4$ superconformal theory lives on the four-dimensional
boundary of the five-dimensional anti-de Sitter space. Thus, we have an
example of “holography,” i.e., that the physical information of the bulk of
five-dimensional anti-de Sitter space is somehow faithfully encoded in its
four-dimensional boundary.

• It can be shown that this duality exists between anti-de Sitter supergravity
and $SU(N)$ supergauge theory (not $U(N)$, as one might naively expect).

Notice that the above analysis does not constitute a rigorous proof of this 
remarkable duality. As we mentioned, there are many loopholes that still have 
to be investigated. However, this duality has so far survived numerous checks 
that have been made between these two seemingly distinct theories. 
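The scaling of the parameters in the ’t Hooft limit can be made concrete with a few lines of arithmetic. This sketch uses only the relations quoted above, $\lambda = g_{YM}^2 N$, $g_s = g_{YM}^2$ and $R = \lambda^{1/4}(\alpha')^{1/2}$, in the text's normalization with numerical factors dropped. The function name and sample values are illustrative assumptions.

```python
def thooft_parameters(lam, n, alpha_p=1.0):
    """Return (g_s, R) for 't Hooft coupling lam and N colors.

    Uses lam = g_YM^2 * N, g_s = g_YM^2 and R = lam^(1/4) * alpha'^(1/2).
    """
    g_ym_sq = lam / n
    g_s = g_ym_sq
    radius = lam**0.25 * alpha_p**0.5
    return g_s, radius


if __name__ == "__main__":
    for n in (10, 10**3, 10**6):
        g_s, radius = thooft_parameters(100.0, n)
        # g_s falls like 1/N while the common radius stays fixed and large.
        print(n, g_s, radius)
```

With $\lambda$ held fixed, the string coupling falls as $1/N$ while the common radius of $AdS_5$ and $S_5$ is unchanged, which is why classical supergravity becomes a useful approximation.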

The construction of this surprising duality is not an isolated case. We can
find similar dualities for a wide variety of other examples. In addition to the
Type IIB D3-brane, the most interesting cases are for the $D = 11$ M-theory,
where we have membranes and 5-branes. Let us analyze the 5-brane case.

On one hand, we know that the world volume action for 5-branes in 11
dimensions reduces to an Abelian gauge theory which lives on the six-dimensional
world volume of the 5-brane. For $N$ coincident 5-branes, the theory
reduces to a six-dimensional non-Abelian gauge theory with $(N_+, N_-) = (0, 2)$
supersymmetry.

On the other hand, we know that the $D = 11$ supergravity equations of
motion have a solution for the 5-brane

$$ds^2 = f^{-1/3}(-dt^2 + dx\,dx) + f^{2/3}(dr^2 + r^2\,d\Omega_4^2). \tag{15.7.6}$$

In this case, we have

$$f = 1 + \frac{\pi N l_P^3}{r^3}, \qquad U^2 = \frac{r}{l_P^3}, \tag{15.7.7}$$

and where $l_P$ is the 11-dimensional Planck length.





We now analyze its near-horizon behavior by taking the limit $l_P \to 0$,
while keeping $U$ fixed. Then we obtain

$$ds^2 = l_P^2\left[(\pi N)^{-1/3} U^2(-dt^2 + dx\,dx) + 4(\pi N)^{2/3}\frac{dU^2}{U^2} + (\pi N)^{2/3}\,d\Omega_4^2\right]. \tag{15.7.8}$$

As before, we can write this metric in anti-de Sitter space. The geometry is
then given by $AdS_7 \times S_4$. The radius $R$ of the sphere and the radius $R_{AdS}$ of
the anti-de Sitter space are given by $R = R_{AdS}/2 = l_P(\pi N)^{1/3}$. The isometry
group of this geometry is given by $SO(2, 6) \times SO(5)$.

This theory, we shall see, can prove useful if we compactify two more 
dimensions to arrive at a four-dimensional theory. 

Similarly, we can analyze the $D = 11$ membrane in much the same way.
The metric is given by

$$ds^2 = f^{-2/3}(-dt^2 + dx\,dx) + f^{1/3}(dr^2 + r^2\,d\Omega_7^2), \qquad f = 1 + \frac{2^5\pi^2 N l_P^6}{r^6}. \tag{15.7.9}$$

We can analyze this metric in the limit $l_P \to 0$, while $U^{1/2} = r/l_P^{3/2}$ is kept
fixed.

The same analysis can be performed as before, yielding the space $AdS_4 \times S_7$,
whose isometry group is $SO(2, 3) \times SO(8)$.

Other dualities are possible to write. For example, we can derive a relationship
between 10-dimensional IIB theory and (1 + 1)-dimensional conformal
field theory. We can start with the IIB theory and compactify it on a
four-dimensional manifold $M_4$, such as the torus $T_4$ or K3. This gives us a
six-dimensional theory. If we began with a IIB D5-brane, then wrapping it up
on the four-dimensional space would give us a combination of six-dimensional
5-branes and strings, the so-called D1 + D5 configuration. The gauge fields
living on the two-dimensional worldsheet of the strings will give us a
two-dimensional theory, namely a (1 + 1)-dimensional (4, 4) superconformal field
theory.

If we have $Q_1$ D-strings and $Q_5$ 5-branes, then we can write the remaining
six-dimensional metric as

$$ds^2 = f_1^{-1/2} f_5^{-1/2}(-dt^2 + dx\,dx) + f_1^{1/2} f_5^{1/2}(dr^2 + r^2\,d\Omega_3^2), \tag{15.7.10}$$

where

$$f_1 = 1 + \frac{g\alpha' Q_1}{v r^2}, \qquad v = \frac{V_4}{(2\pi)^4\alpha'^2}, \qquad f_5 = 1 + \frac{g\alpha' Q_5}{r^2}, \tag{15.7.11}$$

where $V_4$ is the volume of the four-dimensional manifold. Now take the limit
as $\alpha' \to 0$, where $U = r/\alpha'$, $v$, and $g_6 = g/\sqrt{v}$ are all held fixed.





Then the metric reduces to

$$ds^2 = \alpha'\left[\frac{U^2}{g_6\sqrt{Q_1 Q_5}}(-dt^2 + dx^2) + g_6\sqrt{Q_1 Q_5}\,\frac{dU^2}{U^2} + g_6\sqrt{Q_1 Q_5}\,d\Omega_3^2\right], \tag{15.7.12}$$

which has the geometry $AdS_3 \times S_3$. As usual, the 1 appearing in each $f$ has
disappeared, allowing us to establish the near-horizon behavior in terms of anti-de
Sitter space.

In conclusion, we can establish a duality between the (1 + 1)-dimensional
conformal field theory describing the Higgs branch of the D1+D5 system and the
IIB theory on $AdS_3 \times S_3 \times M_4$ for specified values of $Q_1$ and $Q_5$. Since the
dynamics of two-dimensional systems is well known, this gives us a variety of
checks on the correctness of this approach.

Before discussing the consequences of these dualities for gauge theory and
QCD in particular, let us say a few things about the mathematics behind anti-de
Sitter space and the various superconformal groups.


15.8 Anti-de Sitter Space 

In the previous section, we wrote the anti-de Sitter metric using Poincaré
coordinates

$$ds^2 = \frac{1}{z^2}\left(\sum_i dy_i^2 + dz^2\right). \tag{15.8.1}$$

This is not, however, the standard anti-de Sitter space metric found in the
literature. Anti-de Sitter space $AdS_{d+1}$ is defined by the set of points lying on
the hyperboloid

$$\sum_i x_i^2 - \sum_{j=1}^{2} t_j^2 = -\frac{m^2}{8}. \tag{15.8.2}$$

In this space, the cosmological constant is proportional to $-m^2$, so the
cosmological constant is negative. If the cosmological constant were positive, then
we would have ordinary de Sitter space, whose isometry group is $SO(4, 1)$.
The topology of ordinary de Sitter space $dS_{d+1}$ is given by $R_1 \times S_d$.

In general, de Sitter spaces satisfy the condition 

Ruv ~ ^guv = -\Rgnv (15.8.3) 

So the cosmological constant A can be set to ^K, where the curvature R can 
be set to a constant. 





To analyze anti-de Sitter space, let us make the substitution

$$r^2 = \sum_i x_i^2, \qquad t^2 = t_1^2 + t_2^2, \tag{15.8.4}$$

so we replace $t_1$, $t_2$ with polar coordinates $t$, $\theta$. We can now eliminate $t$ since
$r^2 - t^2 = -m^2/8$.

A line element in this space is written as

$$ds^2 = \sum_i dx_i^2 - \sum_{j=1}^{2} dt_j^2 = \frac{dr^2}{1 + r^2} - (1 + r^2)\,d\theta^2 + r^2\,d\Omega^2. \tag{15.8.5}$$

Notice that this has the topology of $S_1 \times R_d$, which means that the time-like
variable $\theta$ is periodic, i.e., we have closed time-like curves in $AdS_{d+1}$ space. This
could have unpleasant consequences, such as cyclic time and quantized
energy eigenvalues. To avoid this problem, we will pass to the universal covering
space, so we allow $\theta$ to range from $-\infty$ to $+\infty$.

We can also write the representations of this space [26-29]. The representations
of $SO(3, 2)$ symmetry are written as $D(E_0, s)$, where $E_0$ is the lowest
energy eigenvalue and $s$ is the total angular momentum.

The unitary representations are the ones where $E_0 \geq s + 1$ for $s = 1, \tfrac{3}{2}, \ldots$
and $E_0 \geq s + \tfrac{1}{2}$ for $s = 0, \tfrac{1}{2}$.

$D(\tfrac{1}{2}, 0)$ and $D(1, \tfrac{1}{2})$ are particularly interesting, because, curiously enough,
they have no counterpart in the Poincaré group. They were discovered by Dirac,
and are called “singletons.”

The massless particles are given by $D(1, 0) \oplus D(2, 0)$ and $D(s + 1, s)$.
Particles with $E_0 > s + 1$ are massive. (For $SO(4, 2)$, the lowest states are given
by doubletons. For $SO(6, 2)$, we have tripletons.)

If we have $N = 1$ supersymmetry, then all the linear representations
reassemble into the following:

I: $D(\tfrac{1}{2}, 0) \oplus D(1, \tfrac{1}{2})$;

II: $D(E_0, 0) \oplus D(E_0 + \tfrac{1}{2}, \tfrac{1}{2}) \oplus D(E_0 + 1, 0)$, $E_0 > \tfrac{1}{2}$;

III: $D(s + 1, s) \oplus D(s + \tfrac{3}{2}, s + \tfrac{1}{2})$, $s = 1, \tfrac{3}{2}$;

IV: $D(E_0, s) \oplus D(E_0 + \tfrac{1}{2}, s + \tfrac{1}{2}) \oplus D(E_0 + \tfrac{1}{2}, s - \tfrac{1}{2}) \oplus D(E_0 + 1, s)$.

Class I corresponds to the singleton supermultiplet, while Class II
corresponds to a Wess-Zumino massless supermultiplet, Class III corresponds to a
massless gauge supermultiplet, and Class IV is a massive gauge supermultiplet.

We can view these singletons as living not in the “bulk” of $AdS_{d+1}$ space-time,
but on its boundary, given by the $d$-dimensional manifold $S_1 \times S_{d-1}$. On
this boundary space, the group $SO(d - 1, 2)$ acts as a conformal group.





It turns out that string theory on AdS 5 x S 5 is not the only dual relationship 
we can write. 

For example, if we start with the generic string action mentioned at the
beginning of this chapter, we can set the dilatonic terms to zero or a constant by
setting $a$ equal to zero, i.e.,

$$a^2 = 4 - \frac{2(p + 1)(D - p - 3)}{D - 2} = 0. \tag{15.8.6}$$

In the limit that the dilatonic term goes to zero or a constant, we have a
conformal theory. Setting this to zero, we find the three solutions:

$$D = 11,\ d = 6; \qquad D = 10,\ d = 4; \qquad D = 11,\ d = 3. \tag{15.8.7}$$

Here $d = p + 1$ is the dimension of the world volume.

Two of these solutions correspond, in 11-dimensional M-theory, to the 
familiar 2-brane and 5-brane. 
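The three solutions can be recovered by brute force. The sketch below scans integer values of $D$ and $p$ and keeps those for which $a^2 = 4 - 2(p + 1)(D - p - 3)/(D - 2)$ vanishes. The scan range (space-time dimension up to 11 and $0 \le p \le D - 3$) and the function names are assumptions made for illustration.

```python
from fractions import Fraction


def a_squared(dim, p):
    """a^2 = 4 - 2 (p + 1)(D - p - 3) / (D - 2), kept exact with fractions."""
    return 4 - Fraction(2 * (p + 1) * (dim - p - 3), dim - 2)


def conformal_branes(max_dim=11):
    """Return (D, d) pairs with a^2 = 0, where d = p + 1 is the world-volume dimension."""
    solutions = []
    for dim in range(3, max_dim + 1):
        for p in range(0, dim - 2):          # p runs from 0 to D - 3
            if a_squared(dim, p) == 0:
                solutions.append((dim, p + 1))
    return solutions


if __name__ == "__main__":
    print(conformal_branes())
```

Writing $\tilde{d} = D - p - 3$, the condition $a^2 = 0$ is equivalent to $(d - 2)(\tilde{d} - 2) = 4$, which is why only a handful of integer solutions exist.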

Let us perform the same manipulations as before on the M-branes. For
the 2-brane, we find the space $AdS_4 \times S_7$. The isometry group is therefore
$SO(3, 2) \times SO(8)$, and the superisometry group is $OSp(4|8)$.

Likewise, when we take the limit of the 5-brane, we find that the theory is
defined on $AdS_7 \times S_4$. Its isometry group is $SO(6, 2) \times SO(5)$, and its
superisometry group is $OSp(6, 2|4)$.


(15.8.8) 


where D represents the dimension of space-time and p the dimension of the 
p- brane. 

Fortunately, we know some of the representations of these groups. For the
2-brane of M-theory, with isometry group $OSp(4|8)$, we have eight scalars
and eight spinors arranged in a supersingleton representation. Notice that this
is precisely the field content of the supermembrane itself. As we mentioned
before, we can view the singletons as living not in the bulk of $AdS_4$ space, but
on its three-dimensional boundary. This leads us to suspect that the membrane
itself is a supersingleton.

This is a bit surprising, since it shows us the tight relationship between the 
supermembrane and its superbackground. If we start with a supermembrane, 
then it moves in a background consistent with an anti-de Sitter space. But the 
boundary of this space is a singleton, which appears to be the membrane itself. 





Similarly, if we start with the D3-brane of IIB string theory, with isometry
group given by $SU(2, 2|4)$, then its doubleton representation is given by one
vector, four spinors, and six scalars. This gives us the field content of the
3-brane, so again we can view the 3-brane as a doubleton living on the boundary
of the anti-de Sitter space. The supersymmetry is $N = 8$.

And lastly, if we start with the 5-brane of M-theory, with isometry group
$OSp(6, 2|4)$, then its tripleton representation is given by one chiral two-form,
eight spinors, and five scalars. This is consistent with the field content of the
5-brane, which can now be viewed as a tripleton living on the boundary of the
anti-de Sitter space. The supersymmetry is $(N_+, N_-) = (0, 2)$.

These representations can be summarized as: 


(15.8.9) 



15.9 AdS and QCD 


Armed with this information, one can make concrete steps toward fleshing out 
this surprising duality. First, let us clarify the relation between fields living on 
the bulk, i.e., the full anti-de Sitter space, and fields which only live on the 
boundary of the space. What will connect the fields on the boundary and fields 
in the bulk will be the Green’s function. Once this is established, then we can 
start to make precise one-to-one relationships between states defined on the 
boundary versus states in the bulk. 

Let a massive Klein-Gordon field $\phi$ live in the anti-de Sitter space. Let the
metric on $AdS_{d+1}$ be represented by

$$ds^2 = \frac{1}{x_0^2}\sum_{i=0}^{d} dx_i^2. \tag{15.9.1}$$

The boundary is specified by $R_d$ space. The boundaries are given by $x_0 = 0$
and $x_0 = \infty$.

The field $\phi$, which lives in the bulk, obviously has values on the boundary
space as well. Let the value of the $\phi$ field on the boundary of the anti-de Sitter
space be given by $\phi_0$. By solving for the Green’s function defined between two
points on the boundary, we can construct the two-point function between two
$\phi_0$ living at these points on the boundary.





For very large $x_0$, we can drop the other variables, so the Green’s function
for the Klein-Gordon equation satisfies the reduced equation

$$\left(-x_0^{d+1}\frac{d}{dx_0}x_0^{-d+1}\frac{d}{dx_0} + m^2\right)K(x_0) = 0, \tag{15.9.2}$$

where we are interested in the boundary at $x_0 = \infty$, so $K$ depends only on
$x_0$. The solution we want has a boundary condition that it vanishes for $x_0 = 0$.
This gives the asymptotic solution $K(x_0) = x_0^{d+\lambda_+}$, where $\lambda_+$ is the larger of the
solutions to $\lambda(\lambda + d) = m^2$.

From this, we can generalize the solution for $K$ for points in the bulk

$$K = \frac{x_0^{d+\lambda_+}}{(x_0^2 + |x|^2)^{d+\lambda_+}}. \tag{15.9.3}$$


If we separate the distance between these two points $x$ and $x'$ and place them
on the boundary, then the long-distance behavior of the two-point correlator
is dominated by the dimension $\Delta$ of the field $\phi_0$ living on the boundary. The
correlation function of two $\phi_0$ fields is given by $\sim 1/|x - x'|^{2\Delta}$.

We find that the correlation function between two $\phi_0$ fields on the boundary
is therefore given by

$$\int dx\,dx'\,\frac{\phi_0(x)\,\phi_0(x')}{|x - x'|^{2(d+\lambda_+)}}. \tag{15.9.4}$$

In this fashion, we can read off the relationship between the mass of the particle
moving in the bulk and the dimension of the field living on the boundary

$$\Delta = \tfrac{1}{2}\left(d + \sqrt{d^2 + 4m^2}\right). \tag{15.9.5}$$

We can therefore establish a one-to-one relationship between the dimensions of 
the states usually found in a four-dimensional conformal theory and the masses 
found in anti-de Sitter supergravity. These relations can be generalized, and 
we find exact agreement. 
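The mass-dimension relation is easy to check against the defining quadratic. The sketch below computes the larger root $\lambda_+$ of $\lambda(\lambda + d) = m^2$ and confirms that $\Delta = d + \lambda_+$ reproduces (15.9.5). The function names and the sample values of $d$ and $m^2$ are illustrative assumptions.

```python
import math


def lambda_plus(d, m_sq):
    """Larger root of lambda * (lambda + d) = m^2."""
    return 0.5 * (-d + math.sqrt(d * d + 4.0 * m_sq))


def conformal_dimension(d, m_sq):
    """Delta = (d + sqrt(d^2 + 4 m^2)) / 2, as in (15.9.5)."""
    return 0.5 * (d + math.sqrt(d * d + 4.0 * m_sq))


if __name__ == "__main__":
    d = 4
    for m_sq in (0.0, 5.0, 12.0):
        lam = lambda_plus(d, m_sq)
        # The exponent d + lambda_+ of (15.9.4) should equal Delta.
        print(m_sq, lam, d + lam, conformal_dimension(d, m_sq))
```

For a massless field the dimension reduces to $\Delta = d$, and $\Delta$ grows monotonically with $m^2$.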

Now that we have established the rough relationship between fields defined
in the bulk and fields defined on the boundary, we can apply some of these
results to the question of confinement. If we compactify the time direction in a
QCD-like field theory, we expect that the theory will have a phase transition;
for low temperatures (large $\beta$), it should have confinement and a mass gap. For
high temperatures, there should be deconfinement. This is what we find with
anti-de Sitter space, where the time coordinate has period $\beta$.

To analyze the finite temperature behavior of a Euclideanized field theory,
let us start with the topology $S_1 \times S_{d-1}$, where $d$ may be, say, four. $S_1$ represents
the compactified time coordinate. This should represent the topology of the
boundary of some anti-de Sitter space.

In general, there are two anti-de Sitter spaces which fulfill this boundary
condition. The first solution has topology

$$X_1 = S_1 \times R_d. \tag{15.9.6}$$





If we compactify $R_d$ and add the boundary point at $r = \infty$, it can be converted
into the ball $B_d$. (The ball $B_d$ is the interior of a sphere $S_{d-1}$.) So we could
alternatively have represented the topology by $S_1 \times B_d$. We will see that $X_1$
corresponds to the low-temperature region, where we have confinement and a
mass gap. The metric can be represented by

$$ds^2 = (1 + r^2)\,dt^2 + \frac{dr^2}{1 + r^2} + r^2\,d\Omega^2. \tag{15.9.7}$$

The second solution has the topology

$$X_2 = R_2 \times S_{d-1}, \tag{15.9.8}$$

and corresponds to a black hole. If we compactify $R_2$ in the same way, we can
write $X_2 = B_2 \times S_{d-1}$. We see that $X_1$ and $X_2$ will represent two phases of
the same theory, compactified superconformal gauge theory. This metric can
be represented by

$$ds^2 = V\,dt^2 + V^{-1}\,dr^2 + r^2\,d\Omega^2, \tag{15.9.9}$$


where 


V = 1-+ r 2 . (15.9.10) 

r 

Now let us analyze the thermodynamics of these two theories. In the $1/N$
expansion of QCD, we can define the partition function $Z(\beta)$ and the free
energy $F$ via

$$Z(\beta) = \mathrm{Tr}\,e^{-\beta H}, \qquad F = -\ln Z, \tag{15.9.11}$$


where H is the Hamiltonian of the field theory. Since all graphs in the Feynman 
perturbation series can be analyzed for their N dependence, we can solve for 
the N behavior of F. 

Assume, for the moment, that we have confinement. Free gluons and quarks
have disappeared, leaving only color singlets. The mass and multiplicity
of these bound states have therefore lost all dependence on $N$. These
bound states contribute to the free energy with order 1. If we divide these states
by $N^2$ in the thermodynamic sum, they will disappear for large $N$. In fact, after
a division by $N^2$, the only contribution which will survive in the sum is the
ground state (e.g., vacuum diagrams which contribute $N^2$ species of gluons in
the adjoint representation).

For confinement, we therefore have: 


lim 

N—>oo 


HP) 

N 2 



(15.9.12) 


where the contribution to the sum from bound states vanishes and only the
ground state of vacuum diagrams survives. Notice, in particular, that the free
energy, in this limit, is proportional to $\beta$, since the trace is dominated by a
single state, that of lowest energy.
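The claim that $F = -\ln Z$ becomes linear in $\beta$ when a single lowest state dominates the trace can be seen in a toy example. The spectrum below (energies and degeneracies) is invented purely for illustration and is not taken from any gauge theory; the function name is likewise hypothetical.

```python
import math


def free_energy(beta, levels):
    """F = -ln Z with Z = sum_i g_i * exp(-beta * E_i).

    `levels` is a list of (E_i, g_i) pairs. The ground-state Boltzmann
    factor is pulled out of the sum to avoid underflow at large beta.
    """
    e_min = min(energy for energy, _ in levels)
    z_shifted = sum(g * math.exp(-beta * (energy - e_min)) for energy, g in levels)
    return beta * e_min - math.log(z_shifted)


if __name__ == "__main__":
    toy_levels = [(0.5, 1), (1.5, 3), (2.5, 6)]   # hypothetical spectrum
    for beta in (0.1, 1.0, 10.0, 50.0):
        # At large beta the ratio F / beta approaches the lowest energy.
        print(beta, free_energy(beta, toy_levels) / beta)
```

At small $\beta$ all levels contribute and $F$ is far from linear; at large $\beta$ the excited states are exponentially suppressed and $F \approx \beta E_0$.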





Now assume that, for sufficiently high temperatures, the system deconfines.
By asymptotic freedom, we have free quarks and gluons contributing to the
sum over states. Because the number of free species grows as $N^2$, they will
contribute to the sum at order $N^2$. If we consider a conformal theory on
$S_1 \times S_3$, with the two radii given by $\beta$ and $\beta'$, respectively, then the free energy,
because of conformal invariance, is a function of the ratio $\beta/\beta'$. The free energy
is proportional to $N^2$ times the volume of $S_3$, or $N^2(\beta')^3$. We are interested in
the limit $\beta' \to \infty$, so the background has topology $S_1 \times R_3$. But since the free
energy is only a function of $\beta/\beta'$, this also means, as we take the $\beta' \to \infty$
limit, that it is a function of $N^2/\beta^3$. Thus, in the deconfining phase, we have

!< 159 . 13 ) 

Now let us compare this with the thermodynamics of anti-de Sitter space.
The assumption of duality, when applied to the partition function, means

$$Z_{CFT}(M) \sim e^{-I_{AdS}}, \tag{15.9.14}$$

where the left-hand side represents the partition function over the conformal
field theory on a four-dimensional manifold $M$, while the right-hand side
can be approximated by the classical supergravity action taken over anti-de
Sitter space. Although the left-hand side involves a very complicated sum
over all quantum states of the conformal field theory, the right-hand side is
approximated by the classical action.

Let us calculate $e^{-I_{AdS}}$ for the $X_1$ and $X_2$ spaces. (One complication is that,
when performing the thermodynamic sum over states, we must sum over spin
structures as well as bosonic modes. Around $S_1$, spinors may be periodic or
antiperiodic. For $X_1$, we must add the contributions of both spin structures,
i.e., these spin structures contribute to both $\mathrm{Tr}\,e^{-\beta H}$ and $\mathrm{Tr}\,e^{-\beta H}(-1)^F$, where
$F$ is the fermion number.) We notice that in performing the integral, we must
integrate over the volume of the space, which includes $\beta$, the radius of $S_1$. $\beta$ thus occurs
linearly in the summation. Therefore we have $F \sim \beta E$. But this is the same
relationship we found earlier for a confined thermodynamic system. Therefore
we argue that the low-temperature phase, for large $\beta$, exhibits confinement.

For the second space $X_2$, we note that we have integrals over $R_2 \times S_{d-1}$.
(Notice that we must sum over spinors defined over $R_2$. This restricts the sum
to just one spin structure, i.e., only antiperiodic Neveu-Schwarz spinors can
be defined over a disk. $X_2$ contributes to the sum in $\mathrm{Tr}\,e^{-\beta H}$, but not the sum
$\mathrm{Tr}\,e^{-\beta H}(-1)^F$. This means that there will be a phase transition in the first sum,
but not the second, when going from low temperature to high temperature.)
We can show that the large $N$ behavior of the free energy is consistent with
deconfinement. In fact, since we are dealing with the classical action $I_{AdS}$ of
supergravity, we can actually calculate the difference between the classical
actions of $X_1$ and $X_2$ to show the existence of a phase transition as we go to
high temperature.




In summary, using CFT/ADS duality we can sketch the nonperturbative
behavior of gauge theory on $S_1 \times S_3$, which has the desired thermal behavior.


15.10 Summary 

p-Branes, and especially D-branes, have proven to be an integral part of our
analysis of M-theory. There are several ways in which we can represent
D-branes. The simplest is to couple the p-brane to a background supergravity
field


$$S = \frac{1}{2\kappa^2}\int d^D x\,\sqrt{-g}\left(R - \frac{1}{2}(\partial\phi)^2 - \frac{1}{2(p+2)!}\,e^{-a\phi}F_{p+2}^2\right), \tag{15.10.1}$$

where $F_{p+2}$ is the usual antisymmetric field strength corresponding to the field
which couples to the p-brane, and $\phi$ is the usual dilaton.

This can be coupled to the p-brane action. We break up the vector $X^M =
(x^\mu, y^m)$, with $\mu = 0, 1, \ldots, p$ representing the membrane coordinates, and
$m = p + 1, \ldots, 10$. We introduce the variable $X^M(\xi)$, which is now a function
of the variables $\xi$ which parametrize the p-brane world volume

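$$S_p = T\int d^{p+1}\xi\,\Big[-\tfrac{1}{2}\sqrt{-g}\,g^{ij}\,\partial_i X^M\partial_j X^N g_{MN}\,e^{a\phi/(p+1)} + \tfrac{p-1}{2}\sqrt{-g} - \tfrac{1}{(p+1)!}\,\epsilon^{i_1\cdots i_{p+1}}\,\partial_{i_1}X^{M_1}\cdots\partial_{i_{p+1}}X^{M_{p+1}}A_{M_1\cdots M_{p+1}}\Big], \tag{15.10.2}$$

where $g^{ij}$ is the metric on the world volume of the p-brane.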

In 11 dimensions, we are particularly interested in the solution for the
membrane and the 5-brane. The membrane solution is given by

$$ds^2 = \left(1 + \frac{K}{y^6}\right)^{-2/3} dx^\mu dx_\mu + \left(1 + \frac{K}{y^6}\right)^{1/3}(dy^2 + y^2\,d\Omega_7^2), \tag{15.10.3}$$

where $d\Omega_7$ is the volume form for the $S_7$ sphere, and the four-form field strength
is proportional to the dual of the volume form on $S_7$.

We can also construct the action itself for these p-branes. 

Let us introduce

$$\Pi_i^\mu = \partial_i X^\mu - i\bar{\theta}\,\Gamma^\mu\,\partial_i\theta, \tag{15.10.4}$$

where $\theta$ is a spinor defined in D-dimensional space.

Then the first part of the action is given by a simple generalization of the
Nambu-Goto action

$$S_1 = -T\int d^{p+1}\sigma\,\sqrt{-\det \Pi_i \cdot \Pi_j}. \tag{15.10.5}$$



Although this action is globally supersymmetric, it is not locally supersym¬ 
metric. To remedy this, we introduce the term 

h = der^ p de. ( 15 . 10 . 6 ) 

The point of introducing h is that we can now introduce a (p — l)-form b , 
where 


h — db , 


(15.10.7) 


where we demand that dh = 0. 

Then the Wess-Zumino action is given by 

S 2 = -2 T j *b = j d p+x a (15.10.8) 

If we just have the p-brane bosonic variable and $\theta$, then only a limited class
of p-brane actions can be constructed. In particular, we cannot construct the
D-brane action or the 5-brane action with just $X^M$ and $\theta$. To solve this problem,
we must introduce higher tensors on the world volume. In particular, the
5-brane action in 11 dimensions can be constructed if we introduce a tensor field
in addition to these variables. Similarly, the D-brane actions can be written if
we introduce vector fields on the world volume.

Originally, D-branes were introduced to analyze the T duality of bosonic
strings. In conformal language, T duality is implemented via

$$X^\mu(z) + \bar{X}^\mu(\bar{z}) \to X^\mu(z) - \bar{X}^\mu(\bar{z}). \tag{15.10.9}$$

If we apply this to an open bosonic string, we find that the usual Neumann
boundary conditions are transformed into Dirichlet boundary conditions, i.e.,
the endpoints are fixed. In general, if many dimensions are compactified, then
a T-duality transformation will fix the endpoints of the open string to lie on
some hyperplane. Since we expect general covariance to make this hyperplane
curve in space-time, we find that this hyperplane defines a p-brane, which
we christen the D-brane. One can, in fact, define the D-brane as a p-brane on
which open strings can end.

In general, if we compactify $k$ dimensions in a D-dimensional string theory,
T duality leads to a Dp-brane with $p = D - 1 - k$. In general, Type IIA(B)
strings have even (odd) Dp-branes.

Now let us analyze $N$ D-branes and introduce Chan-Paton isospin factors.
In general, open strings can start on one D-brane and end on another, so we
can have a network of $N^2$ strings. Each D-brane has a $U(1)$ vector field, so we
have $U(1)^N$ symmetry.

But now let these $N$ D-branes become coincident. Then the mass operator
$M^2$ picks up additional massless vector states. These extra vector states, in
fact, allow us to generalize the $U(1)^N$ symmetry group to $U(N)$.
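The enhancement from $U(1)^N$ to $U(N)$ can be illustrated by counting massless vectors. The sketch below assumes the standard result, not derived in this summary, that the lightest state of an oriented open string stretched between branes at transverse positions $x_i$ and $x_j$ has mass $|x_i - x_j|/(2\pi\alpha')$. The branes are taken to be separated along a single transverse direction for simplicity, and the function name is illustrative.

```python
import math


def massless_vector_count(positions, alpha_p=1.0, tol=1e-12):
    """Count oriented open strings (i, j) whose lightest vector state is massless.

    Assumption: mass = |x_i - x_j| / (2 * pi * alpha') for a string stretched
    between branes i and j along one transverse direction.
    """
    count = 0
    for x_i in positions:
        for x_j in positions:
            mass = abs(x_i - x_j) / (2.0 * math.pi * alpha_p)
            if mass < tol:
                count += 1
    return count


if __name__ == "__main__":
    print(massless_vector_count([0.0, 1.0, 2.5]))   # separated branes
    print(massless_vector_count([0.0, 0.0, 3.0]))   # two of three coincide
    print(massless_vector_count([1.0, 1.0, 1.0]))   # all coincident
```

For $N$ separated branes only the $N$ strings beginning and ending on the same brane are massless, matching $U(1)^N$. When all branes coincide, all $N^2$ oriented strings are massless, which is the dimension of $U(N)$.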





To see this in more detail, we can start with the coupling of the string X p to 
a background vector field A„, treating the system like a sigma model 

S= j dsJ2A m (x°,...,x p )d t X m 

J m= 0 

/ 25 

ds A i {x\...,x p )d n X i , (15.10.10) 

/'=/>+1 

where the massless vector field only depends on $x^0,\ldots,x^p$, the coordinates 
describing the p-brane. We compactify the coordinates labeled by $i$. One 
important point is that $A_i$ describes the fluctuations transverse to the brane. 
Because of this, the p-brane is dynamical, rather than being a fixed hyperplane. 
As before, one integrates out the higher string modes, imposing the beta 
function relation $\beta = 0$. This, in turn, gives us the equations of motion of the 
background fields, including the $U(1)$ field. By explicitly performing all these 
steps, we find the D-brane action given by the Dirac-Born-Infeld action 

$$ T \int d^{p+1}\sigma\, \sqrt{-\det\left(G_{ij} + F_{ij} - B_{ij}\right)}, \tag{15.10.11} $$

where G and B are the usual terms formed from the pull-back to the membrane 
surface, i.e., $G_{ij} = \partial_i f^\mu\, \partial_j f^\nu\, g_{\mu\nu}$, and where $f^\mu$ represents the value of 
$X^\mu$ at the boundary of the p-brane for the Dirichlet conditions. From this, we 
can show, for example, that the low-energy limit of a 3-brane in the Type IIB 
theory yields a gauge theory defined on the four-dimensional world volume. 
This theory, in fact, is just the N = 4 super-Yang-Mills theory. 

There are many applications of D-branes to M-theory. First, we notice that 
in the infinite momentum frame, the D0-branes dominate the BPS algebra. 
More specifically, we find that M-theory in the infinite momentum frame is 
equivalent to the $N \to \infty$ limit of N coincident D0-branes, given by $U(N)$ 
super-Yang-Mills theory (defined in 10 dimensions). 

This is the conjecture behind M(atrix)-theory, which gives us a nonperturbative 
definition of M-theory in terms of 10-dimensional D0-branes in the 
infinite momentum frame (where all the other contributions to M-theory are 
vanishingly small). 

Checks of matrix models support this conjecture. For example, the scattering 
of 11-dimensional supergravitons can be simulated by the scattering of 
10-dimensional D0-branes in the $N \to \infty$ limit. In addition, for fixed N, we can 
write a relationship between M-theory defined in the discrete light cone limit 
and D0-branes. 

We can also use D-branes to calculate the entropy of black holes. If we start 
with a Type IIB theory, we have D1-branes and D5-branes. If we compactify 
five dimensions, and calculate the 1-brane states connecting the D1-brane and 
D5-brane, then the excitations can be counted by calculating the excitations of 
an ordinary string. Thus, the entropy of a black hole can simply be calculated 
by taking the logarithm of the partition function of an ordinary string, which 
is well known. This calculation gives a statistical mechanical derivation of the 
Bekenstein-Hawking entropy formula. 
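
The "logarithm of the partition function" step can be illustrated with a short sketch. It assumes two standard results that are not stated in this summary: the Cardy asymptotic growth of states, $S \approx 2\pi\sqrt{cn/6}$, for a string with central charge c at excitation level n, and the value $c = 6 Q_1 Q_5$ for the effective string of $Q_1$ D1-branes and $Q_5$ D5-branes. The function names are ours.

```python
import math


def cardy_entropy(central_charge: float, level: int) -> float:
    """Leading entropy of a string at excitation level n (Cardy growth, assumed)."""
    return 2.0 * math.pi * math.sqrt(central_charge * level / 6.0)


def d1_d5_entropy(q1: int, q5: int, level: int) -> float:
    """Entropy of the D1-D5 effective string, assuming c = 6 * Q1 * Q5."""
    return cardy_entropy(6.0 * q1 * q5, level)


if __name__ == "__main__":
    # Under these assumptions the entropy scales as sqrt(Q1 * Q5 * n).
    print(d1_d5_entropy(2, 3, 4))
```

Under these assumptions the entropy reduces to $2\pi\sqrt{Q_1 Q_5 n}$, which is the form that is compared with the area of the black hole horizon.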

Lastly, another application of D-branes is to anti-de Sitter duality. Maldacena 
conjectured that the strong coupling limit of N = 4 super-Yang-Mills 
theory in four dimensions is dual to the weak coupling limit of 10-dimensional 
superstring theory. This remarkable theory links the nonperturbative region of 
gauge theories like QCD to classical supergravity. 

We start with the metric for N coincident Type IIB p-branes 


$$ ds^2 = f^{-1/2}\left(-dt^2 + dx\cdot dx\right) + f^{1/2}\left(dr^2 + r^2\, d\Omega^2_{8-p}\right), \tag{15.10.12} $$

where 

$$ f = 1 + \left(\frac{R}{r}\right)^{7-p}, \qquad U = \frac{r}{\alpha'}. \tag{15.10.13} $$


Now, the crucial step in this analysis is to go to the near-horizon limit of the 
geometry. We can go to $r \to 0$ by setting $\alpha' \to 0$ and holding U fixed. When 
this is done, the 1 appearing in f can be removed, and the metric simplifies 
considerably. By setting p = 3, we find 


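$$ ds^2 = \alpha'\left[\frac{U^2}{\sqrt{\lambda}}\left(-dt^2 + dx\cdot dx\right) + \sqrt{\lambda}\,U^{-2}\,dU^2 + \sqrt{\lambda}\,d\Omega_5^2\right], \tag{15.10.14} $$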


where $\lambda = g_{YM}^2 N$. Notice that the string coupling is a constant, $e^{\phi} = g_s$. 

Now, let us make one last change of variables, $z = \sqrt{\lambda}/U$. The metric takes 
the final form 




$$ ds^2 = \alpha'\sqrt{\lambda}\left[\frac{1}{z^2}\left(-dt^2 + dx\cdot dx + dz^2\right) + d\Omega_5^2\right]. \tag{15.10.15} $$


The common factor in front of the equation, given by $\alpha'\sqrt{\lambda}$, means that the 
radii of the two spaces are the same, $R = \lambda^{1/4}(\alpha')^{1/2}$. 

Notice that this is the metric for $AdS_5 \times S^5$. 
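
For the reader who wishes to check the change of variables leading from (15.10.14) to (15.10.15), the intermediate steps can be sketched as follows. This is only a worked substitution using the definitions already given ($z = \sqrt{\lambda}/U$ and $\lambda = g_{YM}^2 N$); the notation $dx \cdot dx$ for the spatial world volume directions is our shorthand.

```latex
\begin{align*}
U &= \frac{\sqrt{\lambda}}{z}, \qquad dU = -\frac{\sqrt{\lambda}}{z^{2}}\,dz, \\
\frac{U^{2}}{\sqrt{\lambda}} &= \frac{\sqrt{\lambda}}{z^{2}}, \qquad
\sqrt{\lambda}\,U^{-2}\,dU^{2}
  = \sqrt{\lambda}\,\frac{z^{2}}{\lambda}\,\frac{\lambda\,dz^{2}}{z^{4}}
  = \sqrt{\lambda}\,\frac{dz^{2}}{z^{2}}, \\
ds^{2} &= \alpha'\sqrt{\lambda}\left[\frac{-dt^{2} + dx\cdot dx + dz^{2}}{z^{2}} + d\Omega_{5}^{2}\right],
\qquad R^{2} = \alpha'\sqrt{\lambda}.
\end{align*}
```

The first factor in the bracket is the Poincare form of anti-de Sitter space and the second is the unit five-sphere, so both spaces share the radius $R = \lambda^{1/4}(\alpha')^{1/2}$.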

But since the Type IIB 3-brane has a D-brane action given by four-dimensional 
super-Yang-Mills theory, we now have established a link between 
the strong coupling limit of N = 4 super-Yang-Mills theory in four dimensions 
and classical supergravity in 10 dimensions. 

Similarly, the D = 11 supergravity 5-brane on $AdS_7 \times S^4$ is dual to a six-dimensional 
supergauge theory with (0, 2) supersymmetry. And the D = 11 
supergravity membrane on $AdS_4 \times S^7$ is dual to a three-dimensional supergauge 
theory. 





To check the correctness of this rather unusual duality, we must count states. 
For the $AdS_5 \times S^5$ duality, we can count states by arranging them in 
supermultiplets. The isometry group of this manifold is $SO(4,2) \times SO(6)$, which 
in turn is a subgroup of $OSp(4|8)$. Thus, the states of this four-dimensional 
theory form representations of $OSp(4|8)$. In addition, these states live on the 
four-dimensional boundary of $AdS_5$. 

To establish the link between the states living on the boundary and states 
living in the bulk of anti-de Sitter space, we make the observation that the 
masses of the states which propagate in the bulk of anti-de Sitter space appear 
in the Green’s function of states which live exclusively on the boundary of the 
anti-de Sitter space. Specifically, we find that the masses of these states defined 
in the bulk appear in the dimension of the states defined on the boundary. 

For example, we find that the Green’s function between two states in the 
bulk is 


$$ K = \frac{x_0^{\,d+\lambda_+}}{\left(x_0^2 + |x|^2\right)^{d+\lambda_+}}, \tag{15.10.16} $$


which in turn is reflected in the correlation function between states defined on 
the boundary, written in terms of the dimension of the fields 


$$ \langle \mathcal{O}(x)\,\mathcal{O}(x') \rangle \sim \frac{1}{|x - x'|^{2\Delta}}. \tag{15.10.17} $$

In this fashion, we can read off the relationship between the mass of the particle 
moving in the bulk and the dimension of the field living on the boundary 


$$ \Delta = \tfrac{1}{2}\left(d + \sqrt{d^2 + 4m^2}\right). \tag{15.10.18} $$

Since we know the conformal dimensions of the fields on anti-de Sitter 
space, we can check this equation, and we find agreement [30-33]. 
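
The mass-dimension relation (15.10.18) is easy to tabulate. The sketch below evaluates it for a few values of $m^2$, measured in units of the anti-de Sitter radius (an assumption about units). The function name is ours, and the check that the square root is real is an added safeguard rather than part of the discussion above.

```python
import math


def conformal_dimension(mass_squared: float, d: int = 4) -> float:
    """Boundary dimension from Delta = (d + sqrt(d^2 + 4 m^2)) / 2.

    mass_squared is the bulk mass squared in units of the AdS radius (assumed).
    """
    discriminant = d * d + 4.0 * mass_squared
    if discriminant < 0.0:
        # Added safeguard: the square root must be real for Delta to be real.
        raise ValueError("d^2 + 4 m^2 must be non-negative")
    return 0.5 * (d + math.sqrt(discriminant))


if __name__ == "__main__":
    for m2 in (-4.0, -3.0, 0.0, 12.0):
        print(f"m^2 = {m2:5.1f}  ->  Delta = {conformal_dimension(m2)}")
```

For d = 4, a massless bulk field corresponds to a boundary operator of dimension 4, while $m^2 = -3$ and $m^2 = -4$ give dimensions 3 and 2.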

We can also use this formalism to indicate that gauge theory (without 
supersymmetry) should exhibit confinement and a mass gap. We can compactify 
some of the dimensions, breaking supersymmetry in the process. Then we can 
calculate the free energy $F(\beta)$ of the resulting theory using CFT/ADS duality, 
and we find qualitative agreement with the expected picture: confinement at 
low energy and deconfinement at high energy. 


15.11 Conclusion 

String theory has gone through a series of incarnations, and may undergo 
several more before its true nature is revealed. It was born in the late 1960s when 
the Veneziano-Suzuki formula emerged as a promising but mysterious way 
to describe hadron-hadron collisions. Soon, Nambu showed that a vibrating 
string lay at the center of this remarkable formalism. Sadly, the theory died in 
the early 1970s with the rise of QCD as the most viable candidate for a theory 
of strong interactions. Even the proposal by Scherk and Schwarz to reinterpret 
it as a theory of quantum gravity could not revive the theory. 

The theory was reborn in the mid 1980s, when the work of Green, Schwarz, 
and Witten showed that the theory was anomaly free and provided a remarkable 
formalism in which to unite all fundamental interactions. Within a short period 
of time, millions of classical vacua of string theory were found, some of them 
remarkably close to our physical world. This gave rise to speculation that the 
“theory of everything” was close at hand. However, the failure to find any 
perturbative vacua that described the Standard Model was a great source of 
disappointment. 

In the late 1990s, the application of duality to string theory created a 
wealth of new information concerning the nonperturbative behavior of string 
theory, allowing one to unify all five superstring theories into a single theory. 
The discovery of Witten and Townsend that a new, mysterious theory, called 
M-theory, lurks in the eleventh dimension, which includes membranes and 
5-branes, has revealed the richness and unexpected complexity of string theory. 
String theory, in some sense, is no longer a proper name to describe this wealth 
of p-branes emerging from the nonperturbative realm. 

Given the remarkable twists and turns of the past, it is impossible to say 
where string theory will go in the next decade. It seems that the more we know 
about string theory or M-theory, the more beautiful and intricate it becomes. 

Ramond draws a parallel with archeologists who are searching the desert 
for artifacts and accidentally stumble upon a tiny, shiny pebble. When they 
carefully brush away the sand from the pebble, they find that it is not a pebble 
at all, but the tip of a colossal pyramid. Excitedly, the archeologists remove the 
sand and dirt, revealing a complex and rich network of tunnels, secret chambers, 
and hidden rooms. However, with each layer of sand they remove, they find 
even more layers and riches. Where will it all end, they ask themselves? Finally, 
after years of effort, they have excavated what appears to be the ground 
floor of the pyramid. They find what appears to be the entrance to the pyramid. 
With trepidation and excitement, they open the door. 

Are we, like the archeologists, about to open the door marked “duality”? 
Or will we find yet another layer beneath the present one? Only time will tell. 

Witten likes to draw the parallel with Einstein’s theory of general relativity. 
The essential physical principle behind general relativity is the Equivalence 
Principle. This principle lies at the heart of all its remarkable power and 
elegance. The question is: what is the counterpart of the Equivalence Principle 
for string theory? 


References 


1. For a review, see M. J. Duff, R. R. Khuri, and J. X. Lu, Phys. Rep. 259, 213 (1995). 

2. M. J. Duff and K. Stelle, Phys. Lett. B253, 113 (1991). 





3. R. Gueven, Phys. Lett. B276, 49 (1992). 

4. A. Achucarro, J. Evans, P. Townsend, and D. Wiltshire, Phys. Lett. 198B, 441 
(1987). 

5. E. Bergshoeff, E. Sezgin, and P. K. Townsend, Phys. Lett. B189, 75 (1987); 
Ann. of Phys. 185, 300 (1988). 

6. M. Aganagic, J. Park, C. Popescu, and J. H. Schwarz, hep-th/9701166. 

7. P. Pasti, D. Sorokin, and M. Tonin, Phys. Rev. D52, 4277 (1995); hep-th/9701149. 

8. J. Dai, R. G. Leigh, and J. Polchinski, Mod. Phys. Lett. A4, 2073 (1989). 

9. R. G. Leigh, Mod. Phys. Lett. A4, 2767 (1989). 

10. E. Witten, Nucl. Phys. B460, 335 (1995). 

11. P. K. Townsend, Phys. Lett. B373, 68 (1996). 

12. C. Schmidhuber, Nucl. Phys. B467, 146 (1996). 

13. T. Banks, W. Fischler, S. H. Shenker, and L. Susskind, Phys. Rev. D55, 5112 (1997). 

14. For a review of matrix models, see A. Bilal, hep-th/9710136. 

15. U. H. Danielsson, G. Ferretti, and B. Sundborg, hep-th/9603081 (B.8). 

16. D. Kabat and P. Pouliot, hep-th/9603127 (B.9). 

17. M. R. Douglas, D. Kabat, P. Pouliot, and S. Shenker, hep-th/9608024. 

18. K. Becker and M. Becker, hep-th/9705091. 

19. B. de Wit, M. Luscher, and H. Nicolai, Nucl. Phys. B305 [FS23], 545 (1988). 

20. J. Bekenstein, Lett. Nuovo Cimento 4, 737 (1972); Phys. Rev. D7, 2333 (1973); 
Phys. Rev. D9, 3292 (1974); Phys. Rev. D12, 3077 (1975). 

21. S. Hawking, Nature, 248, 30 (1974); Comm. Math. Phys. 43, 199 (1975); Phys. 
Rev. D13, 191 (1976). 

22. C. Vafa and A. Strominger, Phys. Lett. B383, 44 (1996). 

23. G. Horowitz, gr-qc/9604051. 

24. J. M. Maldacena and A. Strominger, hep-th/9603060. 

25. J. M. Maldacena, “The Large N Limit of Superconformal Field Theories from 
Supergravity” hep-th/9711200. 

26. P. A. M. Dirac, J. Math. Phys. 4, 901 (1963). 

27. C. Fronsdal, Phys. Rev. D26, 1988 (1982). 

28. M. Flato and C. Fronsdal, J. Math. Phys. 22, 1100 (1981). 

29. W. Heidenreich, Phys. Lett. B110, 461 (1982). 

30. E. Witten, “Anti-de Sitter Space and Holography,” hep-th/9802150. 

31. E. Witten, “Anti-de Sitter Space, Thermal Phase Transition, and Confinement in 
Gauge Theories,” hep-th/9803131. 

32. S. S. Gubser, I. R. Klebanov, and A. M. Polyakov, “Gauge Theory Correlators 
from Noncritical String Theory,” hep-th/9802109. 

33. C. Csaki, H. Ooguri, Y. Oz, and J. Terning, “Glueball Mass Spectrum from 
Supergravity,” hep-th/9806021. 


